DATASCIENCE WITH S-A-S & PYTHON

2546 Reviews
4.8/5
1880 Learners

Overview

Learn Data Science with S-A-S  . 25+  Live Projects 

Duration : 100 Hrs | 5 Major Projects | 10 Minor Projects | 100 + Assignments

Data Sets , installations , Interview Preparations , Repeat the session until 6 months are all attractions of this particular course

Trainer :- Experienced Data Science Consultant

Watch
INTRO VIDEO

Why Radical Technologies

  • Highly practical oriented training
  • 25000+ Man-hours of Real-time projects & scenarios
  • 10 to 20+ year Experienced corporate trainers With Real Time Experience.
  • Building up professionals by highly experienced professionals
  • 100 % quality assurance in training .
  • 10000+ Placement Records and 180+ MNC’s and Consultancies Tie up

Check Batch Schedulings

AUDIENCE

  • Engineering/Management Graduate or Post-graduate Fresher Students who want to make their career in Data Science Industry or want to be future Data Scientist.
  • Engineers who want to use a distributed computing engine for batch or stream processing or both
  • Analysts who want to leverage Spark for analyzing interesting datasets
  • Data Scientists who want a single engine for analyzing and modelling data 
  • MBA Graduates or business professionals who are looking to move to a heavily quantitative role.
  • Engineering Graduate/Professionals who want to understand basic statistics and lay a foundation for a career in Data Science
  • Working Professional or Fresh Graduate who have mostly worked in Descriptive analytics or not work anywhere and want to make the shift to being  data scientists
  • Professionals who’ve worked mostly with tools like Excel and want to learn how to use R for statistical analysis.

Course Curriculum

Course description

Red Hat System Administration I (RH124) equips you with Linux® administration “survival skills” by focusing on foundational Linux concepts and core tasks. You will learn how to apply command-line concepts and enterprise-level tools, starting you on your journey toward becoming a full-time Linux system administrator. This path continues with the follow-on course, Red Hat System Administration II (RH134).

Want to be Future Data Scientist

Introduction:  This course does not require a prior quantitative or mathematics background. It starts by introducing basic concepts such as the mean, median mode etc. and eventually covers all aspects of an analytics (or) data science career from analysing and preparing raw data to visualizing your findings. If you’re a programmer or a fresh graduate looking to switch into an exciting new career track, or a data analyst looking to make the transition into the tech industry – this course will teach you the basic to Advance techniques used by real-world industry data scientists.

Data Science, Statistics with S-A-S This course starts with  Data Science and Statistics using Python and then complete knowledge of Data Science with S-A-S. It covers both the  aspects of Statistical concepts and the practical implementation using  S-A-S. If you’re new to Programming , don’t worry – the course starts with a crash course to teach you all basic programming concepts. If you’ve done some programming before or you are new in Programming, you should pick it up quickly. This course shows you how to get set up on Microsoft Windows-based PC’s; the sample code will also run on Mac OS or Linux desktop systems.

Analytics: Using S-A-S  you can analyze and explore your data in an interactive environment with fast feedback. The course will show how to leverage the power of RDDs and Data frames to manipulate data with ease.

Real life examples: Every concept is explained with the help of examples, case studies and source code wherever necessary. The examples cover a wide array of topics and range from A/B testing in an Internet company context to the Capital Asset Pricing Model in a  finance context.  

Course Content

Introduction To Analytics

  • Analytics World
    • Introduction to Analytics
    • Concept of ETL
    • S-A-S in advanced analytics
  • Global Certification: Induction and walk through
    • Getting Started
    • Software installation
    • Introduction to GUI
    • Different components of the language
    • All programming windows
    • Concept of Libraries and Creating Libraries
    • Variable Attributes – (Name, Type, Length, Format, In format, Label)
    • Importing Data and Entering data manually
  • Understanding Datasets
    • Descriptor Portion of a Dataset (Proc Contents)
    • Data Portion of a Dataset
    • Variable Names and Values
    • Data Libraries

 Base S-A-S – Accessing The Data

  • Understanding Data Step Processing
    • Data Step and Proc Step
    • Data step execution
    • Compilation and execution phase
    • Input buffer and concept of PDV
  • Importing Raw Data Files
    • Column Input and List Input and Formatted methods
    • Delimiters, Reading missing and non standard values
    • Reading one to many and many to one records
    • Reading Hierarchical files
    • Creating raw data files and put statement
    • Formats / Informat
  • Importing and Exporting Data (Fixed Format / Delimited)
  • Proc Import / Delimited text files
  • Proc Export / Exporting Data
  • Datalines / Cards;
  • Atypical importing cases (mixing different style of inputs)
    • Reading Multiple Records per Observation
    • Reading “Mixed Record Types”
    • Sub-setting from a Raw Data File
    • Multiple Observations per Record
    • Reading Hierarchical Files
    •  
  • Concept of SAS library and SAS Catalog
  • Variable Types in SAS
  • Reading Data stored external to SAS
  • Importing Data by using Proc Import
  • Data Step SAS statements
  • SAS Functions
  • Appending and Merging using SAS
  • SAS Procedures like proc means, proc Univariate, proc append, proc freq and proc export.
  • SAS SQL
  • SAS Macros

Hypothesis Testing and ANOVA

  • One Sample t-test of comparing means
  • Two Sample t-test of comparing means
  • One Way ANOVA
  • Assumptions of ANOVA Modeling
  • n-way ANOVA
  • ANOVA Post Hoc Studies

Measure Model Performance

  • Apply the principles of honest assessment to model performance measurement
  • Assess classifier performance using the confusion matrix
  • Model selection and validation using training and validation data
  • Create and interpret graphs (ROC, lift, and gains charts) for model comparison and selection
  • Establish effective decision cut-off values for scoring

 Data Understanding, Managing And Manipulation

  • Understanding and Exploration Data
    • Introduction to basic Procedures – Proc Contents, Proc Print
  • Understanding and Exploration Data
    • Operators and Operands
    • Conditional Statements (Where, If, If then Else, If then Do and select when)
    • Difference between WHERE and IF statements and limitation of WHERE statements
    • Labels, Commenting
    • System Options (OBS, FSTOBS, NOOBS etc…)
  • Data Manipulation
    • Proc Sort – with options / De-Duping
    • Accumulator variable and By-Group processing
    • Explicit Output Statements
    • Nesting Do loops
    • Do While and Do Until Statement
    • Array elements and Range
  • Combining Datasets (Appending and Merging)
    • Concatenation
    • Interleaving
    • Proc Append
    • One To One Merging
    • Match Merging
    • IN = Controlling merge and Indicator

 Data Mining With Proc SQL

  • Introduction to Databases
  • Introduction to Proc SQL
  • Basics of General SQL language
  • Creating table and Inserting Values
  • Retrieve & Summarize data
  • Group, Sort & Filter
  • Using Joins (Full, Inner, Left, Right and Outer)
  • Reporting and summary analysis
  • Concept of Indexes and creating Indexes (simple and composite)
  • Connecting S-A-S to external Databases
  • Implicit and Explicit pass through methods

 Macros For Automation

  • Macro Parameters and Variables
  • Different types of Macro Creation
  • Defining and calling a macro
  • Using call Symput and Symget
  • Macros options (mprint symbolgen mlogic merror serror)

 Fundamental Of Statistics

  • Basic Statistics – Measures of Central Tendencies and Variance
  • Building blocks – Probability Distributions – Normal distribution – Central Limit Theorem
  • Inferential Statistics -Sampling – Concept of Hypothesis Testing
  • Statistical Methods – Z/t-tests( One sample, independent, paired), Anova, Correlations and Chi-square    
  • Levels of Measurement and Variable types
  • Descriptive Statistics and Picturing Distributions
  • Confidence Interval for the Mean

Introduction To Predictive Modelling

  • Introduction to Predictive Modeling
  • Types of Business problems – Mapping of Techniques
  • Different Phases of Predictive Modeling

 Data Preparation

  • Need of Data preparation
  • Data Audit Report and Its importance
  • Consolidation/Aggregation – Outlier treatment – Flat Liners – Missing values- Dummy creation – Variable Reduction
  • Variable Reduction Techniques – Factor & PCA Analysis

 Segmentation

  • Introduction to Segmentation
  • Types of Segmentation (Subjective Vs Objective, Heuristic Vs. Statistical)
  • Heuristic Segmentation Techniques (Value Based, RFM Segmentation and Life Stage Segmentation)
  • Behavioural Segmentation Techniques (K-Means Cluster Analysis)
  • Cluster evaluation and profiling
  • Interpretation of results – Implementation on new data

 Linear Regression

  • Introduction – Applications
  • Assumptions of Linear Regression
  • Building Linear Regression Model
  • Understanding standard metrics (Variable significance, R-square/Adjusted R-square, Global hypothesis ,etc)
  • Validation of Models (Re running Vs. Scoring)
  • Standard Business Outputs (Decile Analysis, Error distribution (histogram), Model equation, drivers etc.)
  • Interpretation of Results – Business Validation – Implementation on new data

 Logistic Regression

  • Introduction – Applications
  • Linear Regression Vs. Logistic Regression Vs. Generalized Linear Models
  • Building Logistic Regression Model
  • Understanding standard model metrics (Concordance, Variable significance, Hosmer Lemeshov Test, Gini, KS, Misclassification, etc)
  • Validation of Logistic Regression Models (Re running Vs. Scoring)
  • Standard Business Outputs (Decile Analysis, ROC Curve,
    Probability Cut-offs, Lift charts, Model equation, Drivers, etc)
  • Interpretation of Results – Business Validation -Implementation on new data

 Time Series Forecasting

  • Introduction – Applications
  • Time Series Components( Trend, Seasonality, Cyclicity and Level) and Decomposition
  • Classification of Techniques(Pattern based – Pattern less)
  • Basic Techniques – Averages, Smoothening, etc
  • Advanced Techniques – AR Models, ARIMA, etc
  • Understanding Forecasting Accuracy – MAPE, MAD, MSE, etc

 Introduction To Machine Learning

  • Statistical learning vs. Machine learning
  • Major Classes of Learning Algorithms -Supervised vs Unsupervised Learning
  • Concept of Overfitting and Under fitting (Bias-Variance Trade off) & Performance Metrics
  • Types of Cross validation(Train & Test, Bootstrapping, K-Fold validation etc)

 Regression & Classification Model Building

  • Recursive Partitioning(Decision Trees)
  • Ensemble Models(Random Forest, Bagging & Boosting)
  • K-Nearest neighbours

Training Options

Live Online Training

  • Highly practical oriented training
  • Installation of Software On your System
  • 24/7 Email and Phone Support
  • 100% Placement Assistance until you get placed
  • Global Certification Preparation
  • Trainer Student Interactive Portal
  • Assignments and Projects Guided by Mentors
  • And Many More Features

Course completion certificate and Global Certifications are part of our all Master Program

Live Classroom Training

  • Weekend / Weekdays / Morning / Evening Batches
  • 80:20 Practical and Theory Ratio
  • Real-life Case Studies
  • Easy Coverup if you missed any sessions
  • PSI | Kryterion | Redhat Test Centers
  • Life Time Video Classroom Access ( coming soon )
  • Resume Preparations and Mock Interviews
  • And Many More Features

Course completion certificate and Global Certifications are part of our all Master Program

Exam & Certification

Course Reviews

I had a wonderful experience in Radical technologies where i did training in Hadoop development under the guidance of Shanit Sir. He started from the very basic and covered and shared everything he knew in this field. He was brilliant and had a lot of experience in this field. We did hands on for every topic we covered, and that’s the most important thing because honestly theoretical knowledge cannot land you a job.
Rohit Agrawal Hadoop
I have recently completed Linux course under Anand Sir and can assuredly say that it is definitely the best Linux course in Pune. Since most of the Linux courses from other sources are strictly focused on clearing the certification, they will not provide an insight into real-world server administration, but that is not the case with Anand Sir’s course. Anand Sir being an experienced IT infrastructure professional has an excellent understanding of how a data center works and all these information is seamlessly integrated into his classes.
Manu Sunil Linux
I had undergone oracle DBA course under Chetan sir’s Guidance an it was a very good learning experience overall since they not only provide us with theoretical knowledge but also conduct lot of practical sessions which are really fruitful and also the way of teaching is very fine clear and crisp which is easier to understand , overall I had a great time for around 2 months , they really train you well.also make it a point to clear all your doubts and provide you with clear and in-depth concepts hence hope to join sometime again
Reema banerjee Oracle DBA
I have completed Oracle DBA 11g from Radical technology pune. Excellent trainer (chetna gupta ). The trainer kept the energy level up and kept us interested throughout. Very practical, hands on experience. Gave us real-time examples, excellent tips and hints. It was a great experience with Radical technologies.
Mrudul Bhokare Oracle DBA
Linux learning with Anand sir is truly different experience… I don’t have any idea about Linux and system but Anand sir taught with scratch…He has a great knowledge and the best trainer…he can solve all your queries related to Linux in very simple way and giving nice examples… 100 🌟 to Anand Sir.
Harsh Singh Parihar Linux
Prev
Next

Why we are the best Radical Technologies

Radical Technologies is truly progressing and offer best possible services. And recognition towards Radical Technologies is increasing steeply as the demand is growing rapidly.

Creative

0%

Innovative

0%

Student Friendly

0%

Practical Oriented

0%

Valued Certification

0%

Training FAQs

Similar Courses

ENQUIRE NOW
[]
1 Step 1
keyboard_arrow_leftPrevious
Nextkeyboard_arrow_right
FormCraft - WordPress form builder