Data Science with R

Course Overview

R is a powerful language used widely for data analysis and statistical computing. In this course, you will be able to learn data science using R. By the end of this course, you will have good exposure to building predictive models using machine learning on your own.

At the end of the training, participants will be able to:

  1. Explain the concept of Business Decisions and Analytics
  2. Understand the R Programming
  3. Describe Data Structures
  4. Discuss Data Visualization
  5. Understand Statistics
  6. Explain the Regression Analysis
  7. Describe Classification 
  8. Explain Clustering
  9. Understand Association

Pre-requisite

Some knowledge of programming and statistics

Duration

3 days

Course Outline

  1. Business Decisions
  2. Business Analysis
  3. Types of Analytics
  4. Applications of Business Analytics
  1. R for Data Analytics
  2. Steps to perform Data Analysis in R
  3. Basic Syntax in R
  4. Data Types and Variables
  5. Operators
  6. Conditional Statements
  7. Loops
  8. Loop Control Statements
  9. Functions
  10. Components of a Function
  11. Built-in Functions
  1. Steps for working with data
  2. Identifying Data Structures
  3. Assigning values to Data Structures
  4. Manipulating Data
  1. Introduction
  2. Graphics used for Data Visualization
  3. ggplot2
  4. File formats of graphic outputs
  1. What is Hypothesis?
  2. Types of Hypothesis
  3. Types of Data Sampling
  4. Types of Errors
  5. Confidence Level
  6. Critical Region
  7. Level of Significance
  1. Accordion Content
  2. Hypothesis Test
  3. Types of Hypothesis Test
  4. Hypothesis Test about population means
  5. Hypothesis Test about population Variance
  6. Hypothesis Test about population Proportions
  1. What is Regression Analysis?
  2. Types of Regression Models
  3. Linear Regression
  4. Non-linear Regression
  5. Non-linear to Linear Models
  6. Principle Component Analysis
  1. What is Classification
  2. Types of Classification
  3. Logistic Regression
  4. Support Vector Machines
  5. K-Nearest Neighbors(KNN)
  6. Naïve Bayes Classifier
  7. Decision Trees
  8. Random Forest Classification
  9. Evaluating Classifier Models
  1. Clustering and its applications
  2. Clustering Methods
  3. K-means Clustering
  4. Hierarchical Clustering
  5. Density-based Clustering
  1. Association Rules
  2. Apriori Algorithm

Hadoop Administration

Reviews