MOC 20773 A: Analyzing Big Data with Microsoft R

PT16726
Training Summary
This course teaches students to use Microsoft R Server to create and run analyses on large datasets, and to apply it in Big Data environments such as a Hadoop or Spark cluster or a SQL Server database.
Prerequisites
In addition to their professional experience, students who attend this course should have:
  • Programming experience using R, and familiarity with common R packages.
  • Knowledge of common statistical methods and data analysis best practices.
  • Basic knowledge of the Microsoft Windows operating system and its core functionality.
  • Working knowledge of relational databases.
Duration
3 Days/Lecture & Lab
Audience
The primary audience for this course is people who wish to analyze large datasets within a big data environment. The secondary audience is developers who need to integrate R analyses into their solutions.
Course Topics
  • Microsoft R Server and R Client
  • Exploring Big Data
  • Visualizing Big Data
  • Processing Big Data
  • Parallelizing Analysis Operations
  • Creating and Evaluating Regression Models
  • Creating and Evaluating Partitioning Models
  • Processing Big Data in SQL Server and Hadoop
