Loading Course Schedule...
PT16726
Training Summary
The main purpose of the course is to give students the ability to use Microsoft R Server to create and run an analysis on a large dataset, and show how to utilize it in Big Data environments, such as a Hadoop or Spark cluster, or a SQL Server database.
Prerequisites
In addition to their professional experience, students who attend this course should have:
- Programming experience using R, and familiarity with common R packages
- Knowledge of common statistical methods and data analysis best practices.
- Basic knowledge of the Microsoft Windows operating system and its core functionality.
- Working knowledge of relational databases.
Duration
3 Days/Lecture & Lab
Audience
The primary audience for this course is people who wish to analyze large datasets within a big data environment. The secondary audience is developers who need to integrate R analyses into their solutions.
Course Topics
Microsoft R Server and R Client
- Exploring Big Data
- Visualizing Big Data
- Processing Big Data
- Parallelizing Analysis Operations
- Creating and Evaluating Regression Models
- Creating and Evaluating Partitioning Models
- Processing Big Data in SQL Server and Hadoop