MOC 20775 A: Performing Data Engineering on Microsoft HD Insight

Catalog Home Databases, Business Intelligence & Data Science Data Science

MOC 20775 A: Performing Data Engineering on Microsoft HD Insight

  Add Course to watch list
  View full course outline
  Request in your area
Instructor Led Self-Paced eLearning

The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

In addition to their professional experience, students who attend this course should have:

  • Programming experience using R, and familiarity with common R packages
  • Knowledge of common statistical methods and data analysis best practices.
  • Basic knowledge of the Microsoft Windows operating system and its core functionality.
  • Working knowledge of relational databases.

5 Days/Lecture & Lab

The primary audience for this course is data engineers, data architects, data scientists, and data developers who plan to implement big data engineering workflows on HDInsight.

Getting Started with HDInsight

  • Deploying HDInsight Clusters
  • Authorizing Users to Access Resources
  • Loading Data into HDInsight
  • Troubleshooting HDInsight
  • Implementing Batch Solutions
  • Design Batch ETL Solutions for Big Data with Spark
  • Analyze Data with Spark SQL
  • Analyze Data with Hive and Phoenix
  • Stream Analytics
  • Implementing Streaming Solutions with Kafka and HBase
  • Develop Big Data Real-Time Processing Solutions with Apache Storm
  • Create Spark Streaming Applications




< >

Course is eligible for vouchers

Recently Viewed Courses:

Copyright © 2018 ProTech. All Rights Reserved.

Sign In Create Account

Navigation

Social Media