Apache Spark for Developers

Catalog Home Databases, Business Intelligence & Data Science Data Science

There are no scheduled dates for this course.

  Available by Request

This course will introduce Apache Spark. The students will learn how to use Spark for data analysis and write Spark applications. This course has been completely updated for latest Spark version 2.x! Spark version 2 has lots of changes compared to v1. This course covers the latest Spark v2 features.

Before taking this course, students should have familiarity with either Java / Scala / Python language (our labs in Scala and Python – we provide a quick Scala introduction). Students should also have a basic understanding of Linux development environment (command line navigation / running commands).

3 Days/Lecture & Lab

This course is designed for developers and data analysts.

Scala primer

  • Spark Basics
  • Spark Shell
  • RDDs (Condensed coverage)
  • Spark Dataframes & Datasets
  • Spark API programming (Scala / Python)
  • Spark SQL
  • Spark and Hadoop
  • Machine Learning (ML / MLib)
  • GraphX
  • Spark Streaming
  • Spark Performance and Tuning




< >

Copyright © 2018 ProTech. All Rights Reserved.

Sign In Create Account

Navigation

Social Media