Loading Course Schedule...
PT10321
Summary
This course will introduce Apache Spark. The students will learn how Spark fits into the Big Data ecosystem, and how to use Spark for data analysis. The course covers Spark shell for interactive data analysis, Spark internals, RDDs, Dataframes and Spark SQL.
Prerequisites
Before taking this course, students should have a familiarity with programming language and the Linux environment (navigating command line, editing files).
Duration
2 Days/Lecture & Lab
Audience
This course is ideal for business users (business analysts / data analysts).
Topics
Spark Basics::First Look at Spark::RDDs in Depth::Spark SQL & Dataframes