Analyzing Data With Apache Spark (For Data Analysts)

PT15156
Summary
This course will introduce Apache Spark. The students will learn how Spark fits into the Big Data ecosystem, and how to use Spark for data analysis. This class is taught with Python language and using Jupyter environment
Prerequisites
The prerequisite for this course includes analyst background (familiarity with SQL, Scripting ..etc)
Duration
2 Days/Lecture & Lab
Audience
This course is designed for data analysts and business analysts.
Topics
  • Spark Introduction
  • First Look at Spark
  • Spark Data structures
  • Caching
  • Dataframes / Datasets
  • Spark SQL
  • Spark and Hadoop
  • Workshops

Related Scheduled Courses