Spark V2 for Data Analysts

PT22154
Summary
This course will introduce Apache Spark. The students will learn how Spark fits into the Big Data ecosystem, and how to use Spark for data analysis. This class is taught with Python language and using Jupyter environment
Prerequisites
Analyst background (familiarity with SQL, Scripting .etc)
Duration
3 Days/Lecture & Lab
Audience
This course is designed for Data Analysts and Business Analysts.
Topics
  • Spark Introduction
  • First Look at Spark
  • Spark Data structures
  • Caching
  • Dataframes / Datasets
  • Spark SQL
  • Spark and Hadoop
  • Spark and Hadoop

Related Scheduled Courses