Impala - An Open Source SQL Engine for Hadoop

PT10026
Training Summary
This is an ideal course package for individuals who want to understand the basic concepts of Massively Parallel Processing or MPP SQL query engine that runs on Apache Hadoop. On completing this course, learners will be able to interpret the role of Impala in the Big Data Ecosystem. The course focuses on the basics of Impala. It further provides an overview of the superior performance of Impala, against other popular SQL-on-Hadoop systems.
Prerequisites
Fundamental Knowledge of programming language and Hadoop components is the basic course prerequisite. However, participants are expected to have knowledge of SQL commands.
Duration
2 Days/Lecture & Labs
Audience
AnalystsData scientistsHadoop administrator and developersSQL developersData warehouse developersDatabase administrators and developers
Course Topics
  • Course Introduction
  • Introduction to Impala
  • Querying with Hive and Impala
  • Data Storage and File Format
  • Working with Impala

Related Scheduled Courses