PT10009
Summary
This course focuses on implementing Hadoop jobs and developing Hive and Pig queries. It was designed by a team of experienced industry professionals to give learners the in-depth knowledge and skills needed to become a successful Hadoop developer. The curriculum covers the topics required to gain expertise in the Hadoop ecosystem.
Prerequisites
There are no prerequisites for this course.
Duration
3 Days/Lecture & Labs
Topics
- Introduction to Hadoop
- Parallelizing Program Execution
  - Meeting the challenges of parallel programming
  - Parallel programming with MapReduce
- Implementing Real-World MapReduce Jobs
  - Applying the Hadoop MapReduce paradigm
  - Building complex MapReduce jobs
- Customizing MapReduce
  - Solving common data manipulation problems
  - Implementing partitioners and comparators
- Persisting Big Data with Distributed Data Stores
  - Making the case for distributed data
  - Interfacing with Hadoop Distributed File System (HDFS)
  - Sharing reference data with Distributed Cache
  - Structuring data with HBase
  - Comparing HBase to other types of NoSQL data stores
- Simplifying Data Analysis with Query Languages
  - Unleashing the power of SQL with Hive
  - Executing workflows with Pig
- Managing and Deploying Big Data Solutions
  - Testing and debugging Hadoop code
  - Deploying, monitoring and tuning performance