Loading Course Schedule...
PT0092
Summary
This course focuses on leveraging Hadoop for building highly scalable applications that work with large volumes of data. It addresses the limitations of traditional multi-tier application and relational databases when dealing with very large datasets. The NoSQL initiative, Hadoop Distributed File System (HDFS), HBase database and Hive will be discussed in detail. Each section introduces and explains a concept or technology, followed, as relevant and necessary, by an interactive coding section to help you understand the concept. This course combines concepts and hands-on labs in good proportion to illustrate key concepts and practical applications of these concepts.
Prerequisites
Required Basic Java Programming experience Basic SQL experience Knowledge of distributed computing concepts (such as basic JEE concepts, etc.) Recommended Strong Java and JEE programming background
Duration
5 Days/Lecture & Lab
Audience
This course is designed for Java Developers and Architects who have a good understanding of distributed computing concepts.
Topics
- Introduction to Hadoop
- Introduction to HDFS, OpenSSH and Hadoop Cluster
- Introduction to HBase and Hive
- Introduction to ZooKeeper
- Introduction to SQOOP
- The MapReduce API
- Debugging MapReduce applications