Data Analysis & Machine Learning Using Python

PT16793
Training Summary
Python is one of the fastest-growing high-level programming languages which enables clear programs on small and large scales. Widely considered the programming language of choice for serious developers, it is easy to learn and deploy, with design features that emphasize clarity of syntax, easy readability, and easy comprehension. Python can be used for large and small applications - to create web apps, games, or even a search engine. As programming in Python is much simpler than C, C++ and Java, this is the preferred language in many engineering, science and business applications. This course will introduce the participants to the design methodologies used to address Data Analysis/Machine Learning scenarios that use Python packages such as Scipy, Pandas, Statsmodels and Scikit-learn to build solutions. Participants will learn to manipulate, process, analyze and clean data using powerful libraries and tools. With the introduction of various reallife scenarios, they will learn the practical applications of Data Analysis using Python and how to integrate this modern and dynamic language with Hadoop.
Prerequisites
There are no prerequisites for this course.
Duration
5 Days/Lecture & Lab
Audience
This course is designed for those wanting to learn the design methodologies used to address Data Analysis/Machine Learning scenarios that use Python packages such as Scipy, Pandas, Statsmodels and Scikit-learn to build solutions.
Course Topics
Data analysis - Why Python?
  • Data Analysis - Application Scenarios
  • Design methodologies in Data Analysis solutions
  • Overview of available Frameworks for Data Analysis
  • Matplotlib
  • Scripy and Numpy
  • Pandas
  • IPython Toolkit
  • Scikit-Learn
  • Python in Hadoop Ecosystem

Related Scheduled Courses