Apache Hadoop Architecture Development and Administration
- Offered by Pairview Training
Apache Hadoop Architecture Development and Administration at Pairview Training Overview
Duration | 4 days |
Total fee | ₹2.40 Lakh |
Mode of learning | Online |
Difficulty level | Advanced |
Credential | Certificate |
Apache Hadoop Architecture Development and Administration at Pairview Training Highlights
- Pairview Certification for achieving this competency
- Interactive training classes over 4 days (daytime timetable) or 8 evenings (evening timetable)
- Hands-on practice after the classes through a work-based assignment over 14-21 days
- Access to support from our mentors during the course period
Apache Hadoop Architecture Development and Administration at Pairview Training Course details
- For data architects
- For data engineers
- For database developers and administrators who are looking to advance their careers into Big Data engineering
- At the end of this course, learners will be able to:
- Understand the concepts of HDFS and the MapReduce framework
- Develop robust data processing applications
- Write Hadoop code
- Apply best practices in a Hadoop development environment
- This course will equip delegates with the skills and knowledge to become an Apache Hadoop Developer
- Delegates will be exposed to a range of industry use cases, the core concepts of Hadoop (the Hadoop Distributed File System, HDFS, and MapReduce), the implementation of Hadoop, and how to develop robust data processing applications
- You will also learn best practices for Hadoop development, debugging, and the implementation of workflows
Apache Hadoop Architecture Development and Administration at Pairview Training Curriculum
Module 1: Understanding Big Data and Hadoop
The Hadoop Project and Hadoop Components
The Hadoop Distributed File System
Introduction to Big Data & Big Data Challenges
Module 2: Hadoop Architecture
Hadoop 2.x Cluster Architecture
Federation and High Availability Architecture
Typical Production Hadoop Cluster
Module 3: HDFS Architecture and Concepts
Hadoop 2.x Cluster Architecture
Federation and High Availability Architecture
Typical Production Hadoop Cluster
Module 4: HDFS Architecture and Concepts
HDFS Concepts
HDFS Architecture
Module 5: Hadoop MapReduce Framework
Differences between the Old and New MapReduce APIs
Traditional way vs MapReduce way
Why MapReduce
Module 6: The Hadoop Ecosystem
Introduction to the Ecosystem
Introduction to Pig
Introduction to Hive
Module 7: An Introduction to Hive and Pig
The Motivation for Hive and Pig
Hive Overview
Module 8: Apache Hive
Introduction to Apache Hive
Hive vs Pig
Hive Installation and Configuration
Module 9: Apache Pig
Introduction to Apache Pig
MapReduce vs Pig
Pig Installation and Configuration
Module 10: Integrating Hadoop into the Enterprise workflow
Introduction to Sqoop
Integrating Hadoop into an Existing Enterprise
Hadoop vs Relational Databases