Simplilearn
Simplilearn Logo

Big Data and Hadoop Spark Developer 

  • Offered bySimplilearn
  • Private Institute
  • Estd. 2010

Big Data and Hadoop Spark Developer
 at 
Simplilearn 
Overview

Duration

22 hours

Mode of learning

Online

Difficulty level

Intermediate

Credential

Certificate

Future job roles

CRUD, .Net, CSR, Credit risk, Senior Software Developer

Big Data and Hadoop Spark Developer
 at 
Simplilearn 
Highlights

  • Earn your certificate in mastering this domain
  • Earn Hadoop 2.7 experience certificate
  • A great course for learning Big Data
  • Certification Course
Read more
Details Icon

Big Data and Hadoop Spark Developer
 at 
Simplilearn 
Course details

More about this course
  • Simplilearn's Big Data Hadoop course lets students master the concepts of the Hadoop framework, Big data tools, and methodologies
  • Achieving a Big Data Hadoop certification prepares students for success as a Big Data Developer
  • This Big Data and Hadoop training help you understand how the various components of the Hadoop ecosystem fit into the Big Data processing lifecycle
  • Take this Big Data and Hadoop online training to explore Spark applications, parallel processing, and functional programming

Big Data and Hadoop Spark Developer
 at 
Simplilearn 
Curriculum

Lesson 1 Course Introduction

Course Introduction

Accessing Practice Lab

Lesson 2 Introduction to Big Data and Hadoop

Introduction to Big Data and Hadoop

Introduction to Big Data

Big Data Analytics

What is Big Data

Four Vs Of Big Data

Case Study: Royal Bank of Scotland

Challenges of Traditional System

Distributed Systems

Introduction to Hadoop

Components of Hadoop Ecosystem: Part One

Components of Hadoop Ecosystem: Part Two

Components of Hadoop Ecosystem: Part Three

Commercial Hadoop Distributions

Demo: Walkthrough of Simplilearn Cloudlab

Key Takeaways

Knowledge Check

Lesson 3 Hadoop Architecture,Distributed Storage (HDFS) and YARN

Hadoop Architecture Distributed Storage (HDFS) and YARN

What Is HDFS

Need for HDFS

Regular File System vs HDFS

Characteristics of HDFS

HDFS Architecture and Components

High Availability Cluster Implementations

HDFS Component File System Namespace

Data Block Split

Data Replication Topology

HDFS Command Line

Demo: Common HDFS Commands

HDFS Command Line

YARN Introduction

YARN Use Case

YARN and Its Architecture

Resource Manager

How Resource Manager Operates

Application Master

How YARN Runs an Application

Tools for YARN Developers

Demo: Walkthrough of Cluster Part One

Demo: Walkthrough of Cluster Part Two

Key Takeaways

Knowledge Check

Hadoop Architecture,Distributed Storage (HDFS) and YARN

Lesson 4 Data Ingestion into Big Data Systems and ETL

Data Ingestion into Big Data Systems and ETL

Data Ingestion Overview Part One

Data Ingestion

Apache Sqoop

Sqoop and Its Uses

Sqoop Processing

Sqoop Import Process

Assisted Practice: Import into Sqoop

Sqoop Connectors

Demo: Importing and Exporting Data from MySQL to HDF

Apache Sqoop

Apache Flume

Flume Model

Scalability in Flume

Components in Flume's Architecture

Configuring Flume Components

Demo: Ingest Twitter Data

Apache Kafka

Aggregating User Activity Using Kafka

Kafka Data Model

Partitions

Apache Kafka Architecture

Producer Side API Example

Consumer Side API

Demo: Setup Kafka Cluster

Consumer Side API Example

Kafka Connect

Key Takeaways

Demo: Creating Sample Kafka Data Pipeline using Producer and Consumer

Knowledge Check

Data Ingestion into Big Data Systems and ETL

Lesson 5 Distributed Processing - MapReduce Framework and Pig

Distributed Processing MapReduce Framework and Pig

Distributed Processing in MapReduce

Word Count Example

Map Execution Phases

Map Execution Distributed Two Node Environment

MapReduce Jobs

Hadoop MapReduce Job Work Interaction

Setting Up the Environment for MapReduce Development

Set of Classes

Creating a New Project

Advanced MapReduce

Data Types in Hadoop

OutputFormats in MapReduce

Using Distributed Cache

Joins in MapReduce

Replicated Join

Introduction to Pig

Components of Pig

Pig Data Model

Pig Interactive Modes

Pig Operations

Various Relations Performed by Developers

Demo: Analyzing Web Log Data Using MapReduce

Demo: Analyzing Sales Data and Solving KPIs using PIG

Apache Pig

Demo: Wordcount

Key takeaways

Knowledge Check

Distributed Processing - MapReduce Framework and Pig

Other courses offered by Simplilearn

– / –
6 months
– / –
1.53 L
11 months
– / –
– / –
4 days
– / –
1.5 L
4 months
– / –
View Other 283 CoursesRight Arrow Icon

Big Data and Hadoop Spark Developer
 at 
Simplilearn 
Students Ratings & Reviews

5/5
Verified Icon2 Ratings
J
Janmejay Rai
Big Data and Hadoop Spark Developer
Offered by Simplilearn
5
Other: Took Big Data Hadoop Developer course from Simplilearn & it was really a great experience. The instructor was more focused on hands-on rather than slides which kept the interest in the session and also helped us to keep up the learning pace. The content provided was also very good. Being a beginner in Big Data & Hadoop, it was very easy for me to catch the concepts. Instructor was very patience & helpful and used to help us in understanding the concepts by repeating it multiple times. I would really recommend Sinplilearn for learning.
Reviewed on 9 Jan 2020Read More
Thumbs Up IconThumbs Down Icon
R
Rahul
Big Data and Hadoop Spark Developer
Offered by Simplilearn
5
Big Data & Hadoop
Other: Got a 50% SALARY HIKE  with BIG DATA CERTIFICATION
Reviewed on 2 Apr 2019Read More
Thumbs Up IconThumbs Down Icon
View All 2 ReviewsRight Arrow Icon
qna

Big Data and Hadoop Spark Developer
 at 
Simplilearn 

Student Forum

chatAnything you would want to ask experts?
Write here...