The Ultimate Hands-On Hadoop - Tame your Big Data!
- Offered byUDEMY
The Ultimate Hands-On Hadoop - Tame your Big Data! at UDEMY Overview
Duration | 15 hours |
Total fee | ₹3,199 |
Mode of learning | Online |
Difficulty level | Intermediate |
Official Website | Go to Website |
Credential | Certificate |
The Ultimate Hands-On Hadoop - Tame your Big Data! at UDEMY Highlights
- Earn a certificate from Udemy
- Learn from industry experts
- 2 downloadable resources
- Access on mobile and TV
The Ultimate Hands-On Hadoop - Tame your Big Data! at UDEMY Course details
- Software engineers and programmers who want to understand the larger Hadoop ecosystem, and use it to store, analyze, and vend \big data\ at scale
- Project program or product managers who want to understand the lingo and high-level architecture of Hadoop
- Data analysts and database administrators who are curious about Hadoop and how it relates to their work
- System architects who need to understand the components available in the Hadoop ecosystem program and how they fit together
- Design distributed systems that manage "big data" using Hadoop and related data engineering technologies
- Use HDFS and MapReduce for storing and analyzing data at scale
- Use Pig and Spark to create scripts to process data on a Hadoop cluster in more complex ways
- Analyze relational data using Hive and MySQL
- Analyze non-relational data using HBase, Cassandra, and MongoDB
- Query data interactively with Drill, Phoenix, and Presto
- Choose an appropriate data storage technology for your application
- Understand how Hadoop clusters are managed by YARN, Tez, Mesos, Zookeeper, Zeppelin, Hue, and Oozie
- Publish data to your Hadoop cluster using Kafka, Sqoop, and Flume
- Consume streaming data using Spark Streaming, Flink, and Storm
- The world of Hadoop and "Big Data" can be intimidating - hundreds of different technologies with cryptic names form the Hadoop ecosystem
- With this Hadoop tutorial, you'll not only understand what those systems are and how they fit together - but you'll go hands-on and learn how to use them to solve real business problems! Learn and master the most popular big data technologies in this comprehensive course, taught by a former engineer and senior manager from Amazon and IMDb
- We'll go way beyond Hadoop itself, and dive into all sorts of distributed systems you may need to integrate with
- Install and work with a real Hadoop installation right on your desktop with Hortonworks (now part of Cloudera) and the Ambari UI Manage big data on a cluster with HDFS and MapReduce Write programs to analyze data on Hadoop with Pig and Spark Store and query your data with Sqoop , Hive , MySQL , HBase , Cassandra , MongoDB , Drill , Phoenix , and Presto Design real-world systems using the Hadoop ecosystem Learn how your cluster is managed with YARN , Mesos , Zookeeper , Oozie , Zeppelin , and Hue Handle streaming data in real time with Kafka , Flume , Spark Streaming , Flink , and Storm Understanding Hadoop is a highly valuable skill for anyone working at companies with large amounts of data. Almost every large company you might want to work at uses Hadoop in some way, including Amazon, Ebay, Facebook, Google, LinkedIn, IBM, Spotify, Twitter, and Yahoo! And it's not just technology companies that need Hadoop; even the New York Times uses Hadoop for processing images
- This course is comprehensive, covering over 25 different technologies in over 14 hours of video lectures
- It's filled with hands-on activities and exercises, so you get some real experience in using Hadoop - it's not justheory
- You'll find a range of activities in this course for people at every level
- If you're a project manager who just wants to learn the buzzwords, there are web UI's for many of the activities in the course that require no programming knowledge
- If you're comfortable with command lines, we'll show you how to work with them too
The Ultimate Hands-On Hadoop - Tame your Big Data! at UDEMY Curriculum
Learn all the buzzwords! And install the Hortonworks Data Platform Sandbox
Tips for Using This Course
Warning for Apple M1 users
Using Hadoop's Core: HDFS and MapReduce
HDFS: What it is, and how it works
Alternate MovieLens download location
Programming Hadoop with Pig
Introducing Ambari
Introducing Pig
Programming Hadoop with Spark
Why Spark?
The Resilient Distributed Dataset (RDD)
Using relational data stores with Hadoop
What is Hive?
[Activity] Use Hive to find the most popular movie
Using non-relational data stores with Hadoop
Why NoSQL?
What is HBase
Querying your Data Interactively
Overview of Drill
[Activity] Setting up Drill
Managing your cluster
Tez explained
[Activity] Use Hive on Tez and measure the performance benefit