Mastering Big Data Analytics
- Offered by Great Learning
Mastering Big Data Analytics at Great Learning Overview
Duration | 19 hours |
Total fee | Free |
Mode of learning | Online |
Difficulty level | Intermediate |
Official Website | Explore Free Course |
Credential | Certificate |
Mastering Big Data Analytics at Great Learning Highlights
- Earn a certificate of completion
Mastering Big Data Analytics at Great Learning Course details
- MapReduce
- HDFS
- YARN
- Hive
- Apache Hadoop
- PySpark
- Kafka
- Spark Streaming
- The Big Data Analytics course will introduce you to prominent big data tools, with a few demonstrations and case studies for each of these tools
- The course focuses on using each of these tools for analytics
- It begins with an introduction to Hadoop, covering the framework and its different versions
- You will learn about Hive for working with SQL through illustrations, Spark for streaming and analysis, and how RDDs and PySpark work
- In the latter part of the Mastering Big Data Analytics course, you will learn to work with Apache Kafka and advanced Spark concepts
- The course also includes projects you can work on and five assessments to evaluate your progress on each topic
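As a rough illustration of the MapReduce paradigm named in the topic list above, here is a minimal word-count sketch in plain Python. It is not taken from the course material and does not use the Hadoop API; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are hypothetical and chosen only for this example.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle step: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Reduce step: sum the values collected for each key."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over an iterable of lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    return reduce_phase(shuffle_phase(mapped))


if __name__ == "__main__":
    # Hypothetical sample input, used only for illustration.
    sample = ["big data tools", "big data analytics"]
    print(word_count(sample))
```

In a real Hadoop job, the map and reduce steps would run in parallel across a cluster and the framework would handle the shuffle; this sketch runs all three steps in a single process to show the data flow only.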
Mastering Big Data Analytics at Great Learning Curriculum
Big Data touch
Getting Started: Hadoop
Hadoop Framework : Stepping into Hadoop
HDFS: What and Why?
Working on HDFS
Hadoop 2.x - YARN
MapReduce: A Programming Paradigm
A closer look at MapReduce
Practical approach to MapReduce
Hadoop 1.x vs Hadoop 2.x
Hadoop 3.x
Apache Hive : Teasing the Honey bee
Hive illustration : Basics
Hive illustration : External tables in Hive
Hive illustration : Loading different file formats
Hive illustration : Loading data into Hive tables
Hive illustration : Simple Operations on Hive table
Hive illustration : Query Operations on Hive table
Hive illustration : Querying complex structures
Hive illustration : Views
Getting started - Spark Basics
Spark and Hadoop - Face to face
Spark - Architecture
RDDs - Building blocks of Spark
RDDs continued
Spark terminologies
PySpark - Getting hands dirty
Spark - MLlib
PySpark - Clustering
Music data - Study the case - 01
Music data - Study the case - 02
Music data - Study the case - 03
Spark Streaming and Real-time data analytics
Spark Streaming Architecture
Real-time Data Analysis on Twitter Data : Demo
Case study - Ad tech - 01
Case study - Ad tech - 02
Kafka - What and Where?
Kafka - Key components: Broker, Producer
Kafka - Key components: Topics, Partitions
Kafka - Key components: Consumer, Replicas
Kafka - APIs and Clusters
How Kafka Works with Examples
ZooKeeper - Basic principles
Live Kafka demo with Twitter
Configuring Spark
Spark Properties
Performance Tuning
Data serialization
Memory tuning
Garbage collection
Memory usage and levels of parallelism
Data locality and broadcasting
Job scheduling
Modes in cluster management
Dynamic resource allocation
Decommission of executors
Application schedule