Building Open Source Software (OSS) Analytics Solutions with Azure HDInsight
- Offered by Microsoft
Building Open Source Software (OSS) Analytics Solutions with Azure HDInsight at Microsoft Overview
Duration | 5 hours |
Total fee | Free |
Mode of learning | Online |
Schedule type | Self paced |
Difficulty level | Intermediate |
Credential | Certificate |
Building Open Source Software (OSS) Analytics Solutions with Azure HDInsight at Microsoft Course details
- Introduction to the Open source Analytics Offering
- Choose the correct HDInsight Configuration to build open source analytics solutions
- Creating and configuring an HDInsight cluster
- Run Petabyte level OSS NoSQL databases with HDInsight HBase
- Perform advanced streaming data transformations with Apache Spark and Kafka in Azure HDInsight
- Perform Zero ETL analytics with HDInsight Interactive Query
- Manage enterprise security in HDInsight
- This learning path introduces HDInsight and shows how to apply it to a range of real-world challenges.
- By the end of this course, you will understand how Azure HDInsight, a fully managed cloud service, lets you efficiently process massive amounts of data using the most popular open source frameworks.
Building Open Source Software (OSS) Analytics Solutions with Azure HDInsight at Microsoft Curriculum
Introduction to the Open source Analytics Offering
Introduction
What is HDInsight?
How does HDInsight work?
When to use HDInsight
Knowledge check
Summary
Choose the correct HDInsight Configuration to build open source analytics solutions
Introduction
HDInsight configuration options
Decision criteria for selecting the correct HDInsight configuration option
Analyze a scenario and map it to an HDInsight configuration option
Cost optimization strategies for HDInsight clusters
Knowledge check
Summary
Creating and configuring an HDInsight cluster
Introduction
Creating an HDInsight cluster
Exercise - Create an HDInsight cluster via the Azure portal
Opening a Jupyter Notebook on HDInsight Spark cluster
Exercise - Execute queries on HDInsight Spark cluster
Enable monitoring of HDInsight jobs
Common provisioning issues
Exercise - Monitor an HDInsight cluster
Summary
Knowledge check
Run Petabyte level OSS NoSQL databases with HDInsight HBase
Introduction
Describe Apache HBase
Explain HDInsight HBase clusters architecture and application patterns
Improve the write and read performance of HBase clusters
Determine migration and high availability strategies in HDInsight HBase
Use Apache Phoenix on HDInsight HBase
Determine HDInsight HBase cluster performance
Perform benchmarking in HBase
Knowledge check
Summary
Perform advanced streaming data transformations with Apache Spark and Kafka in Azure HDInsight
Introduction
Use HDInsight Spark and Kafka
Stream data with Apache Kafka
Describe Spark structured streaming
Create a Kafka and Spark architecture
Exercise - Provision HDInsight to perform advanced streaming data transformations
Exercise - Create the Kafka producer
Exercise - Stream Kafka data to a Jupyter notebook and window the data
Replicate data to a secondary cluster
Knowledge check
Summary
Perform Zero ETL analytics with HDInsight Interactive Query
Introduction
When should you use HDInsight Interactive Query?
HDInsight interactive queries
Exercise - Provision HDInsight to perform ad hoc analytics
Exercise - Upload and query data in HDInsight
Integrate Apache Spark and Hive LLAP queries
Create a large-scale interactive query dashboard for evaluating real estate trends
Summary
Knowledge check
Manage enterprise security in HDInsight
Introduction
Describe HDInsight security areas
Implement Network security
Understand operating system security
Manage application/middleware security
Implement data access security
Knowledge check
Summary