Cloudera Data Analyst Training
- Offered byCloudera
Cloudera Data Analyst Training at Cloudera Overview
Duration | 2 hours |
Mode of learning | Online |
Credential | Certificate |
Cloudera Data Analyst Training at Cloudera Highlights
- Earn a certificate from Cloudera after completion
Cloudera Data Analyst Training at Cloudera Course details
- For SQL developers
- For data analysts
- For business intelligence specialists
- For developers
- For system architects
- For database administrators
- Create tables using a variety of data types, delimiters, and file formats
- Create new tables using existing tables to define the schema
- Improve query performance by creating partitioned tables in the metastore
- Alter tables to modify existing schema
- Create views in order to simplify queries
- Training course will teach students to apply traditional data analytics and business intelligence skills to big data
- This course presents the tools data professionals need to access, manipulate, transform, and analyze complex data sets using SQL and familiar scripting languages
Cloudera Data Analyst Training at Cloudera Curriculum
Introduction
Apache Hadoop Fundamentals
The Motivation for Hadoop
Hadoop Overview
Data Storage: HDFS
Distributed Data Processing: YARN, MapReduce, and Spark
Data Processing and Analysis: Pig, Hive, and Impala
Database Integration: Sqoop
Other Hadoop Data Tools
Exercise Scenario Explanation
Introduction to Apache Hive and Impala
What Is Hive?
What Is Impala?
Why Use Hive and Impala?
Schema and Data Storage
Comparing Hive and Impala to Traditional
Querying with Apache Hive and Impala
Databases and Tables
Basic Hive and Impala Query Language Syntax
Data Types
Using Hue to Execute Queries
Using Beeline (Hive's Shell)
Using the Impala Shell
Common Operators and Built-In Functions
Operators
Scalar Functions
Aggregate Functions
Data Management
Data Storage
Creating Databases and Tables
Loading Data
Altering Databases and Tables
Simplifying Queries with Views
Storing Query Results
Data Storage and Performance
Partitioning Tables
Loading Data into Partitioned Tables
When to Use Partitioning
Choosing a File Format
Using Avro and Parquet File Formats