Certificate Program in Data Engineering
- Offered byHero Vired
Certificate Program in Data Engineering at Hero Vired Overview
Duration | 9 months |
Total fee | ₹2.50 Lakh |
Mode of learning | Online |
Credential | Certificate |
Certificate Program in Data Engineering at Hero Vired Highlights
- Earn a Certificate from Hero Vired including placement assistance
- Live projects & assignments integrated through the curriculum
- 70+ Live sessions with global & Indian faculty for all-round exposure
- Personalise Career Services to build out personal and professional brand
- Be prepared for Data, Big Data, Senior Data and Data Warehouse Engineer roles
- Opportunity to work 7+ Government data projects, industry projects, and case studies
Certificate Program in Data Engineering at Hero Vired Course details
- Learning with industry-acclaimed data engineering tools
- Dedicated Program Manager to solve your queries, discussion Forums and Community
- Program and deploy an application layer for data loading and transformation
- Extract relevant data from multiple sources
- Load the data into unstructured or semi-structured data lakes and warehouses
- Transform data into relevant dashboards or design for data analysis or reporting
- Understanding data pipelines and its deployment on cloud infrastructure
- With this program, participants will acquire essential skills and gain insights into leveraging big data to efficiently solve business problem
- The program begins with learning to use advanced SQL queries to extract the relevant data for business problem statements
- The program then dives into the processes required to pipeline big data into data lakes and data warehouses using Kafka & Apache Airflow
- Particpants will learn how to engineer data systems that efficiently extract, transform, and load data into consumable and usable information for business analysis
- The program explores the concepts and techniques of engineering data systems using Python, SQL & NoSQL, Airflow, Kafka, Spark, Scala, Hive, AWS S3, Azure, and MongoDB
- The program focuses on teaching you how to Extract, Transform and Load (ETL) live and pre-stored data in the most efficient manner and helps to gain an understanding of the relevant storage infrastructure required to build, deploy and test application layers for efficient data loading
Certificate Program in Data Engineering at Hero Vired Curriculum
Programming Fundamentals: Python
Basic Python Data Structures
Python Text Processing
Object Oriented Programming
Basic I/O using open() directive, csv module
Transforming CSV and JSON files
Tabular data manipulation with pandas
Tabular data manipulation with pandas
Working with NumPy, broadcasting, vectorizing operations
Working with logger, config files, accessing environment variables, building a python wheel, Unit testing
SQL
Introduction to SQL
Loading data in SQL
Querying data from SQL
SQL WIndow Functions
NoSQL Databases
When to use a NoSQL Database
Working with MongoDB and DynamoDB
Selecting appropriate Primary keys
Data Warehousing Basics
Create Normalised tables
Star and Snowflake Schema
Denormalise a database into Star and Snowflake Schema
Data Warehousing concepts: ETL process
Scala Programming for Spark
Variables, methods, classes and objects
Package and package objects
Higher order functions in Scala
Scala Collections and SBT
Transforming data using Spark
Hadoop architecture
Querying data using HIVE: Running Hive CLI, DDL and DML operations
Data manipulation using Spark SQL and Data frames
Loading data from S3 to spark and doing transformations
Create data lakes on S3, Glue and Amazon Athena
Orchestrating data pipelines
Building a spark application using SBT
Writing unit tests for spark applications
Create pipelines using Apache Airflow
Connect Different Data Sources using Apache Airflow
Stream Processing
Create DAGs
Monitor pipelines using web UI
Spark Streaming: Structured Streams
Kafka as a streaming source and sink
Software Engineering Essentials
Event based spark streams
Stream Processing Flink
CI/CD Pipelines, code standards, version control, debugging, docker, Kubernetes*, Microservices (FastAPI)