Apache Spark Getting Started
- Offered bySkillsoft
Apache Spark Getting Started at Skillsoft Overview
Duration | 1 hour |
Mode of learning | Online |
Difficulty level | Beginner |
Official Website | Go to Website |
Credential | Certificate |
Future job roles | CRUD, .Net, CSR, Credit risk, Senior Software Developer |
Apache Spark Getting Started at Skillsoft Highlights
- Earn a digital badge after completion of course from Skillsoft
Apache Spark Getting Started at Skillsoft Course details
Explore the basics of Apache Spark, an analytics engine used for big data processing,it's an open source, cluster computing framework built on top of Hadoop
Discover how it allows operations on data with both its own library methods and with SQL, while delivering great performance
Learn the characteristics, components, and functions of Spark, Hadoop, RDDS, the spark session, and master and worker notes. Install PySpark
Apache Spark Getting Started at Skillsoft Curriculum
Course Overview
Introduction to Spark and Hadoop
Resilient Distributed Datasets (RDDs)
RDD Operations
Spark DataFrames
Spark Architecture
Spark Installation
Working with RDDs
Creating DataFrames from RDDs
Contents of a DataFrame
The SQLContext
The map() Function of an RDD
Accessing the Contents of a DataFrame
DataFrames in Spark and Pandas
Exercise: Working with Spark