LinkedIn Learning

Apache PySpark by Example 

  • Offered by LinkedIn Learning

Apache PySpark by Example at LinkedIn Learning
Overview

Duration

2 hours

Total fee

1,400

Mode of learning

Online

Difficulty level

Intermediate

Credential

Certificate

Highlights

  • Earn a sharable certificate

Course details

More about this course
  • This practical, hands-on course helps you get comfortable with PySpark, explaining what it offers and how it can enhance your data science work.
  • To begin, instructor Jonathan Fernandes digs into the Spark ecosystem, detailing its advantages over other data science platforms, APIs, and tool sets.
  • Next, he looks at the DataFrame API and how it's the platform's answer to many big data challenges.
  • Finally, he goes over Resilient Distributed Datasets (RDDs), the building blocks of Spark.

Curriculum

Introduction

Apache PySpark

What you should know

Introduction to Apache Spark

The Apache Spark ecosystem

Why Spark?

Spark origins and Databricks

Spark components

Partitions, transformations, lazy evaluations, and actions

Technical Setup

Set up the lab environment

Download a dataset

Importing

Working with the DataFrame API

The DataFrame API

Working with DataFrames

Schemas

Working with columns

Working with rows

Challenge

Solution

Functions

Built-in functions

Working with dates

User-defined functions

Working with joins

Challenge

Solution

Resilient Distributed Datasets (RDDs)

RDDs

Working with RDDs

Conclusion

Next steps
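
One curriculum topic above, "Partitions, transformations, lazy evaluations, and actions", may be easier to picture with a small sketch. The block below is a plain-Python analogy only, not PySpark and not material from the course: `LazyDataset` and its methods are hypothetical names invented for this illustration. It assumes the usual description of lazy evaluation, where transformations only record work and an action triggers it.

```python
# Plain-Python analogy of Spark-style lazy evaluation (NOT the PySpark API).
# LazyDataset is a hypothetical class made up for this illustration.

class LazyDataset:
    def __init__(self, source, steps=None):
        self._source = source        # any re-iterable, e.g. a list
        self._steps = steps or []    # recorded transformations, not yet run

    # Transformations: record the step and return a new dataset; no work happens.
    def map(self, fn):
        return LazyDataset(self._source, self._steps + [("map", fn)])

    def filter(self, predicate):
        return LazyDataset(self._source, self._steps + [("filter", predicate)])

    def _run(self):
        # Apply every recorded step to each item, in order.
        for item in self._source:
            keep = True
            for kind, fn in self._steps:
                if kind == "map":
                    item = fn(item)
                elif not fn(item):   # a "filter" step rejected the item
                    keep = False
                    break
            if keep:
                yield item

    # Actions: these trigger execution of the recorded steps.
    def collect(self):
        return list(self._run())

    def count(self):
        return sum(1 for _ in self._run())


calls = []  # records each time the mapping function actually runs


def double(x):
    calls.append(x)
    return x * 2


pipeline = LazyDataset([1, 2, 3, 4]).map(double).filter(lambda x: x > 4)
calls_before_action = len(calls)  # nothing has executed yet
result = pipeline.collect()       # the action runs the recorded steps
```

In this sketch, building `pipeline` does no work; `double` only runs once `collect()` is called. Real Spark adds partitioning and distributed execution on top of the same idea, which this analogy does not attempt to model.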

Faculty details

Jonathan Fernandes, Generative AI | Large Language Models | NLP
