Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Apprendre Introduction to PySpark | Spark Basics
Introduction to Big Data with Apache Spark in Python
course content

Contenu du cours

Introduction to Big Data with Apache Spark in Python

Introduction to Big Data with Apache Spark in Python

1. Big Data Basics
2. Spark Basics
3. Spark SQL

book
Introduction to PySpark

What is PySpark?

It provides Python APIs for Spark’s core functionalities, including Spark SQL, DataFrames, RDDs (Resilient Distributed Datasets), and MLlib (machine learning library).

It also allows integration with other Python libraries and tools, making it easier to build data pipelines, perform analysis, and apply machine learning models.

Tout était clair ?

Comment pouvons-nous l'améliorer ?

Merci pour vos commentaires !

Section 2. Chapitre 4
We're sorry to hear that something went wrong. What happened?
some-alt