Section 1. Chapter 11
Challenge: Exploratory Analysis of a Flights Dataset
Task
You are given a flights dataset as a list of rows. Load it into a DataFrame using createDataFrame and complete the following steps, storing results in the specified variables:
- Count the total number of rows – store in total_rows;
- Find the airline with the most delayed flights (Delay == 1) – store the airline code as a string in most_delayed_airline;
- Count the number of delayed flights (Delay == 1) – store in delayed_count;
- Find the top 3 busiest routes (unique AirportFrom + AirportTo pairs by flight count) – store as a list of tuples [(origin, destination, count), ...] in top_routes.
Print all results.
Solution