Section 3. Chapter 4
single
Challenge: Polars Data Aggregation
Swipe to show menu
In this challenge, you will use polars to efficiently perform data aggregation on large datasets. Specifically, you are tasked with grouping a large DataFrame by one column and computing the mean of another column for each group. This is a common operation in data analysis, especially when working with big data, as it allows you to summarize and extract insights from subsets of your data without loading everything into memory at once.
Task
Swipe to start coding
Write a function using polars that groups a DataFrame by a specified column and computes the mean of another column for each group.
- The function must take a
pl.DataFrame, agroup_colstring, and avalue_colstring as arguments. - The function must return a new DataFrame containing each unique value in
group_coland the mean ofvalue_colfor that group. - The resulting DataFrame must have a column named
"mean_"followed by thevalue_colname, containing the computed mean values.
Solution
Everything was clear?
Thanks for your feedback!
Section 3. Chapter 4
single
Ask AI
Ask AI
Ask anything or try one of the suggested questions to begin our chat