Visualizing the Dynamics Across Clusters
The selective pair of months on the scatter plot looked good, didn't it? Maybe there were no key differences between 'areas' on the plot, but at least there were no outliers outside the respective groups, and in general, all groups were disjoint.
Finally, let's find out the yearly dynamics for each cluster, i.e. let's build the line plot representing the monthly averages for each group of points.
Swipe to start coding
- Extract the necessary columns (month's names and temperatures) within the
colvariable:
- Firstly, extract the 2-13 column names as
listtype, and save them within thecolvariable. - Then add the
'prediction'string to the listcol.
- Calculate the monthly average temperatures for each cluster, and save the result within
monthly_datavariable:
- Firstly group the observations of
colcolumn ofdataby'prediction'. - Then calculate
.mean()of grouped table. - Then apply
.stack()to stack the table (already done). - Finally reset the indices using
.reset_index()method.
-
Assign list
['Group', 'Month', 'Temp']as column names for transformed data withinmonthly_datavariable. -
Build the line plot
'Month'vs'Temp'for eachGroupusingmonthly_dataDataFrame.
Lösning
Tack för dina kommentarer!
single
Fråga AI
Fråga AI
Fråga vad du vill eller prova någon av de föreslagna frågorna för att starta vårt samtal
Sammanfatta detta kapitel
Explain code
Explain why doesn't solve task
Awesome!
Completion rate improved to 3.57
Visualizing the Dynamics Across Clusters
Svep för att visa menyn
The selective pair of months on the scatter plot looked good, didn't it? Maybe there were no key differences between 'areas' on the plot, but at least there were no outliers outside the respective groups, and in general, all groups were disjoint.
Finally, let's find out the yearly dynamics for each cluster, i.e. let's build the line plot representing the monthly averages for each group of points.
Swipe to start coding
- Extract the necessary columns (month's names and temperatures) within the
colvariable:
- Firstly, extract the 2-13 column names as
listtype, and save them within thecolvariable. - Then add the
'prediction'string to the listcol.
- Calculate the monthly average temperatures for each cluster, and save the result within
monthly_datavariable:
- Firstly group the observations of
colcolumn ofdataby'prediction'. - Then calculate
.mean()of grouped table. - Then apply
.stack()to stack the table (already done). - Finally reset the indices using
.reset_index()method.
-
Assign list
['Group', 'Month', 'Temp']as column names for transformed data withinmonthly_datavariable. -
Build the line plot
'Month'vs'Temp'for eachGroupusingmonthly_dataDataFrame.
Lösning
Tack för dina kommentarer!
single