Weather Data: Complete and Ward Linkages
The last chart was good, but recall the K-Means and K-Medoids results: they contained at least one more line that, unlike all the others, goes downwards close to July. The average linkage in hierarchical clustering didn't catch that dynamic.
We saw that for the complete and ward linkages it makes sense to consider 4 clusters. Let's find out whether they catch that pattern.
- Import numpy with the np alias.
- Iterate over the linkages list. At each step:
  - Create a hierarchical clustering model named model with 4 clusters and the linkage method j.
  - Fit the numerical data of temp and predict the labels. Add the predicted labels as the 'prediction' column to temp.
  - Create a temp_res DataFrame with monthly averages for each group. To do so, group the values of temp by the 'prediction' column, calculate the mean, and then apply the .stack() method.
  - Add the 'method' column to the temp_res DataFrame, with the value j repeated as many times as there are rows in temp_res.
  - Merge the res and temp_res DataFrames using the .concat function of pd.
- Reassign the column names of res to ['Group', 'Month', 'Temp', 'Method'].
- Within the FacetGrid function, set the col parameter to 'Method'. This will build a separate chart for each value of the 'Method' column.
- Within the .map function, pass the seaborn line plot function as the first parameter.