Aprenda Model Interpretation and Deployment

Deslize para mostrar o menu

Understanding how to interpret your predictive models and prepare them for deployment is a critical skill in any data science workflow. Model interpretation helps you explain the influence of different predictors on your model's outcomes, while deployment ensures your trained models are ready to make predictions on new, unseen data. Two common interpretation techniques are reviewing variable importance and examining model coefficients. Variable importance tells you which predictors most influence the model's predictions. Model coefficients, particularly in linear models, provide insight into the direction and magnitude of each variable's effect.


              123456789101112131415161718192021222324252627
            
options(crayon.enabled = FALSE)
library(tidymodels)
library(ggplot2)

# 1. Specify the linear regression model
lm_spec <- linear_reg() %>%
  set_engine("lm") %>%
  set_mode("regression")

# 2. Fit the model predicting mpg based on hp, wt, cyl, and disp
lm_fit <- lm_spec %>%
  fit(mpg ~ hp + wt + cyl + disp, data = mtcars)

# 3. Extract feature importance using tidy()
lm_importance <- tidy(lm_fit) %>%
  filter(term != "(Intercept)") %>%
  mutate(abs_estimate = abs(estimate)) %>%
  arrange(desc(abs_estimate))

# 4. Visualize the results
ggplot(lm_importance, aes(x = reorder(term, abs_estimate), y = abs_estimate)) +
  geom_col(fill = "steelblue") +
  coord_flip() +
  labs(title = "Feature Importance (Linear Regression)", 
       x = "Feature", 
       y = "Absolute Impact on MPG") +
  theme_minimal()

After building and interpreting your model, you need to ensure your work can be reused for future predictions. This involves saving the trained model object and any preprocessing steps, then loading them whenever new data arrives. The typical steps include:

Save both the trained model and the recipe used for preprocessing;
Load these objects into your R session when needed;
Use the bake() function from the recipes package to preprocess new data with the saved recipe;
Use the predict() function with your saved model to generate predictions on the new, preprocessed data.

This approach ensures your model's predictions remain consistent and reliable, even after deployment.

Tudo estava claro?

Obrigado pelo seu feedback!

Seção 1. Capítulo 9

Pergunte à IA

Pergunte o que quiser ou experimente uma das perguntas sugeridas para iniciar nosso bate-papo

Seção 1. Capítulo 9