---
title: "Tidymodels Workflow with Functional Keras Models (Multi-Input)"
output: rmarkdown::html_vignette
vignette: >
  %\VignetteIndexEntry{Tidymodels Workflow with Functional Keras Models (Multi-Input)}
  %\VignetteEngine{knitr::rmarkdown}
  %\VignetteEncoding{UTF-8}
---

```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  eval = FALSE # Set to TRUE to run all chunks when knitting
)
# Suppress verbose Keras output for the vignette
options(keras.fit_verbose = 0)
set.seed(123)
```

## Introduction

This vignette demonstrates a complete `tidymodels` workflow for a regression task using a Keras functional model defined with `kerasnip`. We will use the Ames Housing dataset to predict house prices. A key feature of this example is the use of a multi-input Keras model, where numerical and categorical features are processed through separate input branches.

`kerasnip` allows you to define complex Keras architectures, including those with multiple inputs, and integrate them seamlessly into the `tidymodels` ecosystem for robust modeling and tuning.

## Setup

First, we load the necessary packages.

```{r load-packages}
library(kerasnip)
library(tidymodels)
library(keras3)
library(dplyr)    # For data manipulation
library(ggplot2)  # For plotting
library(future)   # For parallel processing
library(finetune) # For racing
```

## Data Preparation

We'll use the Ames Housing dataset from the `modeldata` package, which is loaded automatically with `tidymodels`. We select a subset of predictors, then split the data into training and testing sets and create cross-validation folds for tuning.

```{r data-prep}
# Select relevant columns and remove rows with missing values
ames_df <- ames |>
  select(
    Sale_Price, Gr_Liv_Area, Year_Built, Neighborhood,
    Bldg_Type, Overall_Cond, Total_Bsmt_SF, contains("SF")
  ) |>
  na.omit()

# Split data into training and testing sets
set.seed(123)
ames_split <- initial_split(ames_df, prop = 0.8, strata = Sale_Price)
ames_train <- training(ames_split)
ames_test <- testing(ames_split)

# Create cross-validation folds for tuning
ames_folds <- vfold_cv(ames_train, v = 5, strata = Sale_Price)
```

## Recipe for Preprocessing

We create a `recipes` object to preprocess the data. This recipe will:

* Predict `Sale_Price` using all other variables.
* Normalize all numeric predictors.
* Create dummy variables for the categorical predictors.
* Collapse each group of related predictors into a single matrix column with `step_collapse()`.

The collapse step is crucial for the multi-input Keras model: the `kerasnip` functional API expects a list of matrices, one per input layer, so each collapsed column supplies the data for one distinct Keras input. The quick check after the recipe below shows the resulting structure.

```{r create-recipe}
ames_recipe <- recipe(Sale_Price ~ ., data = ames_train) |>
  step_normalize(all_numeric_predictors()) |>
  step_collapse(all_numeric_predictors(), new_col = "numerical_input") |>
  step_dummy(Neighborhood) |>
  step_collapse(starts_with("Neighborhood"), new_col = "neighborhood_input") |>
  step_dummy(Bldg_Type) |>
  step_collapse(starts_with("Bldg_Type"), new_col = "bldg_input") |>
  step_dummy(Overall_Cond) |>
  step_collapse(starts_with("Overall_Cond"), new_col = "condition_input")

# You can prep and bake the recipe to see the processed data
# prep(ames_recipe) |> bake(new_data = ames_train)
```
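
To make the expected structure concrete, here is a minimal check (a sketch; it assumes `step_collapse()` yields ordinary matrix columns, as described above). Each `*_input` column of the baked data should be a single matrix column whose width equals the number of predictors collapsed into it.

```{r check-baked-structure}
# Sketch: each collapsed `*_input` column should be a matrix column,
# corresponding to one Keras input layer
baked <- prep(ames_recipe) |> bake(new_data = ames_train)
sapply(baked, function(col) paste(class(col), collapse = "/"))
dim(baked$numerical_input) # rows x number of collapsed numeric predictors
```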

## Define Keras Functional Model with `kerasnip`

Now, we define our Keras functional model using `kerasnip`'s layer blocks. This model will have four distinct input layers: one for numerical features and three for categorical features. These branches will be processed separately and then concatenated before the final output layer.

```{r define-kerasnip-model}
# Define layer blocks for the multi-input functional model

# Input blocks for numerical and categorical features
input_numerical <- function(input_shape) {
  layer_input(shape = input_shape, name = "numerical_input")
}

input_neighborhood <- function(input_shape) {
  layer_input(shape = input_shape, name = "neighborhood_input")
}

input_bldg <- function(input_shape) {
  layer_input(shape = input_shape, name = "bldg_input")
}

input_condition <- function(input_shape) {
  layer_input(shape = input_shape, name = "condition_input")
}

# Processing blocks for each input type
dense_numerical <- function(tensor, units = 32, activation = "relu") {
  tensor |>
    layer_dense(units = units, activation = activation)
}

dense_categorical <- function(tensor, units = 16, activation = "relu") {
  tensor |>
    layer_dense(units = units, activation = activation)
}

# Concatenation block
concatenate_features <- function(numeric, neighborhood, bldg, condition) {
  layer_concatenate(list(numeric, neighborhood, bldg, condition))
}

# Output block for regression
output_regression <- function(tensor) {
  layer_dense(tensor, units = 1, name = "output")
}

# Create the kerasnip model specification function
create_keras_functional_spec(
  model_name = "ames_functional_mlp",
  layer_blocks = list(
    numerical_input = input_numerical,
    neighborhood_input = input_neighborhood,
    bldg_input = input_bldg,
    condition_input = input_condition,
    processed_numerical = inp_spec(dense_numerical, "numerical_input"),
    processed_neighborhood = inp_spec(dense_categorical, "neighborhood_input"),
    processed_bldg = inp_spec(dense_categorical, "bldg_input"),
    processed_condition = inp_spec(dense_categorical, "condition_input"),
    combined_features = inp_spec(
      concatenate_features,
      c(
        processed_numerical = "numeric",
        processed_neighborhood = "neighborhood",
        processed_bldg = "bldg",
        processed_condition = "condition"
      )
    ),
    output = inp_spec(output_regression, "combined_features")
  ),
  mode = "regression"
)

# Clean up the spec when the vignette is done knitting
on.exit(remove_keras_spec("ames_functional_mlp"), add = TRUE)
```

## Model Specification

We'll define our `ames_functional_mlp` model specification and set some hyperparameters to `tune()`. Note how the arguments are prefixed with their corresponding block names (e.g., `processed_numerical_units`).

```{r define-tune-spec}
# Define the tunable model specification
functional_mlp_spec <- ames_functional_mlp(
  # Tunable parameters for the numerical branch
  processed_numerical_units = tune(),
  # Tunable parameters for the categorical branches
  processed_neighborhood_units = tune(),
  processed_bldg_units = tune(),
  processed_condition_units = tune(),
  # Fixed compilation and fitting parameters
  compile_loss = "mean_squared_error",
  compile_optimizer = "adam",
  compile_metrics = c("mean_absolute_error"),
  fit_epochs = 50,
  fit_batch_size = 32,
  fit_validation_split = 0.2,
  fit_callbacks = list(
    callback_early_stopping(monitor = "val_loss", patience = 5)
  )
) |>
  set_engine("keras")

print(functional_mlp_spec)
```

## Create Workflow

A `workflow` combines the recipe and the model specification.

```{r create-workflow}
ames_wf <- workflow() |>
  add_recipe(ames_recipe) |>
  add_model(functional_mlp_spec)

print(ames_wf)
```

## Define Tuning Grid

We create a regular grid over the tunable hyperparameters. With four parameters at three levels each, this yields 3^4 = 81 candidate models; the racing method used in the next section eliminates poor candidates early, so most are never fit on every fold.

```{r create-tuning-grid}
# Define the tuning grid
params <- extract_parameter_set_dials(ames_wf) |>
  update(
    processed_numerical_units = hidden_units(range = c(32, 128)),
    processed_neighborhood_units = hidden_units(range = c(16, 64)),
    processed_bldg_units = hidden_units(range = c(16, 64)),
    processed_condition_units = hidden_units(range = c(16, 64))
  )
functional_mlp_grid <- grid_regular(params, levels = 3)

print(functional_mlp_grid)
```

## Tune Model

Now we use `tune_race_anova()` from the `finetune` package to evaluate the candidates with cross-validation, discarding clearly inferior hyperparameter combinations after each resampling stage.

```{r tune-model}
# Note: Parallel processing with `plan(multisession)` is currently not working
# with Keras models due to backend conflicts.
# plan(multisession)

set.seed(123)
ames_tune_results <- tune_race_anova(
  ames_wf,
  resamples = ames_folds,
  grid = functional_mlp_grid,
  metrics = metric_set(rmse, mae, rsq), # Evaluate regression metrics
  control = control_race(save_pred = TRUE, save_workflow = TRUE)
)
```

## Inspect Tuning Results

We can inspect the tuning results to see which hyperparameter combinations performed best.

```{r inspect-results}
# Show the best performing models based on RMSE
show_best(ames_tune_results, metric = "rmse", n = 5)

# Autoplot the results
# autoplot(ames_tune_results) # Currently does not work due to a label issue.

# Select the best hyperparameters
best_functional_mlp_params <- select_best(ames_tune_results, metric = "rmse")
print(best_functional_mlp_params)
```
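
Because we tuned with a racing method, `finetune::plot_race()` offers another view of the results, showing which candidates survived each elimination stage (a sketch; it assumes the racing plot is unaffected by the label issue noted above).

```{r plot-race}
# One line per candidate; lines that stop early were eliminated during racing
plot_race(ames_tune_results)
```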

## Finalize Workflow and Fit Model

Once we have the best hyperparameters, we finalize the workflow and fit the model on the entire training dataset.

```{r finalize-fit}
# Finalize the workflow with the best hyperparameters
final_ames_wf <- finalize_workflow(ames_wf, best_functional_mlp_params)

# Fit the final model on the full training data
final_ames_fit <- fit(final_ames_wf, data = ames_train)

print(final_ames_fit)
```

### Inspect Final Model

You can extract the underlying Keras model and its training history for further inspection.

```{r inspect-final-keras-model-summary}
# Print the Keras model summary
final_ames_fit |>
  extract_fit_parsnip() |>
  extract_keras_model() |>
  summary()
```

```{r inspect-final-keras-model-plot}
# Plot the Keras model architecture
final_ames_fit |>
  extract_fit_parsnip() |>
  extract_keras_model() |>
  plot()
```

```{r inspect-final-keras-model-history}
# Plot the training history
final_ames_fit |>
  extract_fit_parsnip() |>
  extract_keras_history() |>
  plot()
```

## Make Predictions and Evaluate

Finally, we make predictions on the test set and evaluate the model's performance.

```{r predict-evaluate}
# Make predictions on the test set
ames_test_pred <- predict(final_ames_fit, new_data = ames_test)

# Combine predictions with the observed values
ames_results <- tibble::tibble(
  Sale_Price = ames_test$Sale_Price,
  .pred = ames_test_pred$.pred
)

print(head(ames_results))

# Evaluate performance using yardstick metrics
reg_metrics <- metric_set(rmse, mae, rsq)
metrics_results <- reg_metrics(ames_results, truth = Sale_Price, estimate = .pred)

print(metrics_results)
```
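
Since `ggplot2` is loaded above, a predicted-versus-observed plot gives a quick visual check of fit quality; a minimal sketch using the `ames_results` tibble built in the previous chunk:

```{r plot-predictions}
# Predicted vs. observed sale prices; points near the dashed diagonal
# indicate accurate predictions
ggplot(ames_results, aes(x = Sale_Price, y = .pred)) +
  geom_point(alpha = 0.4) +
  geom_abline(linetype = "dashed", color = "red") +
  labs(
    x = "Observed Sale Price",
    y = "Predicted Sale Price",
    title = "Predicted vs. Observed Sale Prices (Test Set)"
  )
```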