4 Interpretability Analysis

4.1 SHAP + Logistic Regression

Initialize a DALEX explainer for the trained logistic regression model by converting the binary target into 0/1 and linking the predictor data—so that we can later compute SHAP‐style explanations.

Code

train_data <- readRDS("train_data.rds")
test_data  <- readRDS("test_data.rds")
lr_model     <- readRDS("lr_model.rds")
pruned_tree  <- readRDS("pruned_tree.rds")
model_weights <- readRDS("model_weights.rds")
library(DALEX)
library(ggplot2)  
library(lime)
library(dplyr)
library(knitr)
library(kableExtra)

y_binary <- ifelse(train_data$CreditRisk == "Bad", 1, 0)
explainer_lr <- DALEX::explain(
  model   = lr_model,
  data    = train_data[, setdiff(names(train_data), "CreditRisk")],
  y       = y_binary,
  label   = "Logistic Regression",
  verbose = FALSE
)

4.1.1 Global SHAP‐style Importance (Permutation)

Code

vi_lr <- DALEX::model_parts(
  explainer = explainer_lr,
  type      = "variable_importance",
  B         = 100
)
plot(vi_lr) +
  ggtitle("Logistic Regression Variable Importance (Permutation)") +
  theme_minimal()

Permutation-based variable importance was used to assess which predictors most influence our logistic regression credit‐risk model by measuring the drop in AUC when each feature is shuffled; when plotted, “Status” clearly causes the largest loss (around 0.85 → 0.75), followed by “CreditHistory,” “Purpose,” and “Duration,” whereas features like “Telephone,” “Dependents,” and “Housing” sit near the top with almost no AUC loss, indicating minimal importance—this ordering highlights that an applicant’s status and credit history are the key drivers of model performance, while peripheral attributes contribute little.

4.1.2 Local SHAP for a Single Observation

Code

shap_local_lr <- DALEX::predict_parts(
  explainer       = explainer_lr,
  new_observation = test_data[1, setdiff(names(test_data), "CreditRisk"), drop = FALSE],
  type            = "shap"
)
plot(shap_local_lr) +
  ggtitle("Local SHAP Explanation (Test Sample 1)") +
  theme_minimal()

Local SHAP values were computed to understand how each feature in our logistic regression model pushes the prediction for a single test case toward “Bad” (positive contribution) or “Good” (negative contribution) credit risk; for this sample, a high credit history level (“CreditHistory = L5”), strong savings (“Savings = L5”), longer duration, older age, and basic property (“Property = L1”) all increase the likelihood of a “Bad” label, while poor status (“Status = L1”), a moderate number of existing credits (“ExistingCreditsCount = L2”), and a high installment rate (“InstallmentRate = L4”) counteract that by pushing the prediction toward “Good.”

4.2 Configure LIME for Decision Tree

Define the model_type and predict_model methods so that LIME knows how to call our pruned rpart tree and obtain class‐probability outputs.

Code

library(lime)
model_type.rpart <- function(x, ...) "classification"
predict_model.rpart <- function(model, newdata, ...) {
  as.data.frame(predict(model, newdata = newdata, type = "prob"))
}

4.2.1 Local LIME Explanation for Decision Tree

Code

lime_explainer_tree <- lime(
  x     = train_data[, setdiff(names(train_data), "CreditRisk")],
  model = pruned_tree,
  bin_continuous = TRUE
)
lime_explanation_tree <- lime::explain(
  x         = test_data[1, setdiff(names(test_data), "CreditRisk"), drop = FALSE],
  explainer = lime_explainer_tree,
  n_features = 5,
  n_labels   = 1
)
plot_features(lime_explanation_tree) +
  ggtitle("LIME Explanation for Decision Tree (Test Sample 1)") +
  theme_minimal()

LIME was applied to the pruned decision tree for a single test case to show which features support or contradict its “Good” prediction—here, a low application status (Status = L1) strongly contradicts the Good outcome, while strong savings (Savings = L5), shorter loan duration (Duration ≤ 12), excellent credit history (CreditHistory = L5), and housing status (Housing = L2) all contribute in favor of predicting “Good”.

4.3 Global LIME Analysis for Logistic Regression

Retrain the logistic model (after dropping a column), register it with LIME, and generate explanations across the first 100 test instances to capture global patterns in feature contributions.

Code

library(dplyr)
train_data <- train_data %>% select(-Dependents)
test_data  <- test_data  %>% select(-Dependents)
lr_model   <- glm(
  CreditRisk ~ .,
  data   = train_data,
  family = binomial,
  weights = model_weights
)

model_type.glm <- function(x, ...) "classification"
predict_model.glm <- function(model, newdata, ...) {
  p <- predict(model, newdata = newdata, type = "response")
  data.frame(Bad = p, Good = 1 - p)
}

set.seed(2023)
train_features <- train_data %>% select(-CreditRisk)
test_features  <- test_data  %>% select(-CreditRisk)
lime_explainer  <- lime(
  x              = train_features,
  model          = lr_model,
  bin_continuous = TRUE,
  quantile_bins  = FALSE,
  n_bins         = 5
)
lime_explanations <- lime::explain(
  x             = test_features[1:100, ],
  explainer     = lime_explainer,
  n_labels      = 1,
  n_features    = 10,
  n_permutations = 5000,
  distance_method = "gower",
  kernel_width    = 0.75
)

4.3.1 Visualize global LIME importance and distributions

Code

global_importance <- lime_explanations %>%
  group_by(feature) %>%
  summarise(
    Avg_Weight = mean(abs(feature_weight)),
    Frequency  = n() / nrow(test_features[1:100, ]),
    .groups    = "drop"
  ) %>%
  arrange(desc(Avg_Weight))
ggplot(global_importance, aes(x = reorder(feature, Avg_Weight), y = Avg_Weight, fill = Frequency)) +
  geom_col(width = 0.7) +
  scale_fill_gradient(low = "#FEE6CE", high = "#E6550D") +
  labs(title = "Global Feature Importance via LIME", x = "Feature", y = "Avg |Weight|") +
  coord_flip() +
  theme_minimal() +
  theme(plot.title = element_text(face = "bold", size = 14))

Global LIME importance was calculated by averaging each feature’s absolute local weight across 100 test cases—plotted here as horizontal bars colored by how often each feature appeared—revealing that being a “ForeignWorker,” application “Status,” and “CreditHistory” drive the model most strongly, while factors like “ResidenceDuration” and “Age” contribute least.

4.3.2 Top 5 Most Influential Features

Feature	Importance	Frequency
ForeignWorker	0.361	1.00
Status	0.201	0.91
CreditHistory	0.120	0.90
Duration	0.120	0.60
Purpose	0.118	0.57

4.3.3 Impact Direction (% Positive vs Negative)

Code

impact_tbl <- lime_explanations %>%
  group_by(feature) %>%
  summarise(
    Positive = mean(feature_weight > 0) * 100,
    Negative = mean(feature_weight < 0) * 100,
    .groups  = "drop"
  ) %>%
  arrange(desc(Positive)) %>%
  slice_head(n = 5) %>%
  rename(Feature = feature)

impact_tbl %>%
  kable(
    format  = "html",
    digits  = 1,
    align   = "c"
  ) %>%
  kable_styling(
    full_width        = FALSE,
    bootstrap_options = c("striped", "hover", "condensed", "responsive")
  )

Feature	Positive	Negative
Duration	81.7	18.3
Property	78.4	21.6
OtherDebtors	72.0	28.0
Status	65.9	34.1
Purpose	64.9	35.1

The results show that foreign-worker status, application status, and credit history strongly affect credit risk predictions. However, practical factors like loan duration, having no collateral, and existing debts clearly signal higher risk. This means lenders should carefully consider these practical financial details when reviewing loan applications.