Compare Current Run Against Previous — check_benchmark_regression • bouncer

Checks whether the current metrics represent a regression relative to the previous run.

Usage

check_benchmark_regression(
  conn,
  step_name,
  format,
  current_metrics,
  lower_is_better = c("mlogloss", "logloss", "log_loss", "brier_score", "poisson_loss",
    "rmse", "mae"),
  regression_threshold = 2
)

Arguments

conn: DBI connection
step_name: Character. Pipeline step name
format: Character. Cricket format
current_metrics: Named list of current metric values
lower_is_better: Character vector of metric names where lower = better (default: common loss metrics)
regression_threshold: Numeric. Percentage threshold for flagging a regression (default 2: a metric that is 2% worse than in the previous run triggers a warning)
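
To make the interaction between lower_is_better and regression_threshold concrete, here is a minimal sketch of a direction-aware percentage comparison. It is not the package's implementation: the helper pct_worse, the metric values, and the choice of >= for the threshold test are all assumptions made for illustration.

```r
# Hypothetical helper, not part of bouncer: percentage by which `current`
# is worse than `previous`, taking the metric's direction into account.
pct_worse <- function(current, previous, lower_is_better) {
  change <- (current - previous) / abs(previous) * 100
  # For loss-type metrics an increase is worse; otherwise a decrease is worse.
  change * ifelse(lower_is_better, 1, -1)
}

lower_is_better <- c("mlogloss", "logloss", "log_loss", "brier_score",
                     "poisson_loss", "rmse", "mae")
regression_threshold <- 2

# Made-up values for illustration only.
previous_metrics <- list(logloss = 0.500, accuracy = 0.800)
current_metrics  <- list(logloss = 0.515, accuracy = 0.796)

metric_names <- names(current_metrics)
worse_by <- pct_worse(
  unlist(current_metrics),
  unlist(previous_metrics[metric_names]),
  metric_names %in% lower_is_better
)

# Assumption: a metric is flagged when it is at least
# `regression_threshold` percent worse than before.
flagged <- worse_by >= regression_threshold
```

With these values, logloss is about 3% worse and is flagged, while accuracy is about 0.5% worse and is not.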

Value

A list with three elements: is_regression (logical), details (data.frame), and messages (character vector).
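
A caller would typically branch on is_regression and surface the messages. The sketch below uses a hand-built stand-in for the return value so that it runs without a database; the helper report_regression, the column in details, and the message text are hypothetical and only mirror the documented shape.

```r
# Stand-in for a value returned by check_benchmark_regression().
# The `details` column and the message text are invented for illustration.
result <- list(
  is_regression = TRUE,
  details = data.frame(metric = "logloss", stringsAsFactors = FALSE),
  messages = "logloss is worse than in the previous run"
)

# Hypothetical helper: print each message when a regression was flagged
# and return the flag invisibly so callers can act on it.
report_regression <- function(result) {
  if (isTRUE(result$is_regression)) {
    for (msg in result$messages) message(msg)
  }
  invisible(isTRUE(result$is_regression))
}

regressed <- report_regression(result)
```

In a pipeline, the returned flag could decide whether to stop the run or merely log the messages; which is appropriate depends on how strictly regressions should be enforced.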