Skip to contents

Checks if current metrics represent a regression from the last run.

Usage

check_benchmark_regression(
  conn,
  step_name,
  format,
  current_metrics,
  lower_is_better = c("mlogloss", "logloss", "log_loss", "brier_score", "poisson_loss",
    "rmse", "mae"),
  regression_threshold = 2
)

Arguments

conn

DBI connection

step_name

Character. Pipeline step name

format

Character. Cricket format

current_metrics

Named list of current metric values

lower_is_better

Character vector of metric names where lower = better (default: common loss metrics)

regression_threshold

Numeric. Percentage threshold to flag regression (default 2 = 2% worse triggers warning)

Value

List with is_regression (logical), details (data.frame), messages (character vector)