scheduler_perf: define thresholds per test case and set up alerts for results #124774
Labels
kind/feature
Categorizes issue or PR as related to a new feature.
needs-triage
Indicates an issue or PR lacks a `triage/foo` label and requires one.
sig/scheduling
Categorizes an issue or PR as relevant to SIG Scheduling.
/kind feature
/sig scheduling
Discussion with sig-scalability: https://kubernetes.slack.com/archives/C09QZTRH7/p1715262959575039
What
We have scheduler-perf, and it'd be great if we could have an alert-ish stuff based on the result.
Based on the discussion with sig-scalability, the easiest way is to change scheduler_perf so that it can fail if the results show degradation, and monitor/alert the failures via testgrid.
"if the results show degradation" > for this, we probably have to define reasonable thresholds per test case.
Why
The current pain point is that perf-dash visualizes it, but no one actually doesn't care much, and consequently we've overlooked degradation several times actually.
The text was updated successfully, but these errors were encountered: