“When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation” has been accepted to ICML 2026 in Seoul! In this work, we examine how widely used AI benchmarks evolve over time and identify common saturation patterns that can limit their usefulness for evaluating continued model progress.