The degradation may be more significant within the day than at the same time eve... | Hacker News


seunosewa 26 days ago | on: Claude Code daily benchmarks for degradation track...

The degradation may be more significant within the day than at the same time every day.

GoatInGrey 26 days ago

Sure, but it's still useful insight to see how it performs over time. Of course, cynically, Anthropic could game the benchmark by routing this benchmark's specific prompts to an unadulterated instance of the model.
