Despite the fact that their models are used in hiring, business, education, etc ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		matusp 8 months ago \| parent \| context \| favorite \| on: GPT-5: Key characteristics, pricing and system car... Despite the fact that their models are used in hiring, business, education, etc this multibillion company uses one benchmark with very artificial questions (BBQ) to evaluate how fair their model is. I am a little bit disappointed.

xmorse 8 months ago [–]

It's because these industries don't create their own benchmarks. The only ones creating evals are the AI company themselves or open source software engineers

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact