Hacker News

Interesting/unfortunate/expected that GPT-5 isn't touted as AGI or some other outlandish claim. It's just improved reasoning etc. I know it's not the actual announcement and it's just a single page accidentally released, but it at least seems more grounded...? Have to wait and see what the actual announcement entails.


At this point it's pretty obvious that the easy scaling gains have been made already and AI labs are scrounging for tricks to milk out extra performance from their huge matrix product blobs:

- Reasoning, which is just very long inference coupled with RL

- Tool use, a.k.a. an LLM with glue code to call programs based on its output

- "Agents", a.k.a. LLMs with tools in a loop
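To make the last point concrete, here is a minimal sketch of the "tools in a loop" pattern. The model here is a stub standing in for a real LLM API call, and the dict-based tool-call format and tool names are assumptions for illustration, not any particular vendor's protocol:

```python
def calculator(expression: str) -> str:
    """A toy tool: evaluate a simple arithmetic expression."""
    return str(eval(expression, {"__builtins__": {}}))

TOOLS = {"calculator": calculator}

def fake_model(messages):
    """Stub LLM: requests the calculator once, then answers."""
    if not any(m["role"] == "tool" for m in messages):
        return {"tool": "calculator", "args": {"expression": "6 * 7"}}
    result = [m for m in messages if m["role"] == "tool"][-1]["content"]
    return {"answer": f"The result is {result}."}

def run_agent(user_input: str, model=fake_model, max_steps=5) -> str:
    messages = [{"role": "user", "content": user_input}]
    for _ in range(max_steps):           # the "loop"
        reply = model(messages)
        if "answer" in reply:            # model decided it is done
            return reply["answer"]
        tool = TOOLS[reply["tool"]]      # glue code: dispatch the tool call
        output = tool(**reply["args"])
        messages.append({"role": "tool", "content": output})
    raise RuntimeError("agent did not finish within max_steps")

print(run_agent("what is 6 * 7?"))  # → The result is 42.
```

Everything interesting still lives in the model; the "agent" is just dispatch-and-append glue around it.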

Those are pretty neat tricks, and not at all trivial to get actionable results from (from an engineering point of view), mind you. But the days of the qualitative intelligence leaps from GPT-2 to 3, or 3 to 4, are over. Sure, benchmarks do get saturated, but only at incredible cost, and it forces AI researchers to invent new "dimensions of scaling" as the ones they were previously banking on stall. And meanwhile it's all the same basic next-token-prediction blob running everything, just with a few optimizing tricks.

My hunch is that there won't be a wondrous, life-changing AGI (poorly defined anyway), just consolidation of existing gains (distillation, small language models, MoE, quality datasets, etc.) and a search for new dimensions and sources of data (biological data and 'sense data' for robotics come to mind).


This is the worst they’ll ever be! It’s not just going to be an ever slower asymptotic improvement that never quite manages to reach escape velocity but keeps costing orders of magnitude more to research, train, and operate….


I wonder whether the markets will crash if GPT-5 flops. It might be the model that cements the idea that, yes, we have hit a wall.


I'm the first to call out ridiculous behavior by AI companies, but short of something massively below expectations this can't be bad for OpenAI. GPT-5 is going to be positioned as a product for the general public first and foremost. Not everyone cares about coding benchmarks.


Llama 4 arguably destroyed Meta's LLM lab, and it wasn't even that bad of a model.


Did it? Could you summarize the highlights? Morale, brain drain, ...?


> massively below expectations

Well, the problem is that the expectations are already massive, mostly thanks to sama's strategy of attracting VC.


OpenAI's announcements are generally a lot more grounded than the hype surrounding them and their products.

e.g. if you look at Altman's blog post about "superintelligence in a few thousand days", what he actually wrote doesn't even disagree with LeCun (famously a naysayer) about the timeline.


A few thousand days is decades.


"A few thousand days" is a minimum of 5.5 years; LeCun has similar timelines.
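A quick check of that figure, taking "a few thousand" to mean at least 2,000 days (an assumption; the blog post gives no exact number):

```python
# Lower bound on "a few thousand days", converted to years.
days = 2000
years = days / 365.25  # average Gregorian year length in days
print(round(years, 1))  # → 5.5
```

So even the most aggressive reading of "a few thousand days" puts it well over five years out, and 3,000 or more days pushes it toward a decade.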


Yeah, I guess it wouldn't be that big, but it will have a lot of hype around it.

I doubt it can even beat Opus 4.1.



