
I think the promise, back when all the separate reasoning / multimodal models were out, was that GPT-5 would be the model to bring it all together (which mostly comes down to audio/video, since o3/o4 do images really well).

