
This should be easy to test: pick two similar models with different release dates (one before and one after ARC v2), and compare each with and without the new reasoning technique from the article.

