
Someone on HN pointed out to me that GPT-4 and GPT-3 should have a similar parameter count. This is based on the inference times of GPT-4 vs. GPT-3.5 before the speedup (the distilled version was only used after the speedup, in the Turbo version).

