Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Yes, but k-token lookup was already a thing with markov chains. Transformers are indeed better, but just because they model language distributions better than mostly-empty arrays of (token-count)^(context).


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: