Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Does this mean you could fit its whole working set in the cache hierarchy of a modern high end GPU getting near 100% ALU utilisation?


It streams the weights. This is going to be what limits performance, not alu utilization.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: