Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

$1.5kpm for SOTA. 128gb you run DSV4 Flash.
 help



What's the point of running it locally though? Inference for open models is quite cheap already. They could just selfhost, anyway. The experience of running LLMs locally will be excruciatingly bad in comparison at least for the near future.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: