$1.5kpm for SOTA. 128gb you run DSV4 Flash. | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		sourcecodeplz 1 day ago \| parent \| context \| favorite \| on: Uber's $1,500/month AI limit is a useful signal fo... $1.5kpm for SOTA. 128gb you run DSV4 Flash.
		help

pqtyw 1 day ago [–]

What's the point of running it locally though? Inference for open models is quite cheap already. They could just selfhost, anyway. The experience of running LLMs locally will be excruciatingly bad in comparison at least for the near future.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact