
but then it will either overfit or you need to train it on 20 times the amount of data ...


I'm talking about using an LLM, which doesn't involve training and thus no overfitting.


For an LLM to exhibit a verbal relationship between counting and tokens, you have to train it on that. Maybe you mean something like a plugin or extension, but that's something else and has nothing to do with LLMs specifically.



