Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

No, like the other comment said, it's just using the `n` parameter in an OpenAI style API. For example, vLLM and llamacpp have support for it.


Ah, it's the same model, multiple runs, then? Not actually N different models?


Correct.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: