
>I'll throw in a free bonus no one has mentioned yet, at least publicly: you can take the average of the N embeddings forming a document to figure out if you should look at the N embeddings individually.

Hey, I do this! :D

But I've seen it mentioned in several places, so it wasn't completely my own idea.
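
For anyone who wants to try the trick from the quote, here's a rough sketch of how I'd picture it, not anyone's actual implementation. The names (`mean_embedding`, `search`, `doc_threshold`) and the 0.5 cutoff are made up for illustration, and it assumes embeddings are plain lists of floats compared with cosine similarity.

```python
import math

def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    if na == 0.0 or nb == 0.0:
        return 0.0
    return dot / (na * nb)

def mean_embedding(chunks):
    """Element-wise average of the N chunk embeddings of one document."""
    n = len(chunks)
    return [sum(col) / n for col in zip(*chunks)]

def search(query, docs, doc_threshold=0.5):
    """docs maps a document name to its list of chunk embeddings.

    Hypothetical two-stage lookup: compare the query to each document's
    averaged embedding first, and only score the individual chunks of
    documents that pass doc_threshold (an arbitrary illustrative cutoff).
    """
    hits = []
    for name, chunks in docs.items():
        if cosine(query, mean_embedding(chunks)) < doc_threshold:
            continue  # skip the N per-chunk comparisons for this document
        for i, chunk in enumerate(chunks):
            hits.append((cosine(query, chunk), name, i))
    return sorted(hits, reverse=True)

docs = {
    "a": [[1.0, 0.0], [0.9, 0.1]],
    "b": [[0.0, 1.0], [0.1, 0.9]],
}
hits = search([1.0, 0.0], docs)
```

One caveat: a document with a single relevant chunk among many unrelated ones can fall below the cutoff on its average, so the threshold trades recall for speed. In practice you would precompute the averages once rather than on every query.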



This reminds me a lot of simhashing, e.g. https://matpalm.com/resemblance/simhash/
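
For comparison, a minimal sketch of the classic token-based simhash, assuming MD5 as the per-token hash and a 64-bit fingerprint (both are arbitrary choices of mine, not taken from the linked post). The resemblance to averaging embeddings is that many per-feature vectors get summed into one summary, which is then compared cheaply.

```python
import hashlib

def simhash(tokens, bits=64):
    """Collapse a list of string tokens into one `bits`-wide fingerprint.

    Each token is hashed; every bit position votes +1 (bit set) or -1
    (bit clear), and the fingerprint keeps the bits with a positive total.
    `bits` must be a multiple of 8 and at most 128 because MD5 is used here.
    """
    counts = [0] * bits
    for tok in tokens:
        digest = hashlib.md5(tok.encode("utf-8")).digest()
        h = int.from_bytes(digest[: bits // 8], "big")
        for i in range(bits):
            counts[i] += 1 if (h >> i) & 1 else -1
    return sum(1 << i for i, c in enumerate(counts) if c > 0)

def hamming(a, b):
    """Number of differing bits between two fingerprints."""
    return bin(a ^ b).count("1")

doc1 = "the quick brown fox jumps over the lazy dog".split()
doc2 = "the quick brown fox jumped over the lazy dog".split()
distance = hamming(simhash(doc1), simhash(doc2))
```

Similar token sets tend to give fingerprints with a small Hamming distance, so the fingerprint can serve as a cheap first-pass filter before a more expensive comparison.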



