We saw interesting developments around vector databases, but then the hype died down once it became clear you could just store the vectors in ordinary databases without much real difference. I wonder what will happen when the models can freely access them, though.
I really don't understand how people figure out which vectors to actually store in these databases, regardless of the underlying storage model.
Isn't that itself the province of an LLM? Say I have a bunch of text. How do I store it so I can search it "by similarity"? I remember semantic search with Sphinx being hard, and Facebook had Faiss. And now we're supposed to just save vectors on commodity hardware BEFORE using an LLM?
1. Take a bunch of text and run it through an LLM in embedding mode; the LLM turns the text into a vector. If the text is longer than the model's context window, chunk it first.
2. Store the vector in a vector DB.
3. Use the LLM to generate a vector of your question.
4. Query the vector DB for the most similar vectors (as many as will fit in the context window).
5. Get the text behind those vectors and concatenate it with the question from step 3.
6. Step 5 is your prompt. The LLM can now answer your question with a collection of similar/relevant text already sitting in its context window alongside the question. (A rough sketch of the whole flow follows below.)
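A minimal sketch of those steps, assuming sentence-transformers for the embeddings and FAISS as the "vector DB" (any embedding model and vector store would do); the model name, chunk size, and corpus here are placeholders, not a recommendation:

```python
# Minimal RAG sketch: embed text chunks, index them, retrieve the nearest
# chunks for a question, and build a prompt for an LLM.
import faiss
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # any embedding model works

def chunk(text, size=500):
    # Step 1: naive fixed-size chunking so each piece fits the model's input limit.
    return [text[i:i + size] for i in range(0, len(text), size)]

docs = ["...your corpus of text...", "...more of your text..."]
chunks = [c for d in docs for c in chunk(d)]

# Steps 1-2: embed every chunk and store the vectors in a FAISS index.
vecs = model.encode(chunks, normalize_embeddings=True)
index = faiss.IndexFlatIP(vecs.shape[1])  # inner product == cosine on normalized vectors
index.add(np.asarray(vecs, dtype="float32"))

# Steps 3-4: embed the question and pull the k most similar chunks.
question = "What does the corpus say about X?"
q_vec = model.encode([question], normalize_embeddings=True)
k = min(3, index.ntotal)
_, ids = index.search(np.asarray(q_vec, dtype="float32"), k)

# Steps 5-6: concatenate the retrieved text with the question; that's the prompt.
context = "\n\n".join(chunks[i] for i in ids[0])
prompt = f"Answer using the context below.\n\nContext:\n{context}\n\nQuestion: {question}"
print(prompt)  # send this to your LLM of choice
```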
You don't even need an LLM. You can use Word2Vec, or even yank the embedding matrix from the bottom layer of an LLM. For images there's CLIP and BLIP, and there are similar models (like CLAP) for audio.
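For the no-LLM route, a rough sketch using gensim's Word2Vec and averaging word vectors into a crude document vector (the toy corpus and parameters are just placeholders):

```python
# Sentence vectors without an LLM: average Word2Vec word vectors per sentence.
import numpy as np
from gensim.models import Word2Vec

# Toy corpus; in practice you'd train on (or load vectors from) something much larger.
corpus = [
    "vector databases store embeddings".split(),
    "embeddings capture text similarity".split(),
    "llms answer questions from context".split(),
]
w2v = Word2Vec(sentences=corpus, vector_size=50, min_count=1, epochs=100)

def embed(tokens):
    # Average the word vectors we have; crude, but enough for a similarity search.
    vecs = [w2v.wv[t] for t in tokens if t in w2v.wv]
    return np.mean(vecs, axis=0)

def cosine(a, b):
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

query = embed("text similarity search".split())
ranked = sorted(range(len(corpus)), key=lambda i: -cosine(query, embed(corpus[i])))
print(ranked)  # sentence indices, most similar to the query first
```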