Replies: 1 comment 1 reply
-
You can get Vector DB support with Semantic Kernel. I'm working on a pipeline that processes documents so you can chat about the documentation with references to the sources, and it works. In my experience, though, I get far worse results than doing the same with OpenAI. The current state of llama.cpp doesn't seem to be the right foundation for a great RAG system: the best open models we could use seem to be BGE or something similar, and while llama.cpp appears to be working on support for those models, it isn't done yet. Right now it's possible to do what you're asking, but you will get worse results because the best massive-text-embedding models aren't supported.
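Until llama.cpp supports them, one workaround is to compute BGE embeddings outside llama.cpp with sentence-transformers. A minimal sketch, assuming the `sentence-transformers` package and the `BAAI/bge-small-en-v1.5` model id (both assumptions on my part, not something confirmed in this thread):

```python
# Sketch: compute BGE embeddings with sentence-transformers (outside llama.cpp)
# and run a brute-force cosine-similarity search over a few documents.
from sentence_transformers import SentenceTransformer
import numpy as np

model = SentenceTransformer("BAAI/bge-small-en-v1.5")  # assumed model id

docs = [
    "Semantic Kernel lets you plug in different vector stores.",
    "llama.cpp runs quantized LLaMA-family models locally.",
]
doc_vecs = model.encode(docs, normalize_embeddings=True)

query = "Which tool runs models locally?"
q_vec = model.encode([query], normalize_embeddings=True)[0]

# With unit-normalized vectors, the dot product equals cosine similarity.
scores = doc_vecs @ q_vec
best = int(np.argmax(scores))
print(f"Best match ({scores[best]:.3f}): {docs[best]}")
```

In a real pipeline you would store `doc_vecs` in a vector DB instead of searching in memory, but the embedding step stays the same.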
-
The project looks really good, and thanks for sharing it.
I'm using Python + LangChain, and I have a question: does this project support Vector DB search, with the results sent to a (local) LLM to generate answers? It would be great to see a how-to example.
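For reference, a minimal end-to-end sketch of that flow with LangChain: embed documents into a FAISS store, retrieve the relevant ones, and answer with a llama.cpp-backed local model. This assumes 0.0.x-era LangChain import paths, the `faiss-cpu`, `llama-cpp-python`, and `sentence-transformers` packages, and a hypothetical local GGUF model path; it's a sketch, not this project's confirmed API.

```python
# Sketch of vector-DB retrieval + a local llama.cpp model with LangChain.
# Import paths assume a langchain 0.0.x release; adjust for newer versions.
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.vectorstores import FAISS
from langchain.llms import LlamaCpp
from langchain.chains import RetrievalQA

# 1. Embed your documents into a vector store.
texts = ["Doc one about the API.", "Doc two about deployment."]
embeddings = HuggingFaceEmbeddings(
    model_name="sentence-transformers/all-MiniLM-L6-v2"
)
db = FAISS.from_texts(texts, embeddings)

# 2. Point LangChain at a model file served by llama.cpp.
llm = LlamaCpp(
    model_path="models/llama-2-7b.Q4_K_M.gguf",  # hypothetical path
    n_ctx=2048,
)

# 3. Wire retrieval and generation together.
qa = RetrievalQA.from_chain_type(llm=llm, retriever=db.as_retriever())
print(qa.run("What does the documentation say about deployment?"))
```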