-
Notifications
You must be signed in to change notification settings - Fork 179
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
chore: Improve quickstart #374
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A few minor nits.
|
||
- Automatic creation and synchronization of vector embeddings for your data | ||
- Seamless vector and semantic search | ||
- Retrieval Augmented Generation(RAG) directly in SQL |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- Retrieval Augmented Generation(RAG) directly in SQL | |
- Retrieval Augmented Generation (RAG) directly in SQL |
@@ -125,7 +134,9 @@ This section will walk you through the steps to get started with pgai and Ollama | |||
|
|||
1. **Search the embeddings** | |||
|
|||
We'll search the embeddings for the concept of "properties of light" | |||
We'll search the embeddings for the concept of "properties of light" even though these words are not in the text of the articles. This is only possible because of vector embeddings that capture the semantic meaning of the text. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We'll search the embeddings for the concept of "properties of light" even though these words are not in the text of the articles. This is only possible because of vector embeddings that capture the semantic meaning of the text. | |
We'll search the embeddings for the concept of "properties of light" even though these words are not in the text of the articles. This is possible because vector embeddings capture the semantic meaning of the text. |
``` | ||
And now you don't need to do anything to update the embeddings. The vectorizer will automatically create the embeddings for the new row with any intervention from you. After a few seconds, you can run the search query again to see the new embedding. | ||
And now you don't need to do anything to update the embeddings. The vectorizer will automatically create the embeddings for the new row with any intervention from you. After a few seconds, you can run the following search query to see the new embedding: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
And now you don't need to do anything to update the embeddings. The vectorizer will automatically create the embeddings for the new row with any intervention from you. After a few seconds, you can run the following search query to see the new embedding: | |
And now you don't need to do anything to update the embeddings. The vectorizer will automatically create the embeddings for the new row without any intervention from you. After a few seconds, you can run the following search query to see the new embedding: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good!
- I think we need to add somewhere early, what the auto-generated (in this case
wiki_embeddings
) view does. I don't know that it's something that's exactly analogous to other vector databases? I know other DBs store metadata along with chunks, but it might be nice to clarify early on what most of the queries are running against, e.g. most of the semantic search and RAG queries are against thewiki_embeddings
view.
```sql | ||
SELECT title, chunk | ||
FROM wiki_embeddings | ||
ORDER BY embedding <=> ai.ollama_embed('all-minilm', 'AI tools') |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe a SQL comment here to explain how this translates to seeing the latest embedding?
No description provided.