AlloyDB Semantic Product Search

A demonstration of semantic search capabilities using local embeddings with Google's Gemini models and Aiven for AlloyDB Omni.

Overview

This application demonstrates how to implement semantic search for product data stored in AlloyDB. Instead of relying on AlloyDB's built-in integration with Google's Vertex AI, this application generates embeddings locally using the Gemini API and stores them in the database.

Features

Semantic Product Search: Search for products using natural language queries
Add Products: Add new products to the database with automatically generated embeddings
Update Embeddings: Generate or update embeddings for existing products

Setup

Prerequisites

Python 3.8+
An AlloyDB Omni instance (e.g., through Aiven)
A Google API key with access to the Gemini API

Installation

Clone this repository
Install the required packages:
```
pip install -r requirements.txt
```

Configuration

Create a .env file in the root directory with the following variables:

ALLOY_DB_USER=your_db_user
ALLOY_DB_PASSWORD=your_db_password
ALLOY_DB_HOST=your_db_host
ALLOY_DB_PORT=your_db_port
GEMINI_API_KEY=your_gemini_api_key

Create a .streamlit directory and inside it create a secrets.toml file with the following content:

[connections.postgresql]
dialect = "postgresql"
host = "your_db_host"
port = 12345  # Your DB port
database = "defaultdb"
username = "your_db_user"
password = "your_db_password"

Running the Application

Launch the Streamlit application:

streamlit run modified_search.py

Troubleshooting

If you encounter any issues:

Enable Debug Mode in the sidebar to see detailed error messages
Run the simple_test.py script to verify your database connection:
```
python simple_test.py
```
Make sure all environment variables are correctly set

Database Schema

The application expects a product table with the following schema:

CREATE TABLE product (
    id SERIAL PRIMARY KEY,
    title VARCHAR(255) NOT NULL,
    description TEXT,
    price NUMERIC(10, 2),
    images TEXT,
    emb REAL[]
);

Implementation Details

Embeddings are generated using Google's Gemini models/embedding-001 model
Embeddings are stored in the emb column as arrays of floating-point numbers
Similarity is calculated using cosine similarity between the query embedding and product embeddings

Note on Streamlit Version

This application uses st.experimental_connection() which is compatible with Streamlit version 1.27.2. If you have a newer version of Streamlit that supports st.connection(), you may need to update the code accordingly.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.streamlit		.streamlit
pages		pages
.env-example		.env-example
.env.example		.env.example
.gitignore		.gitignore
1-enable-pg-vector.js		1-enable-pg-vector.js
2-create-features-table.js		2-create-features-table.js
3-store-knowledge-demo.js		3-store-knowledge-demo.js
3-store-knowledge.js		3-store-knowledge.js
4-vector-search-knowledge-demo.js		4-vector-search-knowledge-demo.js
4-vector-search-knowledge.js		4-vector-search-knowledge.js
5-rag-search-knowledge-demo.js		5-rag-search-knowledge-demo.js
5-rag-search-knowledge.js		5-rag-search-knowledge.js
Makefile		Makefile
README-VECTOR-SEARCH.md		README-VECTOR-SEARCH.md
README-semantic-search.md		README-semantic-search.md
README.md		README.md
Search.py		Search.py
Update_Embeddings.py		Update_Embeddings.py
add.py		add.py
add_sample_data.py		add_sample_data.py
add_sample_data_with_local_embeddings.py		add_sample_data_with_local_embeddings.py
add_test_product.py		add_test_product.py
check-database-content.js		check-database-content.js
check-env.js		check-env.js
config.js		config.js
connect-and-setup.sh		connect-and-setup.sh
convert_to_vector.py		convert_to_vector.py
create-product-table.sql		create-product-table.sql
db_connection_test.py		db_connection_test.py
features.json		features.json
generate_embeddings.py		generate_embeddings.py
home_depot_data_1_2021_12.csv		home_depot_data_1_2021_12.csv
modified_add.py		modified_add.py
modified_search.py		modified_search.py
package-lock.json		package-lock.json
package.json		package.json
requirements.txt		requirements.txt
run_app.sh		run_app.sh
run_app_debug.sh		run_app_debug.sh
run_demo.sh		run_demo.sh
run_modified.sh		run_modified.sh
setup_and_run.sh		setup_and_run.sh
simple_streamlit_app.py		simple_streamlit_app.py
simple_streamlit_test.py		simple_streamlit_test.py
simple_test.py		simple_test.py
streamlit_db_test.py		streamlit_db_test.py
test-db-connection.js		test-db-connection.js
test-gemini-detailed.js		test-gemini-detailed.js
test-gemini-embeddings.py		test-gemini-embeddings.py
test-gemini.js		test-gemini.js
test_db_connection.py		test_db_connection.py
test_local_embedding.py		test_local_embedding.py
test_write.py		test_write.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AlloyDB Semantic Product Search

Overview

Features

Setup

Prerequisites

Installation

Configuration

Running the Application

Troubleshooting

Database Schema

Implementation Details

Note on Streamlit Version

About

Uh oh!

Releases

Packages

Uh oh!

Languages

JohnKennedyOSS/alloyDB-Omni-vector-search

Folders and files

Latest commit

History

Repository files navigation

AlloyDB Semantic Product Search

Overview

Features

Setup

Prerequisites

Installation

Configuration

Running the Application

Troubleshooting

Database Schema

Implementation Details

Note on Streamlit Version

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages