
Conversation

shanbady (Contributor)

What are the relevant tickets?

Closes https://github.com/mitodl/hq/issues/8429

Description (What does it do?)

This PR caches the dense encoder instance, which avoids unnecessary calls to litellm endpoints and also frees us from passing a dense encoder instance around, since calling the `dense_encoder` method now has no performance hit.
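For context, the caching amounts to memoizing the constructor, along the lines of this sketch (`functools.lru_cache` and `_construct_encoder` are illustrative assumptions here; see the diff for the actual implementation):

```python
from functools import lru_cache

@lru_cache(maxsize=1)
def dense_encoder():
    """Return the dense encoder, constructing it at most once per process."""
    # Construction is what triggers the litellm/ollama/openai call seen in
    # the logs below; every later call returns the same cached instance.
    return _construct_encoder()  # hypothetical constructor, for illustration
```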

How can this be tested?

  1. Checkout `main`.
  2. Try instantiating the dense encoder and note that every call hits litellm/ollama or openai, depending on your local setup:

     ```python
     from vector_search.utils import dense_encoder

     encoder = dense_encoder()
     # [2025-10-20 17:47:16] WARNING 7118 [root] litellm.py:25 - [0c41fc84b062] - Model nomic-embed-text not found in tiktoken. defaulting to None

     encoder = dense_encoder()
     # [2025-10-20 17:47:20] WARNING 7118 [root] litellm.py:25 - [0c41fc84b062] - Model nomic-embed-text not found in tiktoken. defaulting to None

     encoder = dense_encoder()
     # [2025-10-20 17:47:20] WARNING 7118 [root] litellm.py:25 - [0c41fc84b062] - Model nomic-embed-text not found in tiktoken. defaulting to None
     ```

  3. Checkout this branch, repeat the same calls, and note that the endpoint is called only once (a quick identity check is sketched after this list).
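Optionally, you can confirm the instance is reused by checking object identity (this assumes the cache returns the same object on every call, which is what caching the instance implies):

```python
from vector_search.utils import dense_encoder

# Both calls should return the cached instance, with no second endpoint call.
assert dense_encoder() is dense_encoder()
```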

@shanbady shanbady added the Needs Review label Oct 20, 2025
@shanbady shanbady marked this pull request as ready for review October 20, 2025 17:55
