feat: Llama index milvus rag #11
Conversation
Signed-off-by: Ash Evans <ash.evans@ibm.com>
…he env vars Signed-off-by: Ash Evans <ash.evans@ibm.com>
Merge Protections: Your pull request matches the following merge protections and will not be merged until they are valid.
🟢 Enforce conventional commit: Wonderful, this rule succeeded. Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
Hi @aevo98765, this is a good start. I see that you use the native markdown chunker for RAG. I think we can do better: @vagenas has been working on a native chunker on the DoclingDocument and has a lot of experience with llama-index. You can find a nice example here: rag_llamaindex.ipynb
@aevo98765 I don't see where we use the Ollama LLM. Do we need to define an LLM, or can we simply get away with a simple ChromaDB?
Signed-off-by: Ash Evans <ash.evans@ibm.com>
… to docling node parser Signed-off-by: Ash Evans <ash.evans@ibm.com>
@PeterStaar-IBM Changed the code to use the Docling native approach. I had to switch to Milvus to achieve this, storing it in memory; ChromaDB wouldn't work. Ollama models are used under the hood in the LlamaIndex query engine. It is not explicit at the moment, as it is set via LlamaIndex's `Settings`, which reads in like an environment variable for LlamaIndex to use. Hope this makes sense.
- To frame this new feature within the purpose of `docling-mcp`: I would document it in README.md more clearly as an optional feature. We could have a separate section at the end (for instance, Applications) where we explain how `docling-mcp` can be leveraged in Agentic RAG applications (basically, what you wrote in terms of configuration and examples).
- In connection with the above, I would move the `export_docling_document_to_vector_db` and `search_documents` tools to another module. For instance, `rag.py`.
- In other docling repos, we leverage pydantic settings, since we think they offer some advantages (we can clearly define the options and type-hint them, validate them, leverage aliases, automatically manage nested variables...).

Later on, we can:

- enable other LLM providers (e.g., watsonx.ai)
- add more variables (e.g., chunker type and chunking options)
Signed-off-by: Ash Evans <70710356+aevo98765@users.noreply.github.com>
…ded at the end in an Applications section Signed-off-by: Ash Evans <ash.evans@ibm.com>
…ng-mcp into llama-index-chromadb-rag
Codecov ReportAttention: Patch coverage is
Also, the tests are failing. This is because the application's Milvus RAG code is conditional. We would need to pass env vars to the test pipeline to fix this. Any suggestions or advice here?
I have added a conditional import in commit dbde847
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
The application tools 'export_docling_document_to_vector_db' and 'search_documents' are imported in 'server.py' depending on the environmental variables. Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
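The conditional import described in that commit can be sketched roughly as follows. The env-var name and module path are assumptions for illustration, not the exact docling-mcp code:

```python
# Sketch of gating the RAG tool imports on environment variables, so the MCP
# server (and the test suite) can start without Milvus/Ollama configured.
# The variable name and module path below are assumptions.
import os

RAG_ENABLED = os.getenv("RAG_ENABLED", "").lower() in {"1", "true", "yes"}

if RAG_ENABLED:
    # Only pull in the Milvus/LlamaIndex-dependent tools when RAG is enabled.
    from docling_mcp.tools.applications import (  # hypothetical module path
        export_docling_document_to_vector_db,
        search_documents,
    )
```

With this guard, CI that does not set the env vars simply skips the heavy imports instead of failing.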
Force-pushed a34b0bd to dbde847
LGTM
Description
This pull request adds a llama-index RAG implementation. It depends on Ollama local model hosting; instructions for how to set this up have been added.
Suggestions for future improvements: