
Using MLflow to deploy your RAG pipeline, built with LlamaIndex, LangChain, and Ollama / Hugging Face LLMs / Groq


AnasAber/MLflow_with_RAG


MLflow Deployment of a RAG Pipeline 🥀

This project is for anyone who wants to deploy a RAG pipeline using MLflow.

The project uses:

  • LlamaIndex and LangChain as orchestrators
  • Ollama, Hugging Face LLMs, and Groq as model backends
  • MLflow as an MLOps framework for deployment and tracking

Project Overview Diagram
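To make the moving parts concrete, here is a minimal sketch of the retrieve-then-generate flow that such a pipeline wraps. The retriever and LLM below are stubs standing in for the LlamaIndex/LangChain components and the Ollama / Hugging Face / Groq model calls; none of the function names come from this repository.

```python
# Toy RAG flow: retrieve relevant documents, build a prompt, generate an answer.
# All components here are illustrative stubs, not the project's actual code.

def retrieve(query: str, documents: list[str], top_k: int = 2) -> list[str]:
    """Toy retriever: rank documents by word overlap with the query."""
    q_words = set(query.lower().split())
    scored = sorted(
        documents,
        key=lambda d: len(q_words & set(d.lower().split())),
        reverse=True,
    )
    return scored[:top_k]

def generate(prompt: str) -> str:
    """Stub LLM call; a real pipeline would call Ollama, Hugging Face, or Groq here."""
    return f"Answer based on: {prompt}"

def rag_answer(query: str, documents: list[str]) -> str:
    # Stuff the retrieved context into the prompt, then generate.
    context = "\n".join(retrieve(query, documents))
    prompt = f"Context:\n{context}\n\nQuestion: {query}"
    return generate(prompt)

docs = [
    "MLflow tracks experiments and serves models.",
    "LlamaIndex builds indexes over your documents.",
]
print(rag_answer("How does MLflow serve models?", docs))
```

MLflow's role is to package this whole flow as a logged model so it can be versioned, tracked, and served behind an HTTP endpoint.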

How to start

  1. Clone the repository:
git clone https://github.com/AnasAber/RAG_in_CPU.git
  2. Install the dependencies:
pip install -r requirements.txt

Make sure to put your API keys into example.env, then rename it to .env.
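If you prefer not to add a dependency, a .env file of `KEY=value` lines can be loaded with the standard library alone (python-dotenv does the same job). This is a hedged sketch; the variable name `HUGGINGFACE_API_KEY` is illustrative, not taken from the repo's example.env.

```python
# Minimal .env loader using only the standard library.
# Lines look like: HUGGINGFACE_API_KEY=hf_xxx (name is an example, not from the repo).
import os

def load_env(path: str = ".env") -> dict:
    loaded = {}
    with open(path) as f:
        for line in f:
            line = line.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue  # skip blanks, comments, and malformed lines
            key, _, value = line.partition("=")
            loaded[key.strip()] = value.strip()
            # Don't clobber variables already set in the real environment.
            os.environ.setdefault(key.strip(), value.strip())
    return loaded

# Example: keys = load_env(); keys.get("HUGGINGFACE_API_KEY")
```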

  3. Prepare the notebook:
  • Put your own data files in the data/ folder.
  • In the notebook, replace "api_key_here" with your Hugging Face API key.
  • If you have a GPU, you're fine; if not, run it on Google Colab and make sure to download the JSON output file at the end of the run.
  4. Go to the deployement folder and open two terminals. In the first, run:
python workflow.py

After the run finishes, open the MLflow UI, copy the run ID, and place it into this command:

mlflow models serve -m runs:/<run id>/rag_deployement -p 5001
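Once the server is up, MLflow exposes the model on POST /invocations. The exact payload key depends on the model's signature; "inputs" (shown below) and "dataframe_split" are the common JSON formats accepted by MLflow 2.x scoring servers. The query text and the assumption that the model takes a single string are illustrative.

```python
# Hedged sketch: query the model served on port 5001 via MLflow's
# /invocations endpoint, using only the standard library.
import json
from urllib import request

def build_payload(query: str) -> bytes:
    # "inputs" is one of the JSON formats MLflow's scoring server accepts;
    # adjust to your model's signature if it differs.
    return json.dumps({"inputs": [query]}).encode("utf-8")

def ask(query: str, url: str = "http://127.0.0.1:5001/invocations") -> str:
    req = request.Request(
        url,
        data=build_payload(query),
        headers={"Content-Type": "application/json"},
    )
    with request.urlopen(req) as resp:
        return resp.read().decode("utf-8")

# ask("What does this project deploy?")  # requires the serve command above to be running
```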

In the second terminal, run:

python app.py
  5. Open another terminal, move to the frontend folder, and run:
npm start

You should now see the web interface, with both terminals still running.

If you hit errors, check whether anything is missing from requirements.txt.

Enjoy!