This project is an LLM-powered Retrieval-Augmented Generation (RAG) chatbot designed specifically for Hinglish (Hindi-English code-switched) conversations. Inspired by TurboMLโs mission to democratize AI, this chatbot leverages efficient retrieval and lightweight inference to enable real-time, multilingual AI applications.
๐น FAISS-based Retrieval โ Quickly fetches relevant Hinglish text to enhance context.
๐น LLM-powered Response Generation โ Uses Mistral-7B/LLaMA-2-7B for contextual replies.
๐น Optimized for Low-Resource Languages โ Custom embeddings & reinforcement learning improve retrieval.
๐น Efficient Inference Pipeline โ Runs on GPU for fast processing in real-world applications.
๐น Gradio-powered UI โ Interactive chatbot interface for seamless deployment.
1๏ธโฃ Retrieval: FAISS searches the most relevant Hinglish text snippets.
2๏ธโฃ Contextual Prompting: Retrieved text is formatted into a structured prompt.
3๏ธโฃ Response Generation: A fine-tuned LLM (Mistral/LLaMA-2) generates an appropriate response.
4๏ธโฃ User Interaction: The chatbot provides real-time answers in Hinglish.
# Clone the repository
git clone https://github.com/LLM_Hinglish/hinglish-rag-chatbot.git
cd hinglish-rag-chatbot
# Install dependencies (Ensure GPU support)
pip install -r requirements.txt
# Run the chatbot
python app.py
To launch the chatbot UI:
python app.py
This will generate a public shareable link for easy access.
โ
India-first LLM Adaptation: Hinglish is widely spoken but underrepresented in AI models.
โ
Scalable & Efficient: Works well even in low-resource environments.
โ
TurboML Inspiration: Aligns with the mission of building open-source foundation models for India.
๐ฅ Want to contribute? Fork the repo & raise a PR!
๐ฉ Interested in AI research? Connect with me on LinkedIn!
Letโs build Indiaโs AI future together! ๐ ๐ฎ๐ณ