MiRAG — Multi-input Retrieval-Augmented Generation

MiRAG is an interactive, multi-modal application built with Streamlit that leverages Retrieval-Augmented Generation (RAG) to perform question-answering and summarization across various content types:

🌐 Web pages
📄 PDF documents
📺 YouTube videos
📝 Custom user input

Built on LangChain, Gemini (Google Generative AI), and FAISS, MiRAG enables users to query unstructured content intelligently and intuitively.

🚀 Features

🔹 Web QA (RAG from URLs)

Extract and embed content from any public URL (JS and non-JS).
Perform context-aware question answering and summarization.
Retain memory across conversation turns.

🔹 PDF QA

Upload any PDF and perform:
- Contextual Q&A
- Full-document summarization
- Chat history export as PDF

🔹 YouTube Video QA

Input any YouTube video URL to fetch its transcript.
Ask questions and generate a summary.
Ideal for educational content, lectures, and long-form videos.

🔹 Custom Text QA

Use default chatbot mode or paste your own text block.
Build a temporary vectorstore and perform RAG on your content.
Memory support with chat history download.

🛠️ Tech Stack

Python 3.10+
Streamlit – User Interface
LangChain – Chain and embedding orchestration
Google Generative AI (Gemini) – LLM & embeddings
FAISS – Vectorstore for semantic retrieval
YouTube Transcript API – Transcript extraction
FPDF – PDF generation for exporting chats

📦 Installation

Clone the repository:

git clone https://github.com/iamtgiri/MiRAG.git
cd MiRAG

Create a virtual environment:

python -m venv .venv
source .venv/bin/activate   # On Windows: .venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```

Set environment variable:

export GOOGLE_API_KEY=your_api_key_here

Run the app:
```
streamlit run app.py
```

📁 Project Structure

MiRAG/
├── app.py                      # Main Streamlit app
├── pdf_utils.py                # PDF loading, splitting & summarization
├── process_youtube.py          # YouTube video processing & transcript extraction
├── rag_utils.py                # Utility functions & chain builders
├── requirements.txt
└── README.md

📸 Screenshots

A preview of the MiRAG application in action across different modules:

🏠 Home Interface

Module selection screen and branding

📝 Custom Text QA

Normal Q&A without any context Paste custom text, ask questions, and get answers using RAG with memory

🌐 Web QA

Enter a URL, extract content, and perform context-aware Q&A

📄 PDF QA

Upload a PDF, ask questions, and download the chat history as a PDF:

📺 YouTube QA

Enter a YouTube URL, analyze the transcript, and chat with context: Summarize the video and export the chat:

📤 Download Chat History

Export your full conversation as a downloadable PDF

🧠 Credits

Built with LangChain
Powered by Google Gemini
PDF export via FPDF
Transcripts via YouTube Transcript API

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
screenshots		screenshots
.gitignore		.gitignore
LICENCE		LICENCE
README.md		README.md
app.py		app.py
pdf_utils.py		pdf_utils.py
rag_utils.py		rag_utils.py
requirements.txt		requirements.txt
youtube_utils.py		youtube_utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MiRAG — Multi-input Retrieval-Augmented Generation

🚀 Features

🔹 Web QA (RAG from URLs)

🔹 PDF QA

🔹 YouTube Video QA

🔹 Custom Text QA

🛠️ Tech Stack

📦 Installation

📁 Project Structure

📸 Screenshots

🏠 Home Interface

📝 Custom Text QA

🌐 Web QA

📄 PDF QA

📺 YouTube QA

📤 Download Chat History

🧠 Credits

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MiRAG — Multi-input Retrieval-Augmented Generation

🚀 Features

🔹 Web QA (RAG from URLs)

🔹 PDF QA

🔹 YouTube Video QA

🔹 Custom Text QA

🛠️ Tech Stack

📦 Installation

📁 Project Structure

📸 Screenshots

🏠 Home Interface

📝 Custom Text QA

🌐 Web QA

📄 PDF QA

📺 YouTube QA

📤 Download Chat History

🧠 Credits

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages