Skip to content

Latest commit

 

History

History
23 lines (16 loc) · 863 Bytes

README.md

File metadata and controls

23 lines (16 loc) · 863 Bytes

PDF (Retrieval-Augmented Generation)

This repository contains two main components:

  1. pdfrag.py - A PDF processing and question-answering app built using Streamlit and Google's Gemini model.
  2. Websearch_agent.py - A web search agent that performs searches and answers questions based on web search results.

Features

  • PDF text extraction using PyPDF2.
  • Chunking of text and creating a FAISS vector store for similarity searches.
  • Integration with Google's Gemini model for QA functionality.
  • Web search and QA using transformers.

Requirements

Before running the application, ensure that the following Python dependencies are installed. You can install them using pip:

pip install -r requirements.txt
streamlit run pdfrag.py    

image