GitHub - wittyicon29/MultiPDF-Chat: An app built with LangChain and Streamlit that lets you chat with multiple PDFs and can be run locally with GPUs.

MultiPDF-Chat

This is a Python application that lets you have a conversation with multiple PDF documents. You ask questions in natural language, and the application answers based on the content of the loaded PDFs. It uses a language model to generate the answers and only responds to questions related to the loaded PDFs.

Note: To compute embeddings faster, run the app locally with a GPU.

Features

PDF Text Extraction: The application extracts text from multiple PDF documents.
Text Chunking: The extracted text is divided into manageable chunks for processing.
Semantic Search: It performs semantic search on the text chunks using deep learning embeddings.
Conversational AI: You can have a conversation with the AI using natural language.

How it works

The application uses the following components:

PDF Text Extraction: It extracts text from the uploaded PDFs using PyPDF2.
Text Chunking: The extracted text is divided into chunks for efficient processing.
Semantic Search: It leverages Hugging Face Transformers to create embeddings of text chunks and uses FAISS for semantic search.
Conversational AI: Conversations are handled using Streamlit and Hugging Face's conversational models.
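The semantic search step above can be illustrated with a small, dependency-free sketch. This is not the app's code: the bag-of-words `embed` function is a toy stand-in for the Hugging Face embeddings, and the brute-force cosine ranking in `search` stands in for a FAISS index. All function names here are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding: word-count vector (stand-in for a neural embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query, chunks, k=2):
    """Return the k chunks most similar to the query (brute force, no index)."""
    q = embed(query)
    scored = [(cosine(q, embed(c)), c) for c in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [c for _, c in scored[:k]]


chunks = [
    "The invoice total is 42 dollars.",
    "Cats sleep most of the day.",
    "Payment is due within 30 days of the invoice date.",
]
best = search("What is the invoice total?", chunks, k=1)
```

In a setup like the one described, the retrieved chunks would then be passed to the conversational model as context for answering the question.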

Configuration

You can configure the behavior of the application by modifying the code in app.py. You can change the PDF text extraction method, the text chunking parameters, and the conversational AI model.
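To make the chunking parameters concrete, here is a minimal sketch of a character-based splitter with overlap. The function name `split_text` and the parameters `chunk_size` and `chunk_overlap` (with their default values) are assumptions for illustration; the splitter the app actually uses may be named and tuned differently.

```python
def split_text(text, chunk_size=1000, chunk_overlap=200):
    """Split text into character chunks; consecutive chunks share chunk_overlap characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be in [0, chunk_size)")
    step = chunk_size - chunk_overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last window already reached the end of the text
    return chunks


example = split_text("abcdefghij", chunk_size=4, chunk_overlap=2)
```

Larger chunks give the model more context per retrieved passage, while overlap reduces the chance that a sentence is cut in half at a chunk boundary.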

Installation

To run this application, you'll need Python and the required libraries installed. You can install the necessary dependencies using pip:

pip install -r requirements.txt

Then, from the directory that contains app.py, start the app with Streamlit:

streamlit run app.py

Note: You can change the API token environment variable in the .env file.
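As a rough illustration of how a token can be read from a .env-style file, the sketch below parses KEY=VALUE lines using only the standard library. The variable name `API_TOKEN` and both function names are hypothetical; check the repository's environment file for the name the app actually expects.

```python
import os


def load_env_file(path):
    """Parse KEY=VALUE lines from a .env-style file into a dict."""
    values = {}
    with open(path, encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            # Skip blanks, comments, and lines without an assignment.
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def get_api_token(path=".env", name="API_TOKEN"):
    """Prefer a variable already set in the environment; fall back to the file."""
    token = os.environ.get(name)
    if token:
        return token
    if os.path.exists(path):
        return load_env_file(path).get(name)
    return None
```

Keeping the token in an environment file rather than in the source code avoids committing it to version control by accident.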

Usage

Upload PDFs: Upload your PDF documents using the file uploader.
Ask Questions: Type your questions in the input box and click "Ask."
Get Answers: The application will provide answers based on the content of the PDFs.

License

The MultiPDF App is released under the MIT License.

Acknowledgment

Reference - Ask Multiple PDFs

