LLM Practice Suite

This project is a modular playground for experimenting with Large Language Models (LLMs) on various NLP tasks. It currently supports book/document summarization and is designed for easy extension to other tasks such as Named Entity Recognition (NER), Question Answering (QA), and more.

Features

Summarization: Chunking and step-by-step summarization of large documents using local HuggingFace models.
PDF Support: Reads and processes PDF files.
Interactive Web App: Summarize PDFs directly in your browser using the Streamlit-based summarizer_app.py.
Configurable: All model and cache paths are set in config.py.
Extensible: Structure supports adding new LLM-based modules (NER, QA, etc.).

Planned Modules

Summarization (Jupyter notebook & Streamlit app; already available)
Named Entity Recognition (NER)
Question Answering (QA)
Custom LLM Experiments

Folder Structure

.
├── config.py               # Configuration file for model/cache settings
├── config_loader.py        # Loads config.py dynamically
├── requirements.txt        # Python dependencies
├── summarizer.ipynb        # Summarization notebook
├── summarizer_app.py       # Streamlit app for PDF summarization
├── The McKinsey Way...pdf  # Example PDF book
├── .gitignore
├── LICENSE
└── README.md
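
The tree describes config_loader.py as loading config.py dynamically. The repository's actual loader is not shown here, so the following is only a minimal sketch of one common way to do this with the standard library's importlib. The function name load_config and the settings MODEL_PATH and CACHE_DIR are hypothetical, chosen for illustration.

```python
import importlib.util
import tempfile
from pathlib import Path


def load_config(path):
    """Import a Python file by its path and return it as a module object."""
    path = Path(path)
    spec = importlib.util.spec_from_file_location(path.stem, path)
    if spec is None or spec.loader is None:
        raise ImportError(f"Cannot load config from {path}")
    module = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(module)  # runs the file's top-level assignments
    return module


# Demo against a throwaway config file.
# MODEL_PATH and CACHE_DIR are hypothetical setting names, not the project's.
with tempfile.TemporaryDirectory() as tmp:
    cfg_file = Path(tmp) / "config.py"
    cfg_file.write_text('MODEL_PATH = "models/example"\nCACHE_DIR = "cache"\n')
    cfg = load_config(cfg_file)

print(cfg.MODEL_PATH, cfg.CACHE_DIR)
```

Loading by file path rather than with a plain import means the settings file can live anywhere on disk, which is presumably why a separate loader module exists.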

Requirements

Install dependencies:

pip install -r requirements.txt

Usage

Configure paths: Edit config.py to set your model and cache directories.
Download NLTK data: The notebook automatically downloads the required NLTK data to your cache directory.
Run a module:
- For the Jupyter notebook, open summarizer.ipynb in Jupyter or VS Code and run all cells.
- For the Streamlit app, run the command streamlit run summarizer_app.py in your terminal, then open the URL it prints in your browser.
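
As a rough illustration of the first step, a config.py might look something like the sketch below. The real file's variable names are not documented in this README, so MODEL_PATH, CACHE_DIR, and NLTK_DATA_DIR are all assumptions; adapt them to whatever the notebook and app actually read.

```python
# config.py (illustrative sketch; every name below is an assumption)
from pathlib import Path

# Directory holding the locally downloaded HuggingFace model.
MODEL_PATH = Path("models") / "summarizer"

# Directory for cached data, such as the NLTK downloads mentioned above.
CACHE_DIR = Path("cache")
NLTK_DATA_DIR = CACHE_DIR / "nltk_data"
```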

How it works

Loads configuration from config.py.
Loads a local Transformers model for the selected task.
Reads and processes input data (e.g., PDF for summarization).
Applies chunking and LLM inference as needed.
Designed for easy extension to new tasks.
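
The chunk-then-summarize flow above can be sketched roughly as follows. This is not the project's code: it assumes a two-pass scheme (summarize each chunk, then summarize the joined partial summaries), splits sentences with a simple regular expression rather than NLTK, and uses a stand-in summarizer so it runs without a model. The names chunk_text, summarize_document, and first_sentence are hypothetical.

```python
import re
from typing import Callable, List


def chunk_text(text: str, max_words: int = 200) -> List[str]:
    """Group whole sentences into chunks of roughly max_words words.

    A single sentence longer than max_words becomes its own chunk.
    """
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks, current, count = [], [], 0
    for sentence in sentences:
        n = len(sentence.split())
        # Start a new chunk when adding this sentence would exceed the budget.
        if current and count + n > max_words:
            chunks.append(" ".join(current))
            current, count = [], 0
        current.append(sentence)
        count += n
    if current:
        chunks.append(" ".join(current))
    return chunks


def summarize_document(text: str, summarize: Callable[[str], str],
                       max_words: int = 200) -> str:
    """Summarize each chunk, then summarize the joined partial summaries."""
    partial = [summarize(chunk) for chunk in chunk_text(text, max_words)]
    if len(partial) <= 1:
        return " ".join(partial)
    return summarize(" ".join(partial))


def first_sentence(chunk: str) -> str:
    """Stand-in summarizer: keep only the first sentence of the chunk."""
    return re.split(r"(?<=[.!?])\s+", chunk.strip())[0]


text = "Cats sleep a lot. They like warm spots. Dogs enjoy walks. They fetch sticks."
chunks = chunk_text(text, max_words=8)
summary = summarize_document(text, first_sentence, max_words=8)
print(chunks)
print(summary)
```

In the project itself, the summarize argument would presumably wrap the locally loaded Transformers model; passing it in as a callable keeps the chunking logic independent of any particular model.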

License

MIT License. See LICENSE for details.

marziehsepehr/GA_practices
