Sentiment Analysis with LSTM

A professional, modular sentiment analysis framework using LSTM neural networks for movie review classification.

Overview

This project is a complete implementation of sentiment analysis using Long Short-Term Memory (LSTM) neural networks, intended for production use. Built on TensorFlow/Keras, it offers a modular architecture, a pytest-based test suite, three interfaces (CLI, Python API, and Jupyter notebooks), and accompanying documentation.

Key Features

- Modular Architecture: Clean separation of concerns, with dedicated modules for data loading, model building, training, prediction, and visualization
- Bidirectional LSTM: Reads each sequence in both forward and backward directions, so context on either side of a word informs the classification
- Multiple Interfaces:
  - Python API for programmatic access
  - CLI tools for command-line usage
  - Jupyter notebooks for interactive exploration
- Comprehensive Testing: Unit tests written with pytest
- Production Ready: Model persistence, logging, and configuration management
- Rich Visualizations: Training curves, confusion matrices, ROC curves, and prediction distributions
- Easy Installation: A single pip command installs the package and all dependencies

Installation

Prerequisites

Python 3.8 or higher
pip package manager

Install from Source

# Clone the repository
git clone https://github.com/pyenthusiasts/Sentiment-Analysis-LSTM.git
cd Sentiment-Analysis-LSTM

# Install the package
pip install -e .

Install Dependencies Only

pip install -r requirements.txt

Quick Start

Python API

from sentiment_analysis.train import Trainer
from sentiment_analysis.predict import Predictor

# Train a model
trainer = Trainer()
(X_train, y_train), (X_test, y_test) = trainer.prepare_data()
history = trainer.train(X_train, y_train, X_test, y_test)

# Make predictions
predictor = Predictor()
result = predictor.predict_text("Amazing movie! Loved it!")
print(f"Sentiment: {result['sentiment']} (Score: {result['score']:.2f})")

Project Structure

Sentiment-Analysis-LSTM/
├── src/sentiment_analysis/      # Main package
│   ├── config.py                # Configuration
│   ├── data_loader.py           # Data loading
│   ├── model.py                 # Model architecture
│   ├── train.py                 # Training logic
│   ├── predict.py               # Prediction logic
│   ├── utils.py                 # Utilities
│   └── visualization.py         # Visualizations
├── tests/                       # Unit tests
├── examples/                    # Example scripts
├── notebooks/                   # Jupyter notebooks
├── requirements.txt             # Dependencies
└── setup.py                     # Package setup

Usage

Training a Model

from sentiment_analysis.train import Trainer

trainer = Trainer()
(X_train, y_train), (X_test, y_test) = trainer.prepare_data()
history = trainer.train(X_train, y_train, X_test, y_test, epochs=5)

Making Predictions

from sentiment_analysis.predict import Predictor

predictor = Predictor()
result = predictor.predict_text("This movie was amazing!")
print(f"Sentiment: {result['sentiment']}")
print(f"Confidence: {result['confidence']:.2%}")
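The snippets in this README read `sentiment`, `score`, and `confidence` from the result dictionary, but how `Predictor` fills those fields is not shown here. The sketch below is one plausible mapping, assuming `score` is the raw sigmoid output in [0, 1], a decision threshold of 0.5, and confidence defined as the probability of the predicted class. The helper `interpret_score` is hypothetical and not part of the package.

```python
def interpret_score(score: float, threshold: float = 0.5) -> dict:
    """Map a sigmoid output to a label and a confidence (hypothetical helper)."""
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must lie in [0, 1]")
    is_positive = score >= threshold
    return {
        "sentiment": "positive" if is_positive else "negative",
        "score": score,
        # Assumption: confidence is the probability of the predicted class.
        "confidence": score if is_positive else 1.0 - score,
    }


result = interpret_score(0.25)
print(f"Sentiment: {result['sentiment']}")        # Sentiment: negative
print(f"Confidence: {result['confidence']:.2%}")  # Confidence: 75.00%
```

Under this definition the confidence never falls below 50%, so a value near 50% signals a review the model could not separate clearly.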

Model Architecture

Embedding Layer: 128-dimensional word embeddings
Bidirectional LSTM: 64 units processing in both directions
Dropout: 0.5 rate for regularization
LSTM: 32 units
Dense Output: Sigmoid activation for binary classification
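As a rough size check, the trainable parameters of the layers listed above can be counted by hand. This is a sketch under stated assumptions: the layers are stacked in the order listed, the vocabulary and embedding sizes are the defaults from the Configuration section, the bidirectional layer concatenates its two directions and returns the full sequence so the second LSTM can consume it, and each LSTM uses the standard Keras parameterization (four gates, each with an input kernel, a recurrent kernel, and a bias). The actual model in `model.py` may differ.

```python
def lstm_params(input_dim: int, units: int) -> int:
    """Trainable parameters of one standard LSTM layer."""
    # Four gates, each with an input kernel, a recurrent kernel, and a bias.
    return 4 * (units * (input_dim + units) + units)


VOCAB_SIZE = 10_000    # default from the Configuration section
EMBEDDING_DIM = 128    # default from the Configuration section

embedding = VOCAB_SIZE * EMBEDDING_DIM        # one vector per vocabulary entry
bi_lstm = 2 * lstm_params(EMBEDDING_DIM, 64)  # forward and backward LSTM
second_lstm = lstm_params(2 * 64, 32)         # input is the concatenated 128-dim output
dense = 32 * 1 + 1                            # weights plus bias for one sigmoid unit
# Dropout has no trainable parameters.

total = embedding + bi_lstm + second_lstm + dense
print(f"embedding={embedding:,} bi_lstm={bi_lstm:,} "
      f"lstm={second_lstm:,} dense={dense} total={total:,}")
```

Under these assumptions the embedding table accounts for about 1.28 million of roughly 1.4 million parameters, so `VOCAB_SIZE` and `EMBEDDING_DIM` dominate the model size.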

Configuration

Edit src/sentiment_analysis/config.py to customize:

VOCAB_SIZE: 10000 (vocabulary size)
MAX_LENGTH: 300 (sequence length)
EMBEDDING_DIM: 128
BATCH_SIZE: 128
EPOCHS: 5
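To illustrate what `VOCAB_SIZE` and `MAX_LENGTH` control, here is a minimal pure-Python sketch of how a tokenized review could be clipped to the vocabulary and brought to a fixed length. It is not the project's `data_loader.py`. It assumes the conventions of the Keras IMDB loader and `pad_sequences` defaults: out-of-vocabulary ids are replaced by 2, padding uses 0 at the front, and over-long reviews keep their last tokens. The function name `prepare_sequence` is hypothetical.

```python
from typing import List

VOCAB_SIZE = 10_000  # default from config.py, as listed above
MAX_LENGTH = 300     # default from config.py, as listed above
PAD_ID = 0           # assumption: padding value, as in Keras pad_sequences
OOV_ID = 2           # assumption: out-of-vocabulary id, as in the Keras IMDB loader


def prepare_sequence(token_ids: List[int],
                     vocab_size: int = VOCAB_SIZE,
                     max_length: int = MAX_LENGTH) -> List[int]:
    """Clip ids to the vocabulary, then pad or truncate to max_length."""
    if max_length <= 0:
        raise ValueError("max_length must be positive")
    # Replace any id outside the vocabulary with the OOV id.
    clipped = [t if t < vocab_size else OOV_ID for t in token_ids]
    # Keep only the last max_length tokens ("pre" truncation).
    clipped = clipped[-max_length:]
    # Left-pad short sequences with PAD_ID ("pre" padding).
    return [PAD_ID] * (max_length - len(clipped)) + clipped


short = prepare_sequence([5, 20_000, 7])
print(len(short), short[-3:])  # 300 [5, 2, 7]
```

A larger `MAX_LENGTH` keeps more of each long review at the cost of more computation per batch; a larger `VOCAB_SIZE` maps fewer words to the out-of-vocabulary id but enlarges the embedding table.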

Examples

Run example scripts:

python examples/basic_usage.py
python examples/custom_training.py
python examples/prediction_only.py

Testing

# Run all tests
pytest

# Run with coverage
pytest --cov=sentiment_analysis

Contributing

Contributions are welcome. See CONTRIBUTING.md for guidelines.

License

MIT License - see LICENSE for details.

Acknowledgments

Built with TensorFlow and Keras
IMDB dataset from Stanford AI Lab

Made with passion for NLP and Deep Learning
