
penr-oz-neural-network-v3-torch-ddp

Version 3 implementation of the neural network service, leveraging PyTorch Distributed Data Parallel (DDP)

This repository demonstrates the same key neural network concepts as penr-oz-neural-network-v2-torch-nn, with automatic gradient calculations handled by the PyTorch library and its Neural Network (nn) package, the Distributed Data Parallel (DDP) feature to support scaling to multiple GPU (CUDA) devices, and API changes that support downloading/sharding training data for local reads instead of passing it as an API payload.

The implementation covers:

Backpropagation: Auto Gradient Calculation

Gradients are computed automatically by PyTorch's autograd engine through the PyTorch Neural Network (nn) package, as sketched below.
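
A minimal illustration (not this service's actual model code) of PyTorch's automatic gradient calculation: a single loss.backward() call populates gradients for every model parameter.

    import torch
    import torch.nn as nn

    # Tiny illustrative model; the service's real architecture differs.
    model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 1))
    criterion = nn.MSELoss()

    x = torch.randn(16, 4)  # dummy input batch
    y = torch.randn(16, 1)  # dummy targets

    loss = criterion(model(x), y)
    loss.backward()  # autograd fills p.grad for every model parameter

    for name, p in model.named_parameters():
        print(name, p.grad.shape)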

Scaling Calculation Speed and Concurrency

This is achieved by leveraging PyTorch Distributed Data Parallel (DDP).
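
A minimal sketch of the DDP pattern, assuming a launch via torchrun (which sets the RANK, WORLD_SIZE, and LOCAL_RANK environment variables); the service's actual setup and training loop may differ.

    import os
    import torch
    import torch.distributed as dist
    from torch.nn.parallel import DistributedDataParallel as DDP
    from torch.utils.data import DataLoader
    from torch.utils.data.distributed import DistributedSampler

    def ddp_train(model, dataset, epochs=1):
        # One process per GPU; torchrun provides the rank environment variables.
        dist.init_process_group(backend="nccl")  # use "gloo" on CPU-only hosts
        local_rank = int(os.environ["LOCAL_RANK"])
        torch.cuda.set_device(local_rank)

        # Wrap the model so gradients are synchronized across ranks.
        ddp_model = DDP(model.to(local_rank), device_ids=[local_rank])

        # Each rank iterates over a distinct shard of the locally stored
        # dataset, matching the download/shard-for-local-read approach
        # described above.
        sampler = DistributedSampler(dataset)
        loader = DataLoader(dataset, batch_size=32, sampler=sampler)

        optimizer = torch.optim.SGD(ddp_model.parameters(), lr=0.01)
        for epoch in range(epochs):
            sampler.set_epoch(epoch)  # reshuffle shard assignment each epoch
            for x, y in loader:
                x, y = x.to(local_rank), y.to(local_rank)
                loss = torch.nn.functional.mse_loss(ddp_model(x), y)
                optimizer.zero_grad()
                loss.backward()  # gradients are all-reduced across ranks here
                optimizer.step()

        dist.destroy_process_group()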

Quickstart Guide

  1. Clone the Repository:

    git clone https://github.com/derinworks/penr-oz-neural-network-v3-torch-ddp.git
    cd penr-oz-neural-network-v3-torch-ddp
  2. Create and Activate a Virtual Environment:

    • Install Python 3.10 and verify:
      $ python3 --version
      Python 3.10.18
    • Create:
      python3 -m venv venv
    • Activate:
      • On Unix or macOS:
        source venv/bin/activate
      • On Windows:
        venv\Scripts\activate
  3. Install Dependencies:

    pip install -r requirements.txt
  4. Run the Service:

    python main.py

    or

    uvicorn main:app --log-config log_config.json
  5. Interact with the Service: Test the endpoints using Swagger at http://127.0.0.1:8000/docs (see the Python sketch after this list).

  6. Interact with the Dashboard: Diagnose model training at http://127.0.0.1:8000/dashboard.

  7. Quickly spin up the Service in a brand-new Linux VM:

    ./run-in-vm.sh
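
The Swagger UI at /docs suggests the service is a FastAPI app; if so, the full route list is also available as machine-readable OpenAPI JSON at the framework's default path. A quick way to enumerate the endpoints (assuming that default path):

    import json
    from urllib.request import urlopen

    # FastAPI serves its OpenAPI spec at /openapi.json by default; the
    # "paths" keys enumerate every endpoint the service exposes.
    spec = json.load(urlopen("http://127.0.0.1:8000/openapi.json"))
    print(sorted(spec["paths"]))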

Testing and Coverage

To ensure code quality and maintainability, follow these steps to run tests and check code coverage:

  1. Run All Tests:

    python -m pytest -v

    The test suite includes 114 tests across 7 test files:

    • test_main.py - API endpoint tests (29 tests)
    • test_neural_net_model.py - Model implementation tests (43 tests)
    • test_neural_net_layers.py - Custom layer tests (12 tests)
    • test_loaders.py - Dataset loader/downloader tests (7 tests)
    • test_mappers.py - Layer/optimizer mapper tests (3 tests)
    • test_tokenizers.py - Tokenization tests (4 tests)
    • test_ddp.py - Distributed training tests (16 tests)
  2. Run Tests with Coverage: Execute the following commands to run tests and generate a coverage report:

    coverage run -m pytest
    coverage report
  3. Generate HTML Coverage Report (Optional): For a detailed coverage report in HTML format:

    coverage html

    Open the htmlcov/index.html file in a web browser to view the report.

Platform-Specific Tests

Some tests require Linux-specific features (e.g., /dev/shm for shared memory caching) and will be automatically skipped on macOS/Windows:

  • test_train_* - Full training integration tests with model persistence
  • test_cache_miss - Shared memory cache behavior
  • test_delete - Model deletion with shared memory cleanup

These tests will run automatically on Linux systems where /dev/shm is available.
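
A sketch of how such a platform guard can be expressed with pytest; the repository's actual marker and test names may differ, and test_cache_miss here is only a stand-in.

    import os
    import pytest

    # Skip on platforms without POSIX shared memory mounted at /dev/shm
    # (macOS and Windows); the guarded tests run unmodified on Linux.
    requires_dev_shm = pytest.mark.skipif(
        not os.path.isdir("/dev/shm"),
        reason="requires /dev/shm shared memory (Linux only)",
    )

    @requires_dev_shm
    def test_cache_miss():
        ...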
