nilAI is a platform designed to run on Confidential VMs with Trusted Execution Environments (TEEs). It provides secure deployment and management of multiple AI models across different environments, exposing a unified API for accessing them, with robust user management and model lifecycle handling.
- Docker
- Docker Compose
- Hugging Face API Token (for accessing certain models)
- Environment Setup
- Copy the `.env.sample` file to `.env`:

```shell
cp .env.sample .env
```

- Update the environment variables in `.env`:
  - `HUGGINGFACE_API_TOKEN`: Your Hugging Face API token (a sample entry is shown below). Obtain the token by requesting access on the specific model's Hugging Face page; for example, request access to the Llama 1B model on its model page. Note that the Llama-8B model requires a separate access request.
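For reference, the resulting entry in `.env` looks like this (the token value is a placeholder):

```shell
# .env
HUGGINGFACE_API_TOKEN=hf_xxxxxxxxxxxxxxxxxxxx
```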
For a development deployment with a GPU model, first build the containers:

```shell
# Build nilai_attestation endpoint
docker build -t nillion/nilai-attestation:latest -f docker/attestation.Dockerfile .
# Build vLLM docker container
docker build -t nillion/nilai-vllm:latest -f docker/vllm.Dockerfile .
# Build nilai_api container
docker build -t nillion/nilai-api:latest -f docker/api.Dockerfile --target nilai .
```
Then, to deploy:
```shell
# Deploy the development stack with the Llama 1B GPU model
docker compose -f docker-compose.yml \
  -f docker-compose.dev.yml \
  -f docker/compose/docker-compose.llama-1b-gpu.yml \
  up -d

# Monitor logs
docker compose -f docker-compose.yml \
  -f docker-compose.dev.yml \
  -f docker/compose/docker-compose.llama-1b-gpu.yml \
  logs -f
```
For a production deployment with GPU models, build the containers:

```shell
# Build nilai_attestation endpoint
docker build -t nillion/nilai-attestation:latest -f docker/attestation.Dockerfile .
# Build vLLM docker container
docker build -t nillion/nilai-vllm:latest -f docker/vllm.Dockerfile .
# Build nilai_api container
docker build -t nillion/nilai-api:latest -f docker/api.Dockerfile --target nilai .
```
To deploy:
```shell
docker compose -f docker-compose.yml \
  -f docker-compose.prod.yml \
  -f docker/compose/docker-compose.llama-3b-gpu.yml \
  -f docker/compose/docker-compose.llama-8b-gpu.yml \
  up -d
```
Note: Remove lines for models you do not wish to deploy.
For a CPU-only development deployment, build the containers (note the `--platform linux/amd64` flag on the API image):

```shell
# Build nilai_attestation endpoint
docker build -t nillion/nilai-attestation:latest -f docker/attestation.Dockerfile .
# Build vLLM docker container
docker build -t nillion/nilai-vllm:latest -f docker/vllm.Dockerfile .
# Build nilai_api container
docker build -t nillion/nilai-api:latest -f docker/api.Dockerfile --target nilai --platform linux/amd64 .
```
To deploy:
```shell
docker compose -f docker-compose.yml \
  -f docker-compose.dev.yml \
  -f docker/compose/docker-compose.llama-1b-cpu.yml \
  up -d
```
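Once the stack is up, you can send a test request. The sketch below assumes an OpenAI-compatible interface (the convention vLLM exposes); the port, endpoint path, model name, and auth header are assumptions to adapt to your deployment:

```shell
# Illustrative request; adjust port, path, model, and credentials as needed
curl -s http://localhost:8080/v1/chat/completions \
  -H "Authorization: Bearer <your-api-key>" \
  -H "Content-Type: application/json" \
  -d '{"model": "meta-llama/Llama-3.2-1B-Instruct", "messages": [{"role": "user", "content": "Hello!"}]}'
```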
- API Frontend: Handles user requests and routes model interactions
- Databases:
  - PostgreSQL: User registry and access management
  - etcd3: Distributed key-value store for model lifecycle management
1. **Start Supporting Services** (etcd3, Redis, and PostgreSQL)
```shell
# Start etcd3 (model registry)
docker run -d --name etcd-server \
  -p 2379:2379 -p 2380:2380 \
  -e ALLOW_NONE_AUTHENTICATION=yes \
  bitnami/etcd:latest

# Start Redis
docker run -d --name redis \
  -p 6379:6379 \
  redis:latest

# Start PostgreSQL
docker run -d --name postgres \
  -e POSTGRES_USER=${POSTGRES_USER} \
  -e POSTGRES_PASSWORD=${POSTGRES_PASSWORD} \
  -e POSTGRES_DB=${POSTGRES_DB} \
  -p 5432:5432 \
  --network frontend_net \
  --volume postgres_data:/var/lib/postgresql/data \
  postgres:16
```
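To verify the supporting services came up (container names as above):

```shell
# etcd cluster health
docker exec etcd-server etcdctl endpoint health
# Redis ping (expects PONG)
docker exec redis redis-cli ping
# PostgreSQL readiness
docker exec postgres pg_isready -U ${POSTGRES_USER}
```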
2. **Run API Server**

```shell
# Development Environment
fastapi dev nilai-api/src/nilai_api/__main__.py --port 8080

# Production Environment
uv run fastapi run nilai-api/src/nilai_api/__main__.py --port 8080
```
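FastAPI serves auto-generated interactive documentation by default, so once the server is running you can browse the endpoints at `http://localhost:8080/docs`, or fetch the schema directly (these routes are FastAPI defaults and assume the app does not disable them):

```shell
# Fetch the auto-generated OpenAPI schema (FastAPI default route)
curl -s http://localhost:8080/openapi.json
```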
3. **Run Model Instances**

```shell
# Example: Llama 3.2 1B Model
# Development Environment
uv run fastapi dev nilai-models/src/nilai_models/models/llama_1b_cpu/__init__.py

# Production Environment
uv run fastapi run nilai-models/src/nilai_models/models/llama_1b_cpu/__init__.py
```
Install pre-commit hooks to automatically format code and run checks:
```shell
uv run pre-commit install
```
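To run all hooks against the entire tree once (standard pre-commit usage, useful before your first commit):

```shell
uv run pre-commit run --all-files
```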
- Models register themselves in the etcd database
- Registration includes address information with an auto-expiring lifetime (see the sketch below)
- If a model disconnects, it is automatically removed from the available models
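The lease mechanics behind this registration can be sketched with `etcdctl`; the key layout, value format, and TTL below are illustrative, not the exact schema nilAI uses:

```shell
# Grant a 30-second lease; the holder must keep it alive or its keys expire
LEASE_ID=$(docker exec etcd-server etcdctl lease grant 30 | awk '{print $2}')
# Register a model endpoint under the lease (key and value are illustrative)
docker exec etcd-server etcdctl put /models/llama-1b "http://llama-1b:8000" --lease="$LEASE_ID"
# List currently registered models; entries vanish once their lease expires
docker exec etcd-server etcdctl get /models/ --prefix
```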
- Hugging Face API token controls model access
- PostgreSQL database manages user permissions
- Distributed architecture allows for flexible security configurations
Common issues and solutions:
- Container Logs

```shell
# View logs for all services
docker compose logs -f
# View logs for specific service
docker compose logs -f api
```

- Database Connection

```shell
# Check PostgreSQL connection
docker exec -it postgres psql -U ${POSTGRES_USER} -d ${POSTGRES_DB}
```

- Service Health

```shell
# Check service health status
docker compose ps
```
To configure vLLM for local execution on macOS, execute the following steps:
```shell
# Clone vLLM repository (root folder)
git clone https://github.com/vllm-project/vllm.git
cd vllm
git checkout v0.7.3  # We use v0.7.3

# Build vLLM OpenAI image (vllm folder)
docker build -f Dockerfile.arm -t vllm/vllm-openai . --shm-size=4g
```
```shell
# Return to the repository root
cd ..

# Build nilai attestation container
docker build -t nillion/nilai-attestation:latest -f docker/attestation.Dockerfile .
# Build vLLM docker container (root folder)
docker build -t nillion/nilai-vllm:latest -f docker/vllm.Dockerfile .
# Build nilai_api container
docker build -t nillion/nilai-api:latest -f docker/api.Dockerfile --target nilai --platform linux/amd64 .
```
- Fork the repository
- Create a feature branch
- Install pre-commit hooks
- Make your changes
- Submit a pull request
[Add your project's license information here]