This sample project demonstrates how to deploy serverless generative AI on AWS at very low cost.
The main idea is to deploy a container image to a Lambda function and interact with the model through an HTTP endpoint.
You need to have installed:
- Docker;
- AWS CLI;
- Make.
Make sure you have created:
- an ECR repository.
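If you have not created the repository yet, it can be created with the AWS CLI; the repository name below is only an example:

```sh
# Create an ECR repository (the name "serverless-llm" is an example)
aws ecr create-repository --repository-name serverless-llm
```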
Create the .env file from the .env.dist file and update it with:
- ECR: the ECR registry;
- REPOSITORY: the ECR repository;
- MODEL_URL: the download URL of a model in GGUF format (https://huggingface.co/models?library=gguf).
Note that the model should be a little smaller than the memory limit of the Lambda, which is about 10 GB at most.
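For reference, a filled-in .env might look like the following; the account ID, region, repository name, and model path are placeholders:

```sh
# Example .env (all values are placeholders)
ECR=123456789012.dkr.ecr.us-east-1.amazonaws.com
REPOSITORY=serverless-llm
# Hugging Face serves raw model files via the resolve/<revision> path
MODEL_URL=https://huggingface.co/<org>/<repo>/resolve/main/<model>.gguf
```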
---
Build and push the image to the registry.
Download the model:
```sh
make download
```
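This target is expected to fetch the file referenced by MODEL_URL; a minimal sketch of what it likely does, assuming the variables from .env are exported:

```sh
# Hypothetical equivalent of `make download`: fetch the GGUF model locally
curl -L "$MODEL_URL" -o model.gguf
```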
Build the container image and tag it:
```sh
make build
make tag
```
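Both targets presumably wrap standard Docker commands; a plausible sketch, with the image name and registry taken from .env:

```sh
# Hypothetical equivalents of `make build` and `make tag`
docker build -t "$REPOSITORY" .
docker tag "$REPOSITORY:latest" "$ECR/$REPOSITORY:latest"
```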
Log in to ECR:
```sh
make ecr-login
```
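This target most likely wraps the standard ECR login flow; the region is an example:

```sh
# Hypothetical equivalent of `make ecr-login`
aws ecr get-login-password --region us-east-1 \
  | docker login --username AWS --password-stdin "$ECR"
```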
Push the image:
```sh
make push
```
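The equivalent Docker command would be:

```sh
# Hypothetical equivalent of `make push`
docker push "$ECR/$REPOSITORY:latest"
```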
---
Create a Lambda function from your container image; a CLI sketch follows the checklist below.
Make sure to:
- set the maximum available memory;
- enable the function URL;
- increase the timeout if necessary.
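If you prefer the CLI to the console, a hedged sketch of the same steps; the function name, role ARN, and timeout are placeholders, and 10240 MB is the current Lambda maximum:

```sh
# Hypothetical CLI equivalent of the console setup above
aws lambda create-function \
  --function-name serverless-llm \
  --package-type Image \
  --code ImageUri="$ECR/$REPOSITORY:latest" \
  --role arn:aws:iam::123456789012:role/lambda-exec-role \
  --memory-size 10240 \
  --timeout 300

# Expose a public function URL (AuthType NONE means no IAM auth)
aws lambda create-function-url-config \
  --function-name serverless-llm \
  --auth-type NONE

# AuthType NONE also requires a public invoke permission
aws lambda add-permission \
  --function-name serverless-llm \
  --statement-id FunctionURLAllowPublicAccess \
  --action lambda:InvokeFunctionUrl \
  --principal "*" \
  --function-url-auth-type NONE
```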
Make a request to the function endpoint to get the model response:
curl "https://{LAMBDA_FUNCTION_URL}/prompt?text=hello"