-
Notifications
You must be signed in to change notification settings - Fork 52
Description
Community Note
- Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
- Please do not leave "+1" or other comments that do not add relevant new information or questions, they generate extra noise for issue followers and do not help prioritize the request
- If you are interested in working on this issue or have submitted a pull request, please leave a comment
What is the outcome that you are trying to reach?
Proposal to add guidance on generating container images for the Neuron SDK. Customers need to have e comprehensive way to build container images for running frameworks for Inference and Training.
The guidance should provide instructions in how to build a Container Image and push it into ECR so that it can then be pulled and used in the blueprints.
Describe the solution you would like
The guidance can make reference to building images that include software stacks such as Ray or vLLM but could be use to build images with other frameworks. It can the reference or be referenced in Blueprints that can use this type of Container image
Describe alternatives you have considered
Having this guidance published in the Neuron documentation
Additional context
Starting resources:
https://github.com/awslabs/ai-on-eks/blob/main/blueprints/inference/neuron/ray-vllm/dockerfiles/Dockerfile
https://catalog.us-east-1.prod.workshops.aws/workshops/99fcf694-3f9c-4ad6-b450-659112de3481/en-US