🚀
keeping everything running
Sr. Machine Learning Engineer @ Red Hat | Inference Engineering | Building llm-d: distributed inference for LLMs on Kubernetes
- San Francisco
Pinned
- llm-d/llm-d (Public): Achieve state-of-the-art inference performance with modern accelerators on Kubernetes
- kubernetes-sigs/gateway-api-inference-extension (Public): Gateway API Inference Extension
- llm-d-incubation/llm-d-infra (Public): llm-d Helm charts and deployment examples
- llm-d-incubation/llm-d-modelservice (Public): Helm charts for deploying models with llm-d