Change the repository type filter
All
Repositories list
31 repositories
- Community maintained hardware plugin for vLLM on Ascend
- A framework for efficient model inference with omni-modality models
- System Level Intelligent Router for Mixture-of-Models
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
vllm-openvino
Publicmedia-kit
PublicDeepGEMM
Publicrfcs
Public