Part of: https://github.com/vllm-project/aibrix/issues/1430
Part of: https://github.com/vllm-project/aibrix/issues/1422