Skip to content

Actions: vllm-project/vllm

Cleanup PR Body

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
13,611 workflow runs
13,611 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Model][2/N] Automatic conversion of CrossEncoding model. Part 2
Cleanup PR Body #13538: Pull request #19978 edited by noooop
June 24, 2025 07:24 3m 12s
June 24, 2025 07:24 3m 12s
[Model][2/N] Automatic conversion of CrossEncoding model. Part 2
Cleanup PR Body #13537: Pull request #19978 edited by noooop
June 24, 2025 06:58 1m 49s
June 24, 2025 06:58 1m 49s
[WIP][V1][P/D]Support automatic instance removal after crash for P2pNcclConnector
Cleanup PR Body #13536: Pull request #20006 opened by Abatom
June 24, 2025 06:38 21s
June 24, 2025 06:38 21s
[Model][2/N] Automatic conversion of CrossEncoding model. Part 2
Cleanup PR Body #13535: Pull request #19978 edited by noooop
June 24, 2025 06:38 17s
June 24, 2025 06:38 17s
[Model][2/N] Automatic conversion of CrossEncoding model. Part 2
Cleanup PR Body #13534: Pull request #19978 edited by noooop
June 24, 2025 06:37 12s
June 24, 2025 06:37 12s
[Model][2/N] Automatic conversion of CrossEncoding model. Part 2
Cleanup PR Body #13533: Pull request #19978 edited by noooop
June 24, 2025 06:33 2m 3s
June 24, 2025 06:33 2m 3s
[Model][2/N] Automatic conversion of CrossEncoding model. Part 2
Cleanup PR Body #13532: Pull request #19978 edited by noooop
June 24, 2025 06:33 28s
June 24, 2025 06:33 28s
[Frontend] Added support for HermesToolParser for models without special tokens
Cleanup PR Body #13531: Pull request #16890 edited by minpeter
June 24, 2025 04:49 15s
June 24, 2025 04:49 15s
[Feature] add quick all reduce
Cleanup PR Body #13530: Pull request #19744 edited by lihaoyang-amd
June 24, 2025 03:29 13s
June 24, 2025 03:29 13s
[PERF] Use faster way of decode in tokenizer: avoid useless list-to-list conversion
Cleanup PR Body #13529: Pull request #20000 opened by vadiklyutiy
June 24, 2025 01:29 13s
June 24, 2025 01:29 13s
[Feature] Expert Parallelism Load Balancer (EPLB)
Cleanup PR Body #13528: Pull request #18343 edited by abmfy
June 24, 2025 01:22 13s
June 24, 2025 01:22 13s
[Llama4] Update attn_temperature_tuning
Cleanup PR Body #13527: Pull request #19997 edited by b8zhong
June 24, 2025 01:08 20s
June 24, 2025 01:08 20s
[Llama4] Update attn_temperature_tuning
Cleanup PR Body #13526: Pull request #19997 edited by b8zhong
June 24, 2025 01:07 21s
June 24, 2025 01:07 21s
[Llama4] Update attn_temperature_tuning
Cleanup PR Body #13525: Pull request #19997 edited by b8zhong
June 24, 2025 00:38 14s
June 24, 2025 00:38 14s
[Models] Remove GPU-CPU sync when do_pan_and_scan=false in Gemma3
Cleanup PR Body #13524: Pull request #19999 opened by lgeiger
June 23, 2025 23:47 13s
June 23, 2025 23:47 13s
[Feature] Expert Parallelism Load Balancer (EPLB)
Cleanup PR Body #13523: Pull request #18343 edited by abmfy
June 23, 2025 22:43 13s
June 23, 2025 22:43 13s
[Llama4] Update attn_temperature_tuning
Cleanup PR Body #13522: Pull request #19997 edited by b8zhong
June 23, 2025 22:37 14s
June 23, 2025 22:37 14s
[Llama4] Update attn_temperature_tuning
Cleanup PR Body #13521: Pull request #19997 opened by b8zhong
June 23, 2025 22:33 12s
June 23, 2025 22:33 12s
Adds OTEL instrumentation to OpenAI API server
Cleanup PR Body #13520: Pull request #19987 edited by bbartels
June 23, 2025 19:28 27s
June 23, 2025 19:28 27s
[TPU] Fix tpu model runner test
Cleanup PR Body #13519: Pull request #19995 opened by Chenyaaang
June 23, 2025 18:44 12s
June 23, 2025 18:44 12s
[P/D] Asynchronously do _nixl_handshake
Cleanup PR Body #13518: Pull request #19836 edited by lk-chen
June 23, 2025 18:43 17s
June 23, 2025 18:43 17s
[Misc][Benchmark] Remove colon from key 'request_goodput:'
Cleanup PR Body #13517: Pull request #16018 edited by appleparan
June 23, 2025 18:38 21s
June 23, 2025 18:38 21s
Update test case parameter to have the throughput above 8.0
Cleanup PR Body #13516: Pull request #19994 opened by QiliangCui
June 23, 2025 17:26 32s
June 23, 2025 17:26 32s
[Misc] Clean up InternVL family config registration
Cleanup PR Body #13515: Pull request #19992 opened by Isotr0py
June 23, 2025 16:36 14s
June 23, 2025 16:36 14s
[Draft][torch.compile][ROCm][V1] Enable attention output FP8 fusion for V1 attention backends
Cleanup PR Body #13514: Pull request #19767 edited by gshtras
June 23, 2025 16:22 16s
June 23, 2025 16:22 16s