[Misc]add coding benchmark for speculative decoding #15303

CXIAAAAA · 2025-03-21T20:20:15Z

add likaixin/InstructCoder for speculative decoding benchmark throughput

to run instruct coder benchmark:

VLLM_WORKER_MULTIPROC_METHOD=spawn VLLM_USE_V1=1  python3 benchmarks/benchmark_throughput.py --dataset-name=instructcoder --model <you hf model> --input-len 1000 --output-len 100 --num-prompts 2048 --async-engine

to run random benchmark:

VLLM_WORKER_MULTIPROC_METHOD=spawn VLLM_USE_V1=1  python3 benchmarks/benchmark_throughput.py --dataset-name=random --model <you hf model> --input-len 1000 --output-len 100 --num-prompts 2048 --async-engine

github-actions · 2025-03-21T20:20:24Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

youran-qi

Thank you and LGTM!

For your references, here are the statistics about the input / output lengths (number of tokens) in this dataset. Not sure whether you want to change DEFAULT_OUTPUT_LEN accordingly

	avg	min	max
instruction + input	151	15	837
output	179	9	1317

LiuXiaoxuanPKU

LGTM, @ywang96 please also take a look in case I miss anything!

Also, I'm just wondering if it's possible to share some simple benchmark results on the instructcoder dataset, really appreciate it!

LiuXiaoxuanPKU · 2025-03-22T15:44:01Z

benchmarks/benchmark_dataset.py

+
+# -----------------------------------------------------------------------------
+# Instruct Coder Dataset Implementation
+# -----------------------------------------------------------------------------


Could you add simple description about the dataset, such as 'it includes code-editing tasks such as comment insertion, code optimization, and code refactoring.'

CXIAAAAA added 3 commits March 20, 2025 23:34

empty commit

0b32d57

initial commit

835d970

rm unnessary change

e70da14

CXIAAAAA mentioned this pull request Mar 21, 2025

[V1] Add code dataset to benchmark the performance of spec decode #14013

Open

youran-qi approved these changes Mar 21, 2025

View reviewed changes

LiuXiaoxuanPKU reviewed Mar 22, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Misc]add coding benchmark for speculative decoding #15303

[Misc]add coding benchmark for speculative decoding #15303

CXIAAAAA commented Mar 21, 2025 •

edited by github-actions bot

Loading

github-actions bot commented Mar 21, 2025

youran-qi left a comment •

edited

Loading

LiuXiaoxuanPKU left a comment •

edited

Loading

LiuXiaoxuanPKU Mar 22, 2025

[Misc]add coding benchmark for speculative decoding #15303

Are you sure you want to change the base?

[Misc]add coding benchmark for speculative decoding #15303

Conversation

CXIAAAAA commented Mar 21, 2025 • edited by github-actions bot Loading

github-actions bot commented Mar 21, 2025

youran-qi left a comment • edited Loading

Choose a reason for hiding this comment

LiuXiaoxuanPKU left a comment • edited Loading

Choose a reason for hiding this comment

LiuXiaoxuanPKU Mar 22, 2025

Choose a reason for hiding this comment

CXIAAAAA commented Mar 21, 2025 •

edited by github-actions bot

Loading

youran-qi left a comment •

edited

Loading

LiuXiaoxuanPKU left a comment •

edited

Loading