Skip to content

How to Achieve Data-Parallel Offline Batch Inference Using vLLM? #14283

Closed Answered by StefanHeng
T-Atlas asked this question in Q&A
Discussion options

You must be logged in to vote

I'm in a similar situation. I saw and read this issue.

For me I think it's easier to just write my own multi processing code, one for each vLLM myself.

Also found an official vLLM data parallel example here.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by hmellor
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants