Use VLLM_WORKER_MULTIPROC_METHOD=spawn instead of --forked for tests #268

tjohnson31415 · 2025-06-27T23:03:49Z

This PR lets us remove the requirement of --forked from our pytest tests.

The hang observed without --forked is caused by a known issue with libgomp and threading (see this gcc bug report, which is marked "won't fix"). It is a common problem in Python due to the use of native libraries behind the scenes. If a process is forked after an OpenMP thread pool has been created, the child does not have a thread pool, and it hangs the next time the code enters a parallel context.

This comes up in our tests because we use transformers to compare the generation results. vLLM and PyTorch delay initializing the thread pool until it is needed. When using only vLLM in V1, that initialization does not happen in the frontend process, so it is OK to use fork(). However, calling transformers' model.generate in the main process during the tests initializes the thread pool, which causes the next attempt to create a vllm.LLM to hang in the forked worker process.
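To make the fork/spawn distinction concrete, here is a minimal sketch using only the standard library's multiprocessing module. It does not involve vLLM, PyTorch, or OpenMP, and the helper names (report_pid, run_in_child) are made up for illustration. The point is only to show where the start method is chosen: a forked child is a clone of the parent and inherits whatever in-memory state native libraries had already set up, while a spawned child is a fresh interpreter.

```python
import multiprocessing as mp
import os


def report_pid(queue):
    """Runs in the child: send the child's PID back to the parent."""
    queue.put(os.getpid())


def run_in_child(start_method):
    """Start one child with the given start method and return its PID.

    "fork" clones the parent process, so the child inherits the parent's
    memory as it was at fork time, but none of its other threads.
    "spawn" launches a fresh interpreter that initializes from scratch.
    """
    ctx = mp.get_context(start_method)
    queue = ctx.Queue()
    proc = ctx.Process(target=report_pid, args=(queue,))
    proc.start()
    child_pid = queue.get(timeout=60)
    proc.join()
    return child_pid
```

In a script, a call such as run_in_child("spawn") has to sit under an `if __name__ == "__main__":` guard, because a spawned child re-imports the main module and would otherwise try to start children of its own.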

With spawn, the new process is created from scratch and creates its own OpenMP thread pool. There are trade-offs here too, e.g. using spawn in offline mode requires particular handling in a script. The vLLM docs have a good summary of the trade-offs of this setting in REF. Because of that, I only set it in the test environment. Using the vllm CLI to run the code actually defaults to spawn anyway (REF).
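As a rough illustration of limiting the setting to the test environment, a conftest.py could set the variable before any test constructs a vllm.LLM. This is a sketch under assumptions, not the code from this PR: the helper name configure_worker_start_method is hypothetical, and using setdefault (so that a value already exported in the shell wins) is a design choice made here for the example.

```python
import os


def configure_worker_start_method(env=os.environ):
    """Default vLLM worker processes to "spawn" unless already set.

    Hypothetical helper for illustration; `env` is injectable so the
    behavior can be checked without touching the real environment.
    """
    env.setdefault("VLLM_WORKER_MULTIPROC_METHOD", "spawn")
    return env["VLLM_WORKER_MULTIPROC_METHOD"]


# Run at import time so the variable is in place before any test
# creates worker processes.
configure_worker_start_method()
```

Because pytest imports conftest.py before collecting tests, a module-level call like this takes effect for the whole session while leaving non-test usage of vLLM untouched.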

FIX #146

github-actions · 2025-06-27T23:03:58Z

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

prashantgupta24 · 2025-06-30T20:47:09Z

bot:test
MARKERS="spyre"

prashantgupta24 · 2025-06-30T21:09:57Z

bot:test
MARKERS="spyre and not cb"

prashantgupta24 · 2025-06-30T21:21:46Z

bot:test
MARKERS="spyre and not cb"

prashantgupta24 · 2025-06-30T21:38:51Z

bot:test
MARKERS="spyre and not quantized and not multi and not embedding and not cb"

.github/workflows/test.yml

Signed-off-by: Travis Johnson <[email protected]>

tests/conftest.py

joerunde

lpgtm

tjohnson31415 force-pushed the try-tests-no-fork branch from d834d5b to fa06501 Compare July 1, 2025 15:47

prashantgupta24 reviewed Jul 1, 2025

.github/workflows/test.yml

tjohnson31415 added 3 commits July 3, 2025 13:06

try using OMP_NUM_THREADS=1 to allow non-forked tests

667cc61

Signed-off-by: Travis Johnson <[email protected]>

try spawn instead of OMP_NUM_THREADS=1

8e8cdc4

Signed-off-by: Travis Johnson <[email protected]>

remove pytest-forked

e21df75

Signed-off-by: Travis Johnson <[email protected]>

tjohnson31415 force-pushed the try-tests-no-fork branch from 45ff0fe to e21df75 Compare July 3, 2025 19:06

tjohnson31415 changed the title ~~[DO NOT MERGE] Try using OMP_NUM_THREADS=1 to allow non-forked tests~~ Use VLLM_WORKER_MULTIPROC_METHOD=spawn instead of --forked for tests Jul 3, 2025

tjohnson31415 marked this pull request as ready for review July 3, 2025 19:54

tjohnson31415 requested review from joerunde and ckadner as code owners July 3, 2025 19:54

prashantgupta24 approved these changes Jul 7, 2025

tjohnson31415 requested review from rafvasq and sducouedic as code owners July 8, 2025 19:58

fix: move setting VLLM_WORKER_MULTIPROC_METHOD to conftest.py

a75e4df

Signed-off-by: Travis Johnson <[email protected]>

tjohnson31415 force-pushed the try-tests-no-fork branch from ca6f42c to a75e4df Compare July 8, 2025 20:24

prashantgupta24 reviewed Jul 9, 2025

tests/conftest.py

joerunde approved these changes Jul 9, 2025

prashantgupta24 merged commit 697e3ba into main Jul 9, 2025
18 checks passed

prashantgupta24 deleted the try-tests-no-fork branch July 9, 2025 16:54
