[Core] Fix standalone runs of test_reset_prefix_cache_e2e #29899

markmc · 2025-12-02T16:49:07Z

This test was added by vllm-project#28827. In CI it passes, with the following in the log:

```
[2025-12-02T03:08:44Z] v1/core/test_reset_prefix_cache_e2e.py::test_reset_prefix_cache_e2e INFO 12-01 19:08:44 [model.py:637] Resolved architecture: Qwen3ForCausalLM
...
[2025-12-02T03:08:45Z] WARNING 12-01 19:08:45 [system_utils.py:136] [..] Overriding VLLM_WORKER_MULTIPROC_METHOD to 'spawn'. [..] Reasons: CUDA is initialized
```

This override doesn't happen when the test is run locally, standalone, and the test then fails with:

```
E AssertionError: ground_truth_results['ground_truth_0'].outputs[0].text= - but it could cause serious harm. - It is a human preempted_results['preempted_0'].outputs[0].text= - but it could cause serious harm. - This is the case
E assert ' - but it c...It is a human' == ' - but it c...s is the case'
E
E - - but it could cause serious harm. - This is the case
E ? ^^ ^^^^ ^^^ ^^
E + - but it could cause serious harm. - It is a human
E ? ^^^ ^^ ^^ ^

tests/v1/core/test_reset_prefix_cache_e2e.py:60: AssertionError
```

Forcing "spawn" fixes it.

Signed-off-by: Mark McLoughlin <markmc@redhat.com>
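For readers who want to see the shape of the fix, here is a minimal sketch, not the actual diff. It assumes the test sets `VLLM_WORKER_MULTIPROC_METHOD` through pytest's `monkeypatch` fixture before any engine is created. The function name `test_reset_prefix_cache_standalone_sketch` and the helper `force_spawn` are hypothetical; only the environment variable name comes from the log above.

```python
import os


def force_spawn(monkeypatch) -> None:
    """Pin the worker start method to 'spawn' for the duration of one test.

    `monkeypatch` is pytest's built-in fixture; it restores the previous
    value of the environment variable when the test finishes.
    """
    monkeypatch.setenv("VLLM_WORKER_MULTIPROC_METHOD", "spawn")


def test_reset_prefix_cache_standalone_sketch(monkeypatch):
    # Hypothetical test body: set the variable first, so that anything
    # created afterwards sees 'spawn' whether or not CUDA was already
    # initialized by an earlier test in the same process.
    force_spawn(monkeypatch)
    assert os.environ["VLLM_WORKER_MULTIPROC_METHOD"] == "spawn"
    # ... build the engine and compare ground-truth vs. preempted outputs here
```

Setting the variable inside the test, rather than relying on the CI-only override, makes the standalone run and the CI run use the same start method.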

gemini-code-assist

Code Review

This pull request addresses a test flakiness issue in test_reset_prefix_cache_e2e by forcing the multiprocessing start method to 'spawn'. The change is well-justified, as using 'spawn' is a known and effective way to ensure determinism in tests involving CUDA, preventing issues that can arise from forked processes. The implementation is clean, using pytest's monkeypatch fixture, which is the standard approach for managing environment variables within tests. The change is correct and improves the reliability of the test suite. I have no further comments.
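As background on the start method the review mentions, the sketch below uses only Python's standard `multiprocessing` module to pick a start method by name. It is an illustration under assumptions, not vLLM's code: the helper `worker_context` is hypothetical, and the fallback to `"spawn"` when the variable is unset is chosen purely for the example.

```python
import multiprocessing as mp
import os


def worker_context(method: str):
    """Return a multiprocessing context for the named start method.

    Raises ValueError if the platform does not support that method
    (for example, 'fork' is unavailable on Windows).
    """
    if method not in mp.get_all_start_methods():
        raise ValueError(f"unsupported start method: {method!r}")
    return mp.get_context(method)


# Read the method from the environment; the 'spawn' fallback is an
# assumption for this sketch, not a statement about vLLM's default.
requested = os.environ.get("VLLM_WORKER_MULTIPROC_METHOD", "spawn")
ctx = worker_context(requested)
```

A spawned child starts a fresh interpreter instead of inheriting the parent's memory, so state the parent already initialized, such as a CUDA context, is not carried over into the worker.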

markmc requested a review from zhuohan123 December 2, 2025 16:49

markmc requested review from ApostaC, WoosukKwon, alexm-redhat, heheda12345, njhill, robertgshaw2-redhat and ywang96 as code owners December 2, 2025 16:49

mergify bot added the v1 label Dec 2, 2025

gemini-code-assist bot reviewed Dec 2, 2025
