[Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env #17142

jamesjwu · 2025-04-24T22:02:02Z

Description

After https://github.com/pytorch/pytorch/pull/151563/files, VLLM needs to patch an extra _get_shape_env() function when running inductor, as AOTAutogradCache now uses its own shape env function. The implementation is technically shared at torch._inductor.codecache.GuardedCache._get_shape_env, but adding it like this preserves backward compatibility with PyTorch 2.6.

Test Plan

The following program now runs when linking VLLM with pytorch main

from vllm import LLM, SamplingParams
import argparse

if __name__ == "__main__":
    parser = argparse.ArgumentParser(description='Script with a boolean flag')
    parser.add_argument('--model',
                        type=str,
                        default="facebook/opt-125m",
                        help='Model to use for generation')
    args = parser.parse_args()
    prompts = [
        "Hello, my name is",
        "The president of the United States is",
        "The capital of France is",
        "The future of AI is",
    ]
    sampling_params = SamplingParams(temperature=0.8, top_p=0.95)
    model = args.model

    compilation_config = {
        'level': 3,
        'compile_sizes': [1],
    }
    llm = LLM(model=model, compilation_config=compilation_config)
    outputs = llm.generate(prompts, sampling_params)

    for output in outputs:
        prompt = output.prompt
        generated_text = output.outputs[0].text
        print(f"Prompt: {prompt!r}, Generated text: {generated_text!r}")

github-actions · 2025-04-24T22:02:11Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

zou3519

LGTM assuming the tests pass

houseroad

Can we add some unittest in the following PR to ensure we catch the issue in future?

houseroad · 2025-04-24T22:17:38Z

could you fix the pre-commit?

jamesjwu · 2025-04-25T02:23:37Z

Fixed the pre-commit and gated the change to be backward compatible
Will think about how to better unit test this as a followup: the thing is, the failure only happens on the latest version of torch (which VLLM does not pin), so in a way, all the existing unit tests would fail anyway once the pin updates. In fact, the unit test probably belongs on pytorch side, not here.

Signed-off-by: James Wu <[email protected]>

…t#17142) Signed-off-by: James Wu <[email protected]>

…t#17142) Signed-off-by: James Wu <[email protected]> Signed-off-by: Agata Dobrzyniewicz <[email protected]>

…t#17142) Signed-off-by: James Wu <[email protected]> Signed-off-by: Mu Huai <[email protected]>

…t#17142) Signed-off-by: James Wu <[email protected]> Signed-off-by: Yuqi Zhang <[email protected]>

…t#17142) Signed-off-by: James Wu <[email protected]> Signed-off-by: minpeter <[email protected]>

jamesjwu changed the title ~~Patch AOTAutogradCache._get_shape_env~~ [Bugfix] Patch AOTAutogradCache._get_shape_env Apr 24, 2025

jamesjwu force-pushed the patch-shape-env branch from 3393d47 to e0f6153 Compare April 24, 2025 22:03

jamesjwu changed the title ~~[Bugfix] Patch AOTAutogradCache._get_shape_env~~ [Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env Apr 24, 2025

zou3519 requested review from tlrmchlsmth, mgoin, youkaichao and houseroad April 24, 2025 22:07

zou3519 approved these changes Apr 24, 2025

View reviewed changes

jamesjwu force-pushed the patch-shape-env branch from e0f6153 to 4499dd6 Compare April 24, 2025 22:07

houseroad approved these changes Apr 24, 2025

View reviewed changes

jamesjwu force-pushed the patch-shape-env branch from 4499dd6 to 497e9af Compare April 25, 2025 02:20

Patch AOTAutogradCache._get_shape_env

450f999

Signed-off-by: James Wu <[email protected]>

jamesjwu force-pushed the patch-shape-env branch from 497e9af to 450f999 Compare April 25, 2025 15:33

houseroad added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 25, 2025

DarkLight1337 merged commit a6e72e1 into vllm-project:main Apr 26, 2025
59 checks passed

jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Apr 29, 2025

[Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env (vllm-projec…

d3b39d2

…t#17142) Signed-off-by: James Wu <[email protected]>

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025

[Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env (vllm-projec…

b2d1b35

…t#17142) Signed-off-by: James Wu <[email protected]>

adobrzyn pushed a commit to HabanaAI/vllm-fork that referenced this pull request Apr 30, 2025

[Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env (vllm-projec…

9906716

…t#17142) Signed-off-by: James Wu <[email protected]> Signed-off-by: Agata Dobrzyniewicz <[email protected]>

RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025

[Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env (vllm-projec…

cf760f1

…t#17142) Signed-off-by: James Wu <[email protected]> Signed-off-by: Mu Huai <[email protected]>

ckhordiasma mentioned this pull request May 14, 2025

nm vllm ent 0.8.5 sync red-hat-data-services/vllm#139

Merged

zzzyq pushed a commit to zzzyq/vllm that referenced this pull request May 24, 2025

[Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env (vllm-projec…

5e1c1f9

…t#17142) Signed-off-by: James Wu <[email protected]> Signed-off-by: Yuqi Zhang <[email protected]>

minpeter pushed a commit to minpeter/vllm that referenced this pull request Jun 24, 2025

[Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env (vllm-projec…

d65e0ff

…t#17142) Signed-off-by: James Wu <[email protected]> Signed-off-by: minpeter <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env #17142

[Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env #17142

Uh oh!

jamesjwu commented Apr 24, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Apr 24, 2025

Uh oh!

zou3519 left a comment

Uh oh!

houseroad left a comment

Uh oh!

houseroad commented Apr 24, 2025

Uh oh!

jamesjwu commented Apr 25, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env #17142

[Bugfix] [pytorch] Patch AOTAutogradCache._get_shape_env #17142

Uh oh!

Conversation

jamesjwu commented Apr 24, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Test Plan

Uh oh!

github-actions bot commented Apr 24, 2025

Uh oh!

zou3519 left a comment

Choose a reason for hiding this comment

Uh oh!

houseroad left a comment

Choose a reason for hiding this comment

Uh oh!

houseroad commented Apr 24, 2025

Uh oh!

jamesjwu commented Apr 25, 2025

Uh oh!

Uh oh!

Uh oh!

jamesjwu commented Apr 24, 2025 •

edited by github-actions bot

Loading