[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument #17677

varun-sundar-rabindranath · 2025-05-05T21:26:39Z

LoRA triton kernels fail to compile on non-CUDA gpus because the maxnreg argument is recognized only on CUDA platforms.

Fix: The maxnreg argument isn't used and it is safe to retire completely.

FIX #16676

github-actions · 2025-05-05T21:26:49Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

mergify · 2025-05-05T21:27:13Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @varun-sundar-rabindranath.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Signed-off-by: varun sundar rabindranath <[email protected]>

robertgshaw2-redhat · 2025-05-05T21:36:54Z

Thanks Varun!

…ect#17677) Signed-off-by: Mu Huai <[email protected]>

…ect#17677)

Syncing midstream NM fork to Upstream tag of [v0.8.5.post1](https://github.com/vllm-project/vllm/tree/v0.8.5.post1) + cherry pick of vllm-project@be633fb needed for benchmarks + [CP](neuralmagic/nm-vllm-ent@1fe447d) for compressed tensor bump + [CP](vllm-project#17677) for lora on AMD + [CP](vllm-project#17315) for llama4 w/ pure dense layers ``` commit 31c73ba (HEAD -> upstream-v0.8.5, nm-fork/upstream-v0.8.5) Author: Chauncey <[email protected]> Date: Wed Apr 30 15:11:04 2025 +0800 [Bugfix] Fix AttributeError: 'State' object has no attribute 'engine_client' (vllm-project#17434) Signed-off-by: chaunceyjiang <[email protected]> commit f8db0bd Author: Lucas Wilkinson <[email protected]> Date: Fri May 2 14:01:38 2025 -0400 [BugFix][Attention] Fix sliding window attention in V1 giving incorrect results (vllm-project#17574) Signed-off-by: Lucas Wilkinson <[email protected]> commit e335c34 Author: Robert Shaw <[email protected]> Date: Fri May 2 04:07:03 2025 -0400 [BugFix] Fix Memory Leak (vllm-project#17567) Signed-off-by: [email protected] <[email protected]> commit cc463fe Merge: 1e358ff ba41cc9 Author: Selbi Nuryyeva <[email protected]> Date: Tue Apr 29 12:34:57 2025 -0400 Merge branch 'tag-upstream-v0.8.5' into upstream-v0.8.5 commit ba41cc9 (tag: v0.8.5, tag-upstream-v0.8.5) Author: Michael Goin <[email protected]> Date: Mon Apr 28 16:20:24 2025 -0600 [Model] Add tuned triton fused_moe configs for Qwen3Moe (vllm-project#17328) Signed-off-by: mgoin <[email protected]> commit dcbac4c Author: Simon Mo <[email protected]> Date: Mon Apr 28 14:12:01 2025 -0700 [Model] Qwen3 Dense FP8 Compat Fixes (vllm-project#17318) Signed-off-by: simon-mo <[email protected]> [...] ``` Commands ``` git fetch upstream git checkout -b upstream-v0.8.5 git merge upstream/v0.8.5 git cherry-pick be633fb ``` TEST PLAN accept sync: https://github.com/neuralmagic/nm-cicd/actions/runs/14841223552 related PR in cicd: neuralmagic/nm-cicd#99 release workflow: https://github.com/neuralmagic/nm-cicd/actions/runs/14845693864

…ect#17677)

…ect#17677) Signed-off-by: Yuqi Zhang <[email protected]>

…ect#17677) Signed-off-by: minpeter <[email protected]>

mergify bot added the needs-rebase label May 5, 2025

retire unused maxnreg lora arg

8c5ecb5

Signed-off-by: varun sundar rabindranath <[email protected]>

varun-sundar-rabindranath force-pushed the varun/retire-maxnreg branch from a68945d to 8c5ecb5 Compare May 5, 2025 21:30

mergify bot removed the needs-rebase label May 5, 2025

varun-sundar-rabindranath mentioned this pull request May 5, 2025

[Bugfix] Adding maxnreg to lora expand/shrink kernel definition #17671

Closed

robertgshaw2-redhat approved these changes May 5, 2025

View reviewed changes

simon-mo merged commit 90bd2ae into vllm-project:main May 6, 2025
27 of 29 checks passed

NickLucche mentioned this pull request May 6, 2025

[Misc] Add Next Edit Prediction (NEP) datasets support in benchmark_serving.py #16839

Merged

1 task

RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument (vllm-proj…

a589edc

…ect#17677) Signed-off-by: Mu Huai <[email protected]>

dtrifiro pushed a commit to red-hat-data-services/vllm that referenced this pull request May 13, 2025

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument (vllm-proj…

278cc0f

…ect#17677)

mawong-amd pushed a commit to ROCm/vllm that referenced this pull request May 14, 2025

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument (vllm-proj…

a5bae91

…ect#17677)

ckhordiasma mentioned this pull request May 14, 2025

nm vllm ent 0.8.5 sync red-hat-data-services/vllm#139

Merged

zzzyq pushed a commit to zzzyq/vllm that referenced this pull request May 24, 2025

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument (vllm-proj…

13ca814

…ect#17677) Signed-off-by: Yuqi Zhang <[email protected]>

minpeter pushed a commit to minpeter/vllm that referenced this pull request Jun 24, 2025

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument (vllm-proj…

13a3c07

…ect#17677) Signed-off-by: minpeter <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument #17677

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument #17677

Uh oh!

varun-sundar-rabindranath commented May 5, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented May 5, 2025

Uh oh!

mergify bot commented May 5, 2025

Uh oh!

robertgshaw2-redhat commented May 5, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument #17677

[Bugfix] LoRA - Retire unused maxnreg LoRA kernel argument #17677

Uh oh!

Conversation

varun-sundar-rabindranath commented May 5, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented May 5, 2025

Uh oh!

mergify bot commented May 5, 2025

Uh oh!

robertgshaw2-redhat commented May 5, 2025

Uh oh!

Uh oh!

Uh oh!

varun-sundar-rabindranath commented May 5, 2025 •

edited by github-actions bot

Loading