[Bugfix] Fix Lora Name Parsing #17196

alex-jw-brooks · 2025-04-25T15:59:56Z

Currently, we assume LoRA weight names start with `base_model.model.` and strip that prefix when parsing them. This is usually true, but some LoRA adapters don't have this prefix. One example is Granite Speech 3.3, where the audio-specific LoRA weights are bundled directly with the model so that they can be loaded automatically with from_pretrained using the transformers PEFT mixin; as a result, the weight names align with the transformers (not PEFT) model tensor names.

This fix keeps the full name if the LoRA tensors don't start with `base_model.model.`. This is needed for #16246 to work correctly, since the LoRA is currently not applied due to erroneous slicing (e.g., the expected module name is language_model.model.layers.1.self_attn_q_proj, but the parsed name is layers.1.self_attn_q_proj).
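For illustration, the difference can be sketched as below. This is a minimal standalone sketch, not the code in vllm/lora/utils.py: the helper names `drop_first_two_components` and `strip_peft_prefix` are hypothetical, and the "always drop the first two dotted components" behaviour is an assumption about how the erroneous slicing arises, inferred from the example above.

```python
# Illustrative sketch only; the helper names are hypothetical, not vLLM APIs.
PEFT_PREFIX = "base_model.model."


def drop_first_two_components(tensor_name: str) -> str:
    """Assumed old behaviour: always discard the first two dotted components."""
    return ".".join(tensor_name.split(".")[2:])


def strip_peft_prefix(tensor_name: str) -> str:
    """Intended behaviour: strip the PEFT prefix only if it is present."""
    if tensor_name.startswith(PEFT_PREFIX):
        return tensor_name[len(PEFT_PREFIX):]
    # No PEFT prefix: keep the full name so it still matches the model's modules.
    return tensor_name


peft_name = "base_model.model.layers.1.self_attn_q_proj"
bundled_name = "language_model.model.layers.1.self_attn_q_proj"

print(drop_first_two_components(bundled_name))  # layers.1.self_attn_q_proj (prefix wrongly lost)
print(strip_peft_prefix(bundled_name))          # language_model.model.layers.1.self_attn_q_proj
print(strip_peft_prefix(peft_name))             # layers.1.self_attn_q_proj
```

With unconditional slicing, the bundled adapter's name no longer matches any module in the model, which would explain why the LoRA is silently not applied.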

Signed-off-by: Alex-Brooks <[email protected]>

github-actions · 2025-04-25T16:00:07Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

jeejeelee · 2025-04-26T01:41:36Z

Thank you for your contribution, will look at this ASAP

vllm/lora/utils.py

jeejeelee

Thank you for your contribution, LGTM

alex-jw-brooks added 2 commits April 25, 2025 15:28

Fix lora weight name parsing

6a5730d

Signed-off-by: Alex-Brooks <[email protected]>

Add failing cases for lora name parsing

b4bc64a

Signed-off-by: Alex-Brooks <[email protected]>

Update comment

001e63a

Signed-off-by: Alex-Brooks <[email protected]>

alex-jw-brooks force-pushed the fix_lora_parse branch from e919ee9 to 001e63a April 25, 2025 16:43

alex-jw-brooks mentioned this pull request Apr 25, 2025

[Model] Add Granite Speech Support #16246

Merged

jeejeelee self-requested a review April 26, 2025 01:40

jeejeelee reviewed Apr 27, 2025

vllm/lora/utils.py (outdated; resolved)

alex-jw-brooks and others added 2 commits April 27, 2025 09:04

Update vllm/lora/utils.py

a7ea22f

Co-authored-by: Jee Jee Li <[email protected]> Signed-off-by: Alex-Brooks <[email protected]>

Fix formatting

5504775

Signed-off-by: Alex-Brooks <[email protected]>

alex-jw-brooks force-pushed the fix_lora_parse branch from c4d61ef to 5504775 April 27, 2025 09:10

jeejeelee approved these changes Apr 27, 2025

jeejeelee added the ready label (ONLY add when PR is ready to merge/full CI is needed) Apr 27, 2025

jeejeelee merged commit 756848e into vllm-project:main Apr 27, 2025
63 checks passed

jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Apr 29, 2025

[Bugfix] Fix Lora Name Parsing (vllm-project#17196)

a4f900c

Signed-off-by: Alex-Brooks <[email protected]> Co-authored-by: Jee Jee Li <[email protected]>

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025

[Bugfix] Fix Lora Name Parsing (vllm-project#17196)

46ed589

Signed-off-by: Alex-Brooks <[email protected]> Co-authored-by: Jee Jee Li <[email protected]>

RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025

[Bugfix] Fix Lora Name Parsing (vllm-project#17196)

36b07da

Signed-off-by: Alex-Brooks <[email protected]> Co-authored-by: Jee Jee Li <[email protected]> Signed-off-by: Mu Huai <[email protected]>

ckhordiasma mentioned this pull request May 14, 2025

nm vllm ent 0.8.5 sync red-hat-data-services/vllm#139

Merged

zzzyq pushed a commit to zzzyq/vllm that referenced this pull request May 24, 2025

[Bugfix] Fix Lora Name Parsing (vllm-project#17196)

b3a704f

Signed-off-by: Alex-Brooks <[email protected]> Co-authored-by: Jee Jee Li <[email protected]> Signed-off-by: Yuqi Zhang <[email protected]>

minpeter pushed a commit to minpeter/vllm that referenced this pull request Jun 24, 2025

[Bugfix] Fix Lora Name Parsing (vllm-project#17196)

d90d4f3

Signed-off-by: Alex-Brooks <[email protected]> Co-authored-by: Jee Jee Li <[email protected]> Signed-off-by: minpeter <[email protected]>
