[V1] Enable multi-input by default #15799
Overall looks reasonable to me!
This PR enables multiple multi-modal input items per prompt for V1 without having to set `limit_mm_per_prompt`.

Note: This may increase the default memory usage for multi-modal models because `max_num_mm_items_decoder_budget` no longer limits `max_num_mm_items` in `GPUModelRunner.profile_run`. You can explicitly set the limit back to one via `limit_mm_per_prompt`, or disable an unused modality entirely by setting its limit to zero. I have added a section to the Offline Inference docs accordingly.

There is no need to set limits for V1, since the encoder and decoder are profiled separately, which should avoid OOM at inference time. The only hard limit is the context length, which is already checked in `Processor._validate_model_inputs`.

Note: Users can still set `limit_mm_per_prompt` to exclude individual modalities from being profiled and used in inference.

This is loosely a follow-up to #15703, which removed the direct dependency of various models on multimodal limits.
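For reference, a minimal sketch of how `limit_mm_per_prompt` can be used in offline inference to cap the number of items per modality (the model name, image files, and prompt format below are placeholder assumptions, not part of this PR):

```python
from vllm import LLM
from PIL import Image

# Hypothetical example: model name and image paths are placeholders.
# With this change, V1 accepts multiple multi-modal items per prompt by
# default; limit_mm_per_prompt is only needed to cap memory usage
# (e.g. back to 1 item) or to disable a modality by setting its limit to 0.
llm = LLM(
    model="llava-hf/llava-1.5-7b-hf",
    limit_mm_per_prompt={"image": 2},  # allow at most two images per prompt
)

images = [Image.open("cat.jpg"), Image.open("dog.jpg")]

outputs = llm.generate({
    # Prompt format depends on the model's chat template; this follows the
    # LLaVA-1.5 style with one <image> placeholder per image item.
    "prompt": "USER: <image>\n<image>\nDescribe the two images. ASSISTANT:",
    "multi_modal_data": {"image": images},
})
print(outputs[0].outputs[0].text)
```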
Some other changes: