Fix Qwen2.5-Omni get_chunked_index chunking functionality #37631

imkero · 2025-04-19T15:36:29Z

What does this PR do?

This PR fixes the incorrect mrope position chunking function get_chunked_index in modular_qwen2_5_omni.

The shape of token_indices for modular_qwen2_5_omni.get_chunked_index is (3, seq_len) in main branch

It takes a constant value 3 (comes from len(token_indices)) in the loop condition to iterate the token_indices input (instead of the correct value token_indices.shape[1]). This will make it always produce only a single chunk. (Line 1160 vs Line 1166)

transformers/src/transformers/models/qwen2_5_omni/modular_qwen2_5_omni.py

Lines 1157 to 1168 in 27a25be

    
           def _iter(): 
        
               i, start_idx = 0, 0  # skip bos token 
        
               current_chunk = 1 
        
               while i < len(token_indices):  # skip eos token 
        
                   if token_indices[0][i] - remove_index >= current_chunk * tokens_per_chunk: 
        
                       yield (start_idx, i) 
        
                       start_idx = i 
        
                       current_chunk += 1 
        
                   i += 1 
        
               yield (start_idx, token_indices.shape[1]) 
        
           return list(_iter())

Another similar but correct impl lies in processing_qwen2_5_omni and we can take it as a reference. (Line 303 vs Line 309)

transformers/src/transformers/models/qwen2_5_omni/processing_qwen2_5_omni.py

Lines 300 to 311 in 27a25be

    
           def _iter(): 
        
               i, start_idx = 0, 0  # skip bos token 
        
               current_chunk = 1 
        
               while i < len(token_indices):  # skip eos token 
        
                   if token_indices[i] >= current_chunk * tokens_per_chunk: 
        
                       yield (start_idx, i) 
        
                       start_idx = i 
        
                       current_chunk += 1 
        
                   i += 1 
        
               yield (start_idx, len(token_indices)) 
        
           return list(_iter())

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@BakerBunker can you take a look about this bugfix please?

github-actions · 2025-04-19T15:36:43Z

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the Ready for review button (at the bottom of the PR page). This will assign reviewers and trigger CI.

BakerBunker · 2025-04-20T00:23:03Z

Thank you, I will look at this tomorrow.

BakerBunker · 2025-04-21T05:40:30Z

LGTM, can be merged after the test cases are passed.

cc @zucchini-nlp

imkero · 2025-04-21T06:27:56Z

I think my change on the spatial_merge_size config broke some unit tests, will fix them soon.

imkero · 2025-04-21T09:04:58Z

I think my change on the spatial_merge_size config broke some unit tests, will fix them soon.

The failed tests have been fixed and pass now.

BakerBunker · 2025-04-22T05:26:45Z

@zucchini-nlp Ready to merge.

zucchini-nlp

Thanks, LGTM

…e#37631) * fix: qwen2.5 omni modular get_rope_index * test: add test for qwen2.5 omni rope index (video with audio input) * style * expected_position_ids readability * fix: use spatial_merge_size = 1 in unit test

github-actions bot marked this pull request as draft April 19, 2025 15:36

imkero changed the title ~~Bugfix/qwen2 5 omni mrope chunk~~ Fix Qwen2.5-Omni get_chunked_index impl Apr 19, 2025

imkero marked this pull request as ready for review April 19, 2025 15:38

github-actions bot requested review from Rocketknight1 and ydshieh April 19, 2025 15:39

imkero mentioned this pull request Apr 19, 2025

[Bugfix] Fix Qwen2.5-Omni M-RoPE position ids generation vllm-project/vllm#16878

Merged

imkero changed the title ~~Fix Qwen2.5-Omni get_chunked_index impl~~ Fix Qwen2.5-Omni get_chunked_index chunking functionality Apr 19, 2025

imkero added 5 commits April 21, 2025 16:55

fix: qwen2.5 omni modular get_rope_index

fd88731

test: add test for qwen2.5 omni rope index (video with audio input)

3ce6737

style

bb41afa

expected_position_ids readability

092716d

fix: use spatial_merge_size = 1 in unit test

ad926cc

imkero force-pushed the bugfix/qwen2-5-omni-mrope-chunk branch from f9aeef6 to ad926cc Compare April 21, 2025 08:55

zucchini-nlp approved these changes Apr 22, 2025

View reviewed changes

zucchini-nlp merged commit 5f79128 into huggingface:main Apr 22, 2025
12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix Qwen2.5-Omni get_chunked_index chunking functionality #37631

Fix Qwen2.5-Omni get_chunked_index chunking functionality #37631

Uh oh!

imkero commented Apr 19, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Apr 19, 2025

Uh oh!

BakerBunker commented Apr 20, 2025

Uh oh!

BakerBunker commented Apr 21, 2025

Uh oh!

imkero commented Apr 21, 2025 •

edited

Loading

Uh oh!

imkero commented Apr 21, 2025

Uh oh!

BakerBunker commented Apr 22, 2025

Uh oh!

zucchini-nlp left a comment

Uh oh!

Uh oh!

Uh oh!

	def _iter():
	i, start_idx = 0, 0 # skip bos token
	current_chunk = 1
	while i < len(token_indices): # skip eos token
	if token_indices[0][i] - remove_index >= current_chunk * tokens_per_chunk:
	yield (start_idx, i)
	start_idx = i
	current_chunk += 1
	i += 1
	yield (start_idx, token_indices.shape[1])

	return list(_iter())

Fix Qwen2.5-Omni get_chunked_index chunking functionality #37631

Fix Qwen2.5-Omni get_chunked_index chunking functionality #37631

Uh oh!

Conversation

imkero commented Apr 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

github-actions bot commented Apr 19, 2025

Uh oh!

BakerBunker commented Apr 20, 2025

Uh oh!

BakerBunker commented Apr 21, 2025

Uh oh!

imkero commented Apr 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

imkero commented Apr 21, 2025

Uh oh!

BakerBunker commented Apr 22, 2025

Uh oh!

zucchini-nlp left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

imkero commented Apr 19, 2025 •

edited

Loading

imkero commented Apr 21, 2025 •

edited

Loading