[V1][Performance] Implement custom serialization for MultiModalKwargs [Rebased] #16432
Merged
Changes from 30 commits
7b6b7ba  Implement efficient serialization of MultiModalKwargs (p88h)
4bdd16e  Apply suggestions from code review (p88h)
e5931af  Additional fixes after code review (p88h)
6641584  Fix some broken bits & reformat (p88h)
a94df99  Add custom support for MultiModalFieldConfig, less pickle (p88h)
57467e2  Too many stars. Test for other field types. (p88h)
d993e42  Set zero-copy threshold to 256MB. Also copy out tensors. (p88h)
3401429  Make mypy happy, and also simplify field type restore (p88h)
252d8a0  Merge branch 'vllm-project:main' into serialize-multimodal-kwargs (p88h)
5902d6e  Merge branch 'main' into serialize-multimodal-kwargs (p88h)
57e1922  style fix (p88h)
2c0e9a8  Merge branch 'vllm-project:main' into serialize-multimodal-kwargs (p88h)
176ba06  Copy memory when sending, zero copy when receiving (p88h)
3461ce6  Merge branch 'vllm-project:main' into serialize-multimodal-kwargs (p88h)
578aab8  Add threshold env var, re-do field serialization, cleanup (p88h)
91a4500  Merge branch 'vllm-project:main' into serialize-multimodal-kwargs (p88h)
3d4e380  Merge branch 'vllm-project:main' into serialize-multimodal-kwargs (p88h)
c61c87a  Merge branch 'vllm-project:main' into serialize-multimodal-kwargs (p88h)
936c95e  remove asdict() which involves object deep copy. (p88h)
7cf5492  Bring back zero-copy, plus more review updates (p88h)
12c9d8b  Apply suggestions from code review (p88h)
8bda83c  fix review edits (p88h)
678cba1  revert encode_into changes (p88h)
f8d26df  Apply suggestions from code review (p88h)
bce2f07  Small fixes (p88h)
d7cb694  Merge branch 'vllm-project:main' into serialize-multimodal-kwargs (p88h)
7511262  style (p88h)
97188e6  Merge branch 'vllm-project:main' into serialize-multimodal-kwargs (p88h)
48ab2d9  remove unnecessary comment (p88h)
a60333e  Merge branch 'vllm-project:main' into serialize-multimodal-kwargs (p88h)
281f0f1  Accommodate floats in NestedTensors (njhill)
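Several commit titles above mention a zero-copy threshold and zero copy on the receiving side. The PR's code is not shown on this page, so the following is only a minimal standard-library sketch of the general technique, not the vLLM implementation: buffers below a size threshold are copied inline into a header, and larger ones travel as separate frames that the receiver wraps without copying. The names `encode`, `decode`, and `INLINE_THRESHOLD`, and the JSON header layout, are hypothetical choices made for illustration.

```python
import json
from typing import Dict, List, Union

Buffer = Union[bytes, bytearray, memoryview]

# Hypothetical threshold (bytes). Deliberately tiny so the demo is easy to follow.
INLINE_THRESHOLD = 16


def encode(fields: Dict[str, Buffer], threshold: int = INLINE_THRESHOLD) -> List[Buffer]:
    """Return frames: frame 0 is a JSON header, later frames are raw buffers."""
    header: Dict[str, dict] = {}
    frames: List[Buffer] = [b""]  # placeholder, replaced by the header below
    for name, buf in fields.items():
        view = memoryview(buf)
        if view.nbytes < threshold:
            # Small payload: copy it into the header (hex keeps it JSON-safe).
            header[name] = {"inline": view.tobytes().hex()}
        else:
            # Large payload: reference it by frame index; no copy is made here.
            header[name] = {"frame": len(frames)}
            frames.append(view)
    frames[0] = json.dumps(header).encode("utf-8")
    return frames


def decode(frames: List[Buffer]) -> Dict[str, memoryview]:
    """Rebuild the field mapping; large fields are views onto the received frames."""
    header = json.loads(bytes(frames[0]).decode("utf-8"))
    out: Dict[str, memoryview] = {}
    for name, entry in header.items():
        if "inline" in entry:
            out[name] = memoryview(bytes.fromhex(entry["inline"]))
        else:
            # Wrapping a frame in memoryview shares its memory instead of copying.
            out[name] = memoryview(frames[entry["frame"]])
    return out


if __name__ == "__main__":
    payload = {"small": b"abc", "large": bytearray(b"x" * 64)}
    frames = encode(payload)
    decoded = decode(frames)
    print(len(frames), decoded["small"].tobytes(), decoded["large"].nbytes)
```

In this sketch the sender does not copy large buffers either, so the caller has to keep them alive and unmodified until the frames have actually been sent.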