-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Add pipeline parallelism for Qwen2 and Qwen3 Model #6250
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
After change the model to Qwen/Qwen3-8B, here is the result for the pipeline parallelism test cases.
|
@Ying1123 This PR can be reviewed quickly, thanks! |
also test for the Qwen3-30B-A3B model
|
this is a following improve for the pp impl. We want to test the Qwen model for the PP |
ping @zhyncs , @merrymercy if have time for this small PR. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
HI @libratiger, thanks for the PR. Could you also add an accuracy test for PP on these models? Also, could you resolve the conflicts and pass the CI tests?
I fix the conflicts and add a new accuracy test case as suggested。 Here is the result:
|
In the previous CI result, I just noticed some flaky failed test is timeout on DeepSeekV3 model. |
@libratiger Great work. let me rerun the CI and review it. Do not need to rebase on your own unless we ask. thank1 |
Motivation
Modifications
Checklist