-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Support qwen3 deepep #6120
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support qwen3 deepep #6120
Conversation
Do we need raise error for bf16 when enable deepep? |
91ac718
to
ffcf97b
Compare
When can we support qwen3 bf16 with deepep? |
I think deepep currently do not support bf16 dispatch? |
@yizhang2077 Seems DeepEP already support bf16 dispatch https://github.com/deepseek-ai/DeepEP?tab=readme-ov-file#roadmap |
I think normal mode is unmentioned |
@yizhang2077 For normal mode dispatch, I think sglang code already support bf16. |
Motivation
Support qwen3's deepep. For now, we've simply copied the deepep code from DS, and the accuracy test has passed.
TODO
Test Command
gsm8k Accuracy: 0.970