Popular repositories Loading
-
verl_megatron_practice
verl_megatron_practice Public(best/better) practices of megatron on veRL and tuning guide
-
-
-
fast-DiT
fast-DiT PublicForked from chuanyangjin/fast-DiT
Fast Diffusion Models with Transformers
Python
-
Megatron-MoE-ModelZoo
Megatron-MoE-ModelZoo PublicForked from yanring/Megatron-MoE-ModelZoo
Best practices for testing advanced Mixtral, DeepSeek, and Qwen series MoE models using Megatron Core MoE.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.