Skip to content

Commit 2da29d2

Browse files
jwfrommfacebook-github-bot
authored andcommitted
Use custom copy of cutlass to enable FP8 Grouped Gemm. (pytorch#726)
Summary: Pull Request resolved: facebookresearch/FBGEMM#726 To support MOE models, we need to enable FP8 rowwise grouped gemm in FBGEMM. One missing piece to do this is support for rowwise scaling in cutlass. We have enabled this feature in a custom copy of cutlass but getting it into mainline will take a while. As a temporary measure, we can point FBGEMM to a custom copy of cutlass instead. Once the feature lands in mainline and we bump our support, we can go back to using the main repo. X-link: pytorch#3649 Reviewed By: q10, jiawenliu64 Differential Revision: D68967944 Pulled By: jwfromm fbshipit-source-id: 3e4625227ba6c33cf0478811fc9a8d40af361612
1 parent 9894293 commit 2da29d2

File tree

2 files changed

+5
-2
lines changed

2 files changed

+5
-2
lines changed

.gitmodules

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,9 +10,12 @@
1010
[submodule "external/hipify_torch"]
1111
path = external/hipify_torch
1212
url = https://github.com/ROCmSoftwarePlatform/hipify_torch.git
13+
# TODO Using a private copy of cutlass is a temporary mitigation to enable grouped gemm.
14+
# Go back to main cutlass when possible.
1315
[submodule "external/cutlass"]
1416
path = external/cutlass
15-
url = https://github.com/NVIDIA/cutlass.git
17+
url = https://github.com/jwfromm/cutlass.git
18+
branch = FBGEMM
1619
[submodule "external/json"]
1720
path = external/json
1821
url = https://github.com/nlohmann/json.git

0 commit comments

Comments
 (0)