A hotfix for FBGEMM fp8 rowwise with irregular gemm sizes
Summary:
- Hotfix for T219165899, reported by pranavsh. The failure is caused by some kernel instances requiring the K size to be a multiple of `KTile`.
- Added a fallback for GEMM cases where K is not a multiple of the maximum `KTile` (256).
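
The fallback condition described above can be illustrated with a minimal sketch. It is not the code from this diff: `kMaxKTile`, `KernelPath`, `needs_k_fallback`, and `select_kernel_path` are hypothetical names introduced here, and the only fact taken from the summary is that the maximum `KTile` is 256 and that K values not divisible by it take a fallback path.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical constant: the summary states the maximum KTile is 256.
constexpr int64_t kMaxKTile = 256;

// Hypothetical labels for the two dispatch outcomes.
enum class KernelPath { Tiled, Fallback };

// True when K is not a multiple of the maximum KTile, i.e. the case
// the summary says needs a fallback.
constexpr bool needs_k_fallback(int64_t k) {
  return k % kMaxKTile != 0;
}

// Picks a path from K alone. A real dispatcher would presumably also
// consider M and N; that is outside what the summary describes.
inline KernelPath select_kernel_path(int64_t k) {
  assert(k > 0 && "K must be positive");
  return needs_k_fallback(k) ? KernelPath::Fallback : KernelPath::Tiled;
}
```

Under these assumptions, `select_kernel_path(512)` would stay on the tiled path, while an irregular size such as `select_kernel_path(300)` would be routed to the fallback.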
Reviewed By: jianyuh
Differential Revision: D71863248
fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise/kernels/fp8_rowwise_64x16x16x256_16x16_1x1_16x4x1_16x4x1_1x4x1x16_4x4x1_1x1_intrawave_v1.hip