Commit 2c3e549

zhaozhul authored and facebook-github-bot committed
support fp16 dtypes for input weight and bias (pytorch#1017)
Summary:
X-link: pytorch#3931
Pull Request resolved: facebookresearch/FBGEMM#1017

ATT, this diff:
- Supports FP16 inputs for FP8 quantization and GEMM
- Adds an addmm fallback to CPU for FP8 GEMM, so that the module is compatible with publish-time processing

Reviewed By: sijiac

Differential Revision: D72479579

fbshipit-source-id: db9615095b1f246578fec39b92be6c2238ded5da
1 parent 59115fc commit 2c3e549

File tree

1 file changed: +3, -2 lines

fbgemm_gpu/experimental/gen_ai/src/quantize/quantize.cu

Lines changed: 3 additions & 2 deletions
@@ -1077,8 +1077,9 @@ std::vector<at::Tensor> quantize_fp8_per_row(
       "Invalid dim. The dim of input should be greater than or equal to 2");
   TORCH_CHECK(
       input.scalar_type() == torch::kBFloat16 ||
-          input.scalar_type() == torch::kFloat,
-      "Invalid datatype. input must be BF16 or FP32");
+          input.scalar_type() == torch::kFloat ||
+          input.scalar_type() == torch::kHalf,
+      "Invalid datatype. input must be BF16, FP16 or FP32");
   TORCH_CHECK(
       !stochastic_rounding || input.size(-1) % 4 == 0,
       "input row dim must be 4's multiple when stochastic_rounding is True");

0 commit comments