
support fp16 dtypes for input weight and bias #3931


Closed
zhaozhul wants to merge 1 commit

Conversation

@zhaozhul (Contributor) commented Apr 4, 2025

Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/1017

ATT, this diff:

  • Supports fp16 inputs for fp8 quantization and GEMM
  • Adds an addmm fallback to CPU for fp8 GEMM, so that the module stays compatible with publish-time processing (see the sketch below)

Differential Revision: D72479579
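
For illustration only, here is a minimal sketch of the two changes above in plain PyTorch. It assumes torch.float8_e4m3fn is available (PyTorch ≥ 2.1); the names fp8_rowwise_quantize and fp8_linear are hypothetical stand-ins for FBGEMM's actual kernels, which this diff modifies.

```python
import torch

def fp8_rowwise_quantize(x: torch.Tensor):
    # Hypothetical rowwise fp8 quantization; the point of the diff is that
    # x may now be fp16 (as well as bf16/fp32).
    finfo = torch.finfo(torch.float8_e4m3fn)
    xf = x.to(torch.float32)  # upcast so scale math is stable for fp16 inputs
    scale = xf.abs().amax(dim=-1, keepdim=True).clamp(min=1e-12) / finfo.max
    xq = (xf / scale).clamp(finfo.min, finfo.max).to(torch.float8_e4m3fn)
    return xq, scale

def fp8_linear(x, wq, w_scale, bias=None):
    # Hypothetical fp8 linear layer with the CPU fallback described above.
    if not x.is_cuda:
        # Publish-time path: no fp8 GEMM kernel on CPU, so dequantize the
        # weight and fall back to a plain addmm/mm.
        w = wq.to(torch.float32) * w_scale
        xf = x.to(torch.float32)
        if bias is not None:
            out = torch.addmm(bias.to(torch.float32), xf, w.t())
        else:
            out = torch.mm(xf, w.t())
        return out.to(x.dtype)
    # On GPU this would dispatch to an fp8 GEMM kernel (elided here).
    raise NotImplementedError

# Example: quantize an fp16 weight once, then run the CPU fallback path.
w = torch.randn(8, 16, dtype=torch.float16)
wq, w_scale = fp8_rowwise_quantize(w)
y = fp8_linear(torch.randn(4, 16, dtype=torch.float16), wq, w_scale,
               bias=torch.zeros(8, dtype=torch.float16))
```

The fallback returns the result in the input dtype, so a module published with fp8 weights behaves like an ordinary fp16 linear layer on CPU.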

@facebook-github-bot (Contributor)

This pull request was exported from Phabricator. Differential Revision: D72479579

netlify bot commented Apr 4, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

🔨 Latest commit: 1b04d46
🔍 Latest deploy log: https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/67f04a1d51a7330008dad41f
😎 Deploy Preview: https://deploy-preview-3931--pytorch-fbgemm-docs.netlify.app

@zhaozhul (Contributor, Author) commented Apr 4, 2025

@pytorchbot merge

pytorch-bot commented Apr 4, 2025

Mergebot is not configured for this repository. Please use the merge button provided by GitHub.

@facebook-github-bot (Contributor)

This pull request has been merged in e4905d3.

q10 pushed a commit to q10/FBGEMM that referenced this pull request Apr 10, 2025
Summary:
X-link: pytorch#3931

Pull Request resolved: facebookresearch/FBGEMM#1017

ATT, this diff:
- Supports fp16 inputs for fp8 quantization and GEMM
- Adds an addmm fallback to CPU for fp8 GEMM, so that the module stays compatible with publish-time processing

Reviewed By: sijiac

Differential Revision: D72479579

fbshipit-source-id: db9615095b1f246578fec39b92be6c2238ded5da