Support zero-size inputs in FP8 cuda quantize kernel #3448


Closed

Conversation

jiawenliu64
Member
Summary:
For MoE, if tokens are not routed (in the dynamic case), some experts may run on 0 tokens, as found by Jason.

This diff supports zero-size inputs in the FP8 CUDA quantize kernel for this case.

Differential Revision: D66727399
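The fix boils down to a host-side guard: when an expert receives 0 tokens, the quantize path should return empty outputs rather than launching a kernel over an empty input. Below is a minimal pure-Python sketch of that guard (not FBGEMM's actual API; `quantize_fp8_rowwise`, its list-based inputs, and the per-row scaling are illustrative assumptions):

```python
# Hypothetical sketch, not FBGEMM's real implementation: a quantize wrapper
# that handles zero-size inputs, mirroring the MoE case where dynamic
# routing leaves an expert with 0 tokens.

E4M3_MAX = 448.0  # largest finite magnitude representable in FP8 E4M3


def quantize_fp8_rowwise(rows):
    """Per-row scaled quantization; `rows` is a list of rows of floats.

    Returns (quantized_rows, scales).
    """
    # Zero-size guard: with 0 tokens there is nothing to quantize, so
    # return empty outputs instead of dispatching work on an empty input.
    if len(rows) == 0:
        return [], []

    quantized, scales = [], []
    for row in rows:
        # Scale each row so its max magnitude maps onto the FP8 range.
        amax = max((abs(v) for v in row), default=0.0)
        scale = amax / E4M3_MAX if amax > 0 else 1.0
        quantized.append([v / scale for v in row])
        scales.append(scale)
    return quantized, scales
```

In the actual CUDA kernel the analogous change is an early return (or a skipped launch) when the element count is zero, since launching a grid sized from a zero-size tensor would otherwise be invalid.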

@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D66727399

netlify bot commented Dec 4, 2024

Deploy Preview for pytorch-fbgemm-docs ready!

🔨 Latest commit: 9981cae
🔍 Latest deploy log: https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/6750eb6a74ca850008a7869f
😎 Deploy Preview: https://deploy-preview-3448--pytorch-fbgemm-docs.netlify.app

Summary:
X-link: facebookresearch/FBGEMM#533

For MoE, if tokens are not routed (in the dynamic case), some experts may run on 0 tokens, as found by jasonjk-park.

This diff supports zero-size inputs in the FP8 CUDA quantize kernel for this case.

Reviewed By: jasonjk-park

Differential Revision: D66727399

@facebook-github-bot
Contributor

This pull request has been merged in 1a0d837.

q10 pushed a commit to q10/FBGEMM that referenced this pull request Apr 10, 2025
Summary:
X-link: pytorch#3448

Pull Request resolved: facebookresearch/FBGEMM#533

For MoE, if tokens are not routed (in the dynamic case), some experts may run on 0 tokens, as found by jasonjk-park.

This diff supports zero-size inputs in the FP8 CUDA quantize kernel for this case.

Reviewed By: jasonjk-park

Differential Revision: D66727399

fbshipit-source-id: e4d760edace6b9e0cc6a1018f88e03b8a19b0ce6