Add VBE to Dense TBE frontend #2628

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Closed

joshuadeng wants to merge 3 commits into pytorch:main from joshuadeng:export-D56651380

Contributor

joshuadeng commented May 23, 2024

Summary:

add frontend support to Dense TBE module to support VBE
add unit test

Differential Revision: D56651380


          Add cache conflict miss support (pytorch#2596)

ed6d18d

Summary:
Pull Request resolved: pytorch#2596

Prior to this diff, SSD TBE lacked support for the conflict cache miss
scenario. It operated under the assumption that the cache, located in
GPU memory, was sufficiently large to hold all prefetched data from
SSD. In the event of a conflict cache miss, the behavior of SSD TBE
would be unpredictable (it could either fail or potentially access
illegal memory). Note that a conflict cache miss happens when an
embedding row is absent in the cache, and after being fetched from
SSD, it cannot be inserted into the cache due to capacity constraints
or associativity limitations.

This diff introduces support for conflict cache misses by storing rows
that cannot be inserted into the cache due to conflicts in a scratch
pad, which is a temporary GPU tensor. In the case where rows are
missed from the cache, TBE kernels can access the scratch pad.

Prior to this diff, during the SSD prefetch stage, any row that was
missed the cache and required fetching from SSD would be first fetched
into a CPU scratch pad and then transferred to GPU. Rows that could be
inserted into the cache would subsequently be copied from the GPU
scratch pad into the cache. If conflict misses occurred, the prefetch
behavior would be unpredictable. With this diff, conflict missed rows
are now retained in the scratch pad, which is kept alive until the
current iteration completes.  Throughout the forward and backward +
optimizer stages of TBE, both the cache and scratch pad are equivalent
in terms of usage. However, following the completion of the backward +
optimizer step, rows in the scratch pad are flushed back to SSD,
unlike rows residing in the cache which are not evicted for future
usage (see the diagram below for more details).

 {F1645878181}

Differential Revision: D55998215

facebook-github-bot added the cla signed label

netlify bot commented May 23, 2024 •

edited

Loading

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
🔨 Latest commit	`65e3266`
🔍 Latest deploy log	https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/66501c30836cae000809b1fe
😎 Deploy Preview	https://deploy-preview-2628--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

Contributor

facebook-github-bot commented May 23, 2024

This pull request was exported from Phabricator. Differential Revision: D56651380

facebook-github-bot added the fb-exported label

Contributor

facebook-github-bot commented May 23, 2024

This pull request was exported from Phabricator. Differential Revision: D56651380

joshuadeng added a commit to joshuadeng/FBGEMM that referenced this pull request


          Add VBE to Dense TBE frontend (pytorch#2628)

86f8974

Summary:
Pull Request resolved: pytorch#2628

- add frontend support to Dense TBE module to support VBE
- add unit test

Differential Revision: D56651380

joshuadeng force-pushed the export-D56651380 branch from b33c8bb to 86f8974 Compare

May 23, 2024 20:45

Contributor

facebook-github-bot commented May 23, 2024

This pull request was exported from Phabricator. Differential Revision: D56651380

joshuadeng force-pushed the export-D56651380 branch from 86f8974 to b31daed Compare

May 23, 2024 21:50

joshuadeng added a commit to joshuadeng/FBGEMM that referenced this pull request


          Add VBE to Dense TBE frontend (pytorch#2628)

b31daed

Summary:
Pull Request resolved: pytorch#2628

- add frontend support to Dense TBE module to support VBE
- add unit test

Differential Revision: D56651380


          add dense TBE to template and enable VBE support (pytorch#2620)

f0f312f

Summary:
Pull Request resolved: pytorch#2620

- make the dense TBE headers into a template
- add VBE options to the dense TBE header

Differential Revision: https://internalfb.com/D57017981

Contributor

facebook-github-bot commented May 24, 2024

This pull request was exported from Phabricator. Differential Revision: D56651380

joshuadeng added a commit to joshuadeng/FBGEMM that referenced this pull request


          Add VBE to Dense TBE frontend (pytorch#2628)

e048d0a

Summary:
Pull Request resolved: pytorch#2628

- add frontend support to Dense TBE module to support VBE
- add unit test

Differential Revision: D56651380

joshuadeng force-pushed the export-D56651380 branch from b31daed to e048d0a Compare

May 24, 2024 04:26

Contributor

facebook-github-bot commented May 24, 2024

This pull request was exported from Phabricator. Differential Revision: D56651380

joshuadeng added a commit to joshuadeng/FBGEMM that referenced this pull request


          Add VBE to Dense TBE frontend (pytorch#2628)

f95d940

Summary:
Pull Request resolved: pytorch#2628

- add frontend support to Dense TBE module to support VBE
- add unit test

Differential Revision: D56651380

joshuadeng force-pushed the export-D56651380 branch from e048d0a to f95d940 Compare

May 24, 2024 04:32


          Add VBE to Dense TBE frontend (pytorch#2628)

65e3266

Summary:
Pull Request resolved: pytorch#2628

- add frontend support to Dense TBE module to support VBE
- add unit test

Differential Revision: D56651380

Contributor

facebook-github-bot commented May 24, 2024

This pull request was exported from Phabricator. Differential Revision: D56651380

joshuadeng force-pushed the export-D56651380 branch from f95d940 to 65e3266 Compare

May 24, 2024 04:48

facebook-github-bot closed this in

d50babd

facebook-github-bot added the Merged label

Contributor

facebook-github-bot commented Jun 5, 2024

This pull request has been merged in d50babd.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cla signed fb-exported Merged