Integrate l2 cache to TBE operator #2959
This pull request was exported from Phabricator. Differential Revision: D60289936
Summary:
Pull Request resolved: pytorch#2959

Integrate the CacheLib-based L2 cache into the TBE operator. At a high level, this diff:
1. Implements the CacheLib-based L2 cache class as a wrapper that provides a unified interface, including get() and put(), to the CPU caches.
2. Integrates the L2 cache class into the EmbeddingKVDB base class to extend the get()/set() functions.
3. Introduces async get()/set() APIs for PS/SSD for performance tuning (e.g. to run the memcpy in parallel with the get from remote storage).

Reviewed By: sryap
Differential Revision: D60289936
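Items 1 and 2 of the summary can be pictured with a small sketch. This is not the code from the diff: it is written in Python rather than C++, a plain LRU dictionary stands in for CacheLib, and the names `L2Cache` and `KVStoreWithL2` are hypothetical. The write-through behaviour of set() is also an assumption, since the summary does not state the write policy.

```python
from collections import OrderedDict
from typing import Dict, Iterable, List, Optional

Row = List[float]


class L2Cache:
    """Hypothetical stand-in for the CacheLib-backed wrapper: an LRU with get()/put()."""

    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self._rows: "OrderedDict[int, Row]" = OrderedDict()

    def get(self, key: int) -> Optional[Row]:
        row = self._rows.get(key)
        if row is not None:
            self._rows.move_to_end(key)  # mark as most recently used
        return row

    def put(self, key: int, row: Row) -> None:
        self._rows[key] = row
        self._rows.move_to_end(key)
        if len(self._rows) > self.capacity:
            self._rows.popitem(last=False)  # evict the least recently used row


class KVStoreWithL2:
    """Hypothetical base class whose get()/set() consult the L2 cache first."""

    def __init__(self, backing: Dict[int, Row], l2: L2Cache) -> None:
        self.backing = backing  # stand-in for the PS/SSD backend
        self.l2 = l2
        self.backing_reads = 0  # counts lookups that missed the L2 cache

    def get(self, keys: Iterable[int]) -> Dict[int, Optional[Row]]:
        out: Dict[int, Optional[Row]] = {}
        for key in keys:
            row = self.l2.get(key)
            if row is None:
                # L2 miss: read from the backend and populate the cache.
                self.backing_reads += 1
                row = self.backing.get(key)
                if row is not None:
                    self.l2.put(key, row)
            out[key] = row
        return out

    def set(self, rows: Dict[int, Row]) -> None:
        # Assumption: write-through, so the cache and backend stay in sync.
        for key, row in rows.items():
            self.l2.put(key, row)
            self.backing[key] = row
```

With a capacity of two rows, a repeated get() for the same key is served from the cache and leaves `backing_reads` unchanged, while a get() for a key that was evicted goes back to the backend.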
Force-pushed from 7927498 to a31dcf1.
Force-pushed from a31dcf1 to a250477.
Force-pushed from a250477 to 1e3b57b.
Force-pushed from 1e3b57b to 16816ed.
This pull request has been merged in 9987dba.
Summary:
Pull Request resolved: facebookresearch/FBGEMM#61
X-link: pytorch#2959

Integrate the CacheLib-based L2 cache into the TBE operator. At a high level, this diff:
1. Implements the CacheLib-based L2 cache class as a wrapper that provides a unified interface, including get() and put(), to the CPU caches.
2. Integrates the L2 cache class into the EmbeddingKVDB base class to extend the get()/set() functions.
3. Introduces async get()/set() APIs for PS/SSD for performance tuning (e.g. to run the memcpy in parallel with the get from remote storage).

Reviewed By: sryap
Differential Revision: D60289936
fbshipit-source-id: 4a33c534aa7a75c38d7d81dbf4872ecfe120cec2
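The motivation given for item 3, overlapping the memcpy with the remote get, can be illustrated with a minimal sketch. It is an assumption-laden Python illustration, not the API added by the diff: `AsyncRemoteStore`, `get_async`, `set_async`, and `lookup_with_overlap` are hypothetical names, a thread pool stands in for the real async machinery, and `time.sleep` simulates PS/SSD latency.

```python
import time
from concurrent.futures import Future, ThreadPoolExecutor
from typing import Dict, Iterable, List, Optional

Row = List[float]


class AsyncRemoteStore:
    """Hypothetical stand-in for a PS/SSD backend with async get()/set()."""

    def __init__(self, latency_s: float = 0.05) -> None:
        self._rows: Dict[int, Row] = {}
        self._latency_s = latency_s
        self._pool = ThreadPoolExecutor(max_workers=2)

    def _get(self, keys: List[int]) -> Dict[int, Optional[Row]]:
        time.sleep(self._latency_s)  # simulated remote/SSD read latency
        return {key: self._rows.get(key) for key in keys}

    def _set(self, rows: Dict[int, Row]) -> None:
        time.sleep(self._latency_s)  # simulated remote/SSD write latency
        self._rows.update(rows)

    def get_async(self, keys: Iterable[int]) -> Future:
        # Returns immediately; the fetch runs on a worker thread.
        return self._pool.submit(self._get, list(keys))

    def set_async(self, rows: Dict[int, Row]) -> Future:
        return self._pool.submit(self._set, dict(rows))


def lookup_with_overlap(
    store: AsyncRemoteStore,
    cached_rows: Dict[int, Row],
    missing_keys: Iterable[int],
) -> Dict[int, Optional[Row]]:
    """Start the remote fetch first, then copy cached rows while it is in flight."""
    pending = store.get_async(missing_keys)
    # Stand-in for the memcpy of rows that were already cached locally.
    out: Dict[int, Optional[Row]] = {k: list(v) for k, v in cached_rows.items()}
    out.update(pending.result())  # block only once the local copy is done
    return out
```

The point of the design is that the caller blocks on the remote result only after the local copy has finished, so the two costs overlap instead of adding up.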