-
Notifications
You must be signed in to change notification settings - Fork 501
[Feature] AIBrix KVCache L2Cache Part1 #1062
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] AIBrix KVCache L2Cache Part1 #1062
Conversation
79f0699 to
a8833d0
Compare
- connectors - key builders - marshallers - key builder benchmark Signed-off-by: Haiyang Shi <[email protected]>
a8833d0 to
17a2b7c
Compare
| redis = "^6.0.0" | ||
| fakeredis = "^2.28.1" | ||
| codespell = "2.4.1" | ||
| infinistore = "^0.2.35" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
does hpkv provide whl in pypi ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
HPKV is provided in our internal pypi, not available from the public one
| # index to support multi-GPU per RNIC. For example, if we have 8 | ||
| # GPUs and 2 RNICs, then GPU 0 to 3 are mapped to RNIC 0, and GPU | ||
| # 4 to 7 are mapped to RNIC 1. | ||
| factor = num_visible_gpus // len(dev_list) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
em. we may not explicitly pass device list to engine for TP <= 8 case. we can do some improvements later to intelligently search dev_list if that's not given from user.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
sure
| gpu_idx = torch.cuda.current_device() | ||
| rnic_idx = gpu_idx // factor | ||
| dev_name = dev_list[rnic_idx] | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
TODO: add gid search logic and add --hint-gid-index to infinistore.ClientConfig to support VNI mode, this can be done later
- connectors - key builders - marshallers - key builder benchmark Signed-off-by: Haiyang Shi <[email protected]> Co-authored-by: Haiyang Shi <[email protected]>
Pull Request Description
This PR includes core modules of L2Cache.
Related Issues
Resolves: #[Insert issue number(s)]
Important: Before submitting, please complete the description above and review the checklist below.
Contribution Guidelines (Expand for Details)
We appreciate your contribution to aibrix! To ensure a smooth review process and maintain high code quality, please adhere to the following guidelines:
Pull Request Title Format
Your PR title should start with one of these prefixes to indicate the nature of the change:
[Bug]: Corrections to existing functionality[CI]: Changes to build process or CI pipeline[Docs]: Updates or additions to documentation[API]: Modifications to aibrix's API or interface[CLI]: Changes or additions to the Command Line Interface[Misc]: For changes not covered above (use sparingly)Note: For changes spanning multiple categories, use multiple prefixes in order of importance.
Submission Checklist
By submitting this PR, you confirm that you've read these guidelines and your changes align with the project's contribution standards.