Skip to content

Conversation

@gangmuk
Copy link
Collaborator

@gangmuk gangmuk commented Mar 7, 2025

Pull Request Description

Script that generates workload for prefix aware routing

Related Issues

Resolves: #819

Important: Before submitting, please complete the description above and review the checklist below.


Contribution Guidelines (Expand for Details)

We appreciate your contribution to aibrix! To ensure a smooth review process and maintain high code quality, please adhere to the following guidelines:

Pull Request Title Format

Your PR title should start with one of these prefixes to indicate the nature of the change:

  • [Bug]: Corrections to existing functionality
  • [CI]: Changes to build process or CI pipeline
  • [Docs]: Updates or additions to documentation
  • [API]: Modifications to aibrix's API or interface
  • [CLI]: Changes or additions to the Command Line Interface
  • [Misc]: For changes not covered above (use sparingly)

Note: For changes spanning multiple categories, use multiple prefixes in order of importance.

Submission Checklist

  • PR title includes appropriate prefix(es)
  • Changes are clearly explained in the PR description
  • New and existing tests pass successfully
  • Code adheres to project style and best practices
  • Documentation updated to reflect changes (if applicable)
  • Thorough testing completed, no regressions introduced

By submitting this PR, you confirm that you've read these guidelines and your changes align with the project's contribution standards.

@happyandslow happyandslow changed the title Script that generates workload for prefix aware routing Workload generation scripts for prefix aware routing Mar 7, 2025
@happyandslow happyandslow merged commit 02311e1 into main Mar 7, 2025
3 checks passed
@happyandslow happyandslow deleted the gangmuk/prefix-benchmark branch March 7, 2025 19:42
happyandslow pushed a commit to happyandslow/aibrix that referenced this pull request Mar 12, 2025
* Script that generates workload for prefix aware routing. Included some prepared prefix workload

Signed-off-by: “Gangmuk <[email protected]>

* generate_realistic_prefix_share_workload.py in benchmakrs/generator

Signed-off-by: “Gangmuk <[email protected]>

---------

Signed-off-by: “Gangmuk <[email protected]>
Signed-off-by: Le Xu <[email protected]>
happyandslow added a commit that referenced this pull request Mar 16, 2025
* Workload generation scripts for prefix aware routing (#820)

* Script that generates workload for prefix aware routing. Included some prepared prefix workload

Signed-off-by: “Gangmuk <[email protected]>

* generate_realistic_prefix_share_workload.py in benchmakrs/generator

Signed-off-by: “Gangmuk <[email protected]>

---------

Signed-off-by: “Gangmuk <[email protected]>
Signed-off-by: Le Xu <[email protected]>

* Generate workload based on prefix sharing synthetic data

Signed-off-by: Le Xu <[email protected]>

* update prefix sharing from distribution

Signed-off-by: Le Xu <[email protected]>

* remove adapter name

Signed-off-by: Le Xu <[email protected]>

* update user argument

Signed-off-by: Le Xu <[email protected]>

* update README

Signed-off-by: Le Xu <[email protected]>

* fix model argument

Signed-off-by: Le Xu <[email protected]>

* adding default model to client (making compatible with older generation workload)

Signed-off-by: Le Xu <[email protected]>

* fixing None statitiscs in output file

Signed-off-by: Le Xu <[email protected]>

* update readme for references

Signed-off-by: Le Xu <[email protected]>

* clean up

Signed-off-by: Le Xu <[email protected]>

---------

Signed-off-by: “Gangmuk <[email protected]>
Signed-off-by: Le Xu <[email protected]>
Co-authored-by: Gangmuk Lim <[email protected]>
Co-authored-by: Le Xu <[email protected]>
gangmuk added a commit to gangmuk/aibrix-gangmuk that referenced this pull request Jun 21, 2025
* Script that generates workload for prefix aware routing. Included some prepared prefix workload

Signed-off-by: “Gangmuk <[email protected]>

* generate_realistic_prefix_share_workload.py in benchmakrs/generator

Signed-off-by: “Gangmuk <[email protected]>

---------

Signed-off-by: “Gangmuk <[email protected]>
gangmuk added a commit to gangmuk/aibrix-gangmuk that referenced this pull request Jun 21, 2025
…t#840)

* Workload generation scripts for prefix aware routing (vllm-project#820)

* Script that generates workload for prefix aware routing. Included some prepared prefix workload

Signed-off-by: “Gangmuk <[email protected]>

* generate_realistic_prefix_share_workload.py in benchmakrs/generator

Signed-off-by: “Gangmuk <[email protected]>

---------

Signed-off-by: “Gangmuk <[email protected]>
Signed-off-by: Le Xu <[email protected]>

* Generate workload based on prefix sharing synthetic data

Signed-off-by: Le Xu <[email protected]>

* update prefix sharing from distribution

Signed-off-by: Le Xu <[email protected]>

* remove adapter name

Signed-off-by: Le Xu <[email protected]>

* update user argument

Signed-off-by: Le Xu <[email protected]>

* update README

Signed-off-by: Le Xu <[email protected]>

* fix model argument

Signed-off-by: Le Xu <[email protected]>

* adding default model to client (making compatible with older generation workload)

Signed-off-by: Le Xu <[email protected]>

* fixing None statitiscs in output file

Signed-off-by: Le Xu <[email protected]>

* update readme for references

Signed-off-by: Le Xu <[email protected]>

* clean up

Signed-off-by: Le Xu <[email protected]>

---------

Signed-off-by: “Gangmuk <[email protected]>
Signed-off-by: Le Xu <[email protected]>
Co-authored-by: Gangmuk Lim <[email protected]>
Co-authored-by: Le Xu <[email protected]>
Yaegaki1Erika pushed a commit to Yaegaki1Erika/aibrix that referenced this pull request Jul 23, 2025
* Script that generates workload for prefix aware routing. Included some prepared prefix workload

Signed-off-by: “Gangmuk <[email protected]>

* generate_realistic_prefix_share_workload.py in benchmakrs/generator

Signed-off-by: “Gangmuk <[email protected]>

---------

Signed-off-by: “Gangmuk <[email protected]>
Yaegaki1Erika pushed a commit to Yaegaki1Erika/aibrix that referenced this pull request Jul 23, 2025
…t#840)

* Workload generation scripts for prefix aware routing (vllm-project#820)

* Script that generates workload for prefix aware routing. Included some prepared prefix workload

Signed-off-by: “Gangmuk <[email protected]>

* generate_realistic_prefix_share_workload.py in benchmakrs/generator

Signed-off-by: “Gangmuk <[email protected]>

---------

Signed-off-by: “Gangmuk <[email protected]>
Signed-off-by: Le Xu <[email protected]>

* Generate workload based on prefix sharing synthetic data

Signed-off-by: Le Xu <[email protected]>

* update prefix sharing from distribution

Signed-off-by: Le Xu <[email protected]>

* remove adapter name

Signed-off-by: Le Xu <[email protected]>

* update user argument

Signed-off-by: Le Xu <[email protected]>

* update README

Signed-off-by: Le Xu <[email protected]>

* fix model argument

Signed-off-by: Le Xu <[email protected]>

* adding default model to client (making compatible with older generation workload)

Signed-off-by: Le Xu <[email protected]>

* fixing None statitiscs in output file

Signed-off-by: Le Xu <[email protected]>

* update readme for references

Signed-off-by: Le Xu <[email protected]>

* clean up

Signed-off-by: Le Xu <[email protected]>

---------

Signed-off-by: “Gangmuk <[email protected]>
Signed-off-by: Le Xu <[email protected]>
Co-authored-by: Gangmuk Lim <[email protected]>
Co-authored-by: Le Xu <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Basic prefix workload generation with configurable parameter

3 participants