Kunshang/v1 support mini #13

jikunshang · 2025-06-12T05:04:02Z

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

Purpose

Test Plan

Test Result

(Optional) Documentation Update

Signed-off-by: Kunshang Ji <[email protected]> some v1 fixes Signed-off-by: Kunshang Ji <[email protected]> remove useless file Signed-off-by: Kunshang Ji <[email protected]> remove Signed-off-by: Kunshang Ji <[email protected]> add V1 test and set spawn in docker env Signed-off-by: Kunshang Ji <[email protected]> add missing dependency Signed-off-by: Kunshang Ji <[email protected]> fix test Signed-off-by: Kunshang Ji <[email protected]> update api name Signed-off-by: Kunshang Ji <[email protected]> update api Signed-off-by: Kunshang Ji <[email protected]> update default block size for v1 Signed-off-by: Kunshang Ji <[email protected]> update memory usage Signed-off-by: Kunshang Ji <[email protected]> fix rebase issues Signed-off-by: Kunshang Ji <[email protected]> fix rebase, spec decode meta set to none Signed-off-by: Kunshang Ji <[email protected]> add xpu v1 config check Signed-off-by: Kunshang Ji <[email protected]> add mem log Signed-off-by: Kunshang Ji <[email protected]> fix init cache Signed-off-by: Kunshang Ji <[email protected]> add xpu profiler for V1 Signed-off-by: Kunshang Ji <[email protected]> update rebase issue Signed-off-by: Kunshang Ji <[email protected]> update prepare_inputs for perf Signed-off-by: Kunshang Ji <[email protected]> update Signed-off-by: Kunshang Ji <[email protected]> refine xpu_model_runner Signed-off-by: Kunshang Ji <[email protected]>

Signed-off-by: Kunshang Ji <[email protected]>

…one by default. The modification involves adding a check to prevent potential null exceptions。 (vllm-project#173) Signed-off-by: Kunshang Ji <[email protected]>

Signed-off-by: Kunshang Ji <[email protected]>

Co-authored-by: yan <[email protected]> Co-authored-by: mayuyuace <[email protected]> Signed-off-by: Kunshang Ji <[email protected]>

Signed-off-by: Kunshang Ji <[email protected]>

github-actions · 2025-06-12T05:04:09Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

jikunshang and others added 17 commits June 12, 2025 01:10

fix sampler, attn metadata

9855d05

Signed-off-by: Kunshang Ji <[email protected]>

fix rebase issues

855f2dd

Signed-off-by: Kunshang Ji <[email protected]>

fix format issues

2f416c0

Signed-off-by: Kunshang Ji <[email protected]>

keep rebase

2c029b7

Signed-off-by: Kunshang Ji <[email protected]>

Fix block_size setting hard code; (vllm-project#175)

ecfe8c3

Signed-off-by: Kunshang Ji <[email protected]>

Instances created using VllmConfig() typically have model_config as N…

613b5d5

…one by default. The modification involves adding a check to prevent potential null exceptions。 (vllm-project#173) Signed-off-by: Kunshang Ji <[email protected]>

sign off

7649231

Signed-off-by: Kunshang Ji <[email protected]>

fix format

8b3f995

Signed-off-by: Kunshang Ji <[email protected]>

fix

1977622

Signed-off-by: Kunshang Ji <[email protected]>

fix typo

fa7b562

Signed-off-by: Kunshang Ji <[email protected]>

update xpu path

fa41aa1

Signed-off-by: Kunshang Ji <[email protected]>

address comments

75947a7

Signed-off-by: Kunshang Ji <[email protected]>

address comments, refine chunk_prefill op API

12a4bd3

Signed-off-by: Kunshang Ji <[email protected]>

refine APIs

4e2774b

Co-authored-by: yan <[email protected]> Co-authored-by: mayuyuace <[email protected]> Signed-off-by: Kunshang Ji <[email protected]>

only A770 will fallback to fp16

b4ac463

Signed-off-by: Kunshang Ji <[email protected]>

update arg_utils

97210bf

Signed-off-by: Kunshang Ji <[email protected]>

minimal code change, while performance is not best

6732a5b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Kunshang/v1 support mini #13

Kunshang/v1 support mini #13

Uh oh!

jikunshang commented Jun 12, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Jun 12, 2025

Uh oh!

Uh oh!

Kunshang/v1 support mini #13

Are you sure you want to change the base?

Kunshang/v1 support mini #13

Uh oh!

Conversation

jikunshang commented Jun 12, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Essential Elements of an Effective PR Description Checklist

Purpose

Test Plan

Test Result

(Optional) Documentation Update

Uh oh!

github-actions bot commented Jun 12, 2025

Uh oh!

Uh oh!

jikunshang commented Jun 12, 2025 •

edited by github-actions bot

Loading