[Bugfix] Fix the issue where llm.generate cannot be called repeatedly after setting GuidedDecodingParams #16767

chaunceyjiang · 2025-04-17T08:31:02Z

Fix the issue where llm.generate cannot be called repeatedly after setting GuidedDecodingParams

Signed-off-by: chaunceyjiang <[email protected]>

github-actions · 2025-04-17T08:32:51Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to catch errors quickly. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

chaunceyjiang · 2025-04-17T08:38:01Z

/cc @DarkLight1337 @russellb PTAL.

russellb · 2025-04-17T12:31:42Z

Thanks for the PR! I'm taking a closer look now.

dgslqh · 2025-04-17T12:43:53Z

I ran into the same problem, thanks a lot!

Closes vllm-project#16738 Signed-off-by: Russell Bryant <[email protected]>

russellb · 2025-04-17T12:55:03Z

@chaunceyjiang can you take a look at the commit I added? It's an alternate solution that I prefer, but want to make sure you're OK with it.

chaunceyjiang · 2025-04-17T13:09:01Z

> @chaunceyjiang can you take a look at the commit I added? It's an alternate solution that I prefer, but want to make sure you're OK with it.

I think your solution is even better. @russellb

russellb · 2025-04-17T14:56:12Z

Thanks for taking a look! Since I made changes, I asked for someone else to take a look for approval.

mgoin

LGTM, however I would like to see a test case specifically for this

russellb · 2025-04-17T14:58:37Z

> LGTM, however I would like to see a test case specifically for this

good point - I can do that pretty quick I think

Signed-off-by: Russell Bryant <[email protected]>

russellb · 2025-04-17T15:30:49Z

> > LGTM, however I would like to see a test case specifically for this
>
> good point - I can do that pretty quick I think

added test coverage by slightly extending an existing test.

chaunceyjiang · 2025-04-21T14:24:14Z

/cc @russellb @DarkLight1337 I think this PR can be merged.

… after setting GuidedDecodingParams (vllm-project#16767) Signed-off-by: chaunceyjiang <[email protected]> Signed-off-by: Russell Bryant <[email protected]> Co-authored-by: Russell Bryant <[email protected]> Signed-off-by: Frieda (Jingying) Huang <[email protected]>

kaimatzu · 2025-04-23T06:30:09Z

When is this getting released?

DarkLight1337 · 2025-04-23T06:33:33Z

The next release will be around the end of this month. If you can't wait, you can install the nightly vLLM package.

https://docs.vllm.ai/en/latest/getting_started/installation/gpu.html#pre-built-wheels

jgoriasilva · 2025-04-25T08:55:58Z

Thanks! I was having the same problem and this solves it; I tested it with the nightly package.


[Bugfix] Fix the issue where llm.generate cannot be called repeatedly after setting GuidedDecodingParams.

4b40365

Signed-off-by: chaunceyjiang <[email protected]>

chaunceyjiang requested review from WoosukKwon, robertgshaw2-redhat, njhill, ywang96, comaniac and alexm-redhat as code owners April 17, 2025 08:31

mergify bot added the v1 label Apr 17, 2025

Remember when struct output backend was set via auto

b68fd90

Closes vllm-project#16738 Signed-off-by: Russell Bryant <[email protected]>
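The idea named in this commit title can be illustrated with a small standalone model. This is a sketch only and not vLLM's actual code: the `GuidedParams` class, the `validate` function, the `backend_was_auto` field, and the `remember_auto` switch are all hypothetical names. It assumes the failure came from the first call writing a concrete backend into the shared params object, so that a second call with the same object no longer matched the engine-level "auto" setting.

```python
from dataclasses import dataclass
from typing import Optional


# Hypothetical stand-in for the guided decoding params; NOT vLLM's real class.
@dataclass
class GuidedParams:
    json_schema: str
    backend: Optional[str] = None    # request-level backend, unset by default
    backend_was_auto: bool = False   # remembers that "auto" chose the backend


def validate(params: GuidedParams, engine_backend: str,
             remember_auto: bool) -> None:
    """Check the request against the engine-level backend, resolving "auto"."""
    if params.backend is not None and params.backend != engine_backend:
        # With the fix enabled, a mismatch is tolerated when it was caused
        # by an earlier "auto" resolution on this same params object.
        resolved_by_auto = (remember_auto and engine_backend == "auto"
                            and params.backend_was_auto)
        if not resolved_by_auto:
            raise ValueError(
                "request-level backend must match engine-level backend")
    if engine_backend == "auto":
        # Assumption: "auto" resolves to a concrete backend. The name used
        # here is illustrative only.
        params.backend = "xgrammar"
        params.backend_was_auto = True
    else:
        params.backend = engine_backend


def generate_twice(remember_auto: bool) -> str:
    """Validate the same params object twice, as repeated generate calls would."""
    params = GuidedParams(json_schema='{"type": "object"}')
    try:
        validate(params, "auto", remember_auto)  # first call mutates params
        validate(params, "auto", remember_auto)  # second call reuses them
    except ValueError:
        return "second call rejected"
    return "both calls accepted"
```

Under these assumptions, `generate_twice(remember_auto=False)` reproduces the rejection on the second call, while `generate_twice(remember_auto=True)` accepts both calls because the params record that their backend came from "auto".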

mgoin approved these changes Apr 17, 2025


Add test for re-use of sampling_params with the auto backend

a05ba5c

Signed-off-by: Russell Bryant <[email protected]>

russellb self-requested a review as a code owner April 17, 2025 15:30

DarkLight1337 added the ready label (ONLY add when PR is ready to merge/full CI is needed) Apr 22, 2025

DarkLight1337 enabled auto-merge (squash) April 22, 2025 06:02

DarkLight1337 merged commit acba33a into vllm-project:main Apr 22, 2025
60 checks passed

chaunceyjiang deleted the guided_decoding branch April 22, 2025 13:48

ckhordiasma mentioned this pull request May 14, 2025

nm vllm ent 0.8.5 sync red-hat-data-services/vllm#139

Merged
