@hustxiayang (Contributor) commented Dec 12, 2025

Description

I took these fields from vLLM: https://github.com/vllm-project/vllm/blob/d2c919dcc20b1ea77a94fa01e813ebbb31f8a66a/vllm/entrypoints/openai/protocol.py#L1897

TODO: remove the transformers-related fields, since they are not generic.

One point needs discussion:

1. I would like to name it OpenAI, even though OpenAI models do not support a tokenize endpoint, because these fields come from OpenAI.

TODO:
For OpenAI models, we can just use a Go binding for https://github.com/openai/tiktoken.

Signed-off-by: yxia216 <[email protected]>
@hustxiayang hustxiayang requested a review from a team as a code owner December 12, 2025 16:46
@dosubot dosubot bot added the size:XXL This PR changes 1000+ lines, ignoring generated files. label Dec 12, 2025
@mathetake (Member) left a comment

  • Can you delete all of the example YAMLs under examples/tokenize? They are irrelevant here, i.e. there is no config specific to the tokenize endpoint. If you think about it, we don't have YAMLs for /messages, /rerank, etc.
  • Let's start small for a new feature like this instead of sending a giant PR, e.g. start with only AWS Bedrock. That would help you land code faster and would also help reviewers a lot. So could you remove the other code, such as the GCP translation?

@mathetake (Member) commented

@hustxiayang I see you left TODOs in the description, so let me mark the PR as draft until you feel it is ready for review after resolving them.

@mathetake mathetake marked this pull request as draft December 15, 2025 07:55