
Fix/issue 10113 embeddings use non default tokenizer #10629


Conversation

camfarineau (Contributor) commented May 7, 2025

Title

Embeddings: Do not use the default gpt-3.5-turbo tokenizer

Relevant issues

Fixes #10113

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

  • I have added testing in the tests/litellm/ directory (adding at least 1 test is a hard requirement - see details)
  • I have added a screenshot of my new test passing locally
  • My PR passes all unit tests on make test-unit
  • My PR's scope is as isolated as possible, it only solves 1 specific problem

Type

🐛 Bug Fix

Changes

  • embeddings: allow passthrough of a list of lists of tokens to self-hosted vLLM models (see the sketch below)
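
As a rough illustration of the behavior this enables (a minimal sketch; the proxy URL, API key, model alias, and token IDs are placeholder assumptions, not values from this PR):

```python
# Minimal sketch, assuming a LiteLLM proxy running locally with a
# "hosted_vllm/..." embedding model registered under the alias "my-vllm-embeddings".
# The URL, key, alias, and token IDs below are placeholders.
import openai

client = openai.OpenAI(base_url="http://localhost:4000", api_key="sk-1234")

# Input is a list of lists of token IDs; with this fix the proxy passes the
# token arrays through to the hosted_vllm model instead of decoding them
# with the default gpt-3.5-turbo tokenizer first.
response = client.embeddings.create(
    model="my-vllm-embeddings",
    input=[[15339, 1917], [15339, 4435]],
)
print(len(response.data))
```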

litellm_issue_10113_unit_test_local


vercel bot commented May 7, 2025

The latest updates on your projects.

| Name | Status | Preview | Comments | Updated (UTC) |
| --- | --- | --- | --- | --- |
| litellm | ✅ Ready (Inspect) | Visit Preview | 💬 Add feedback | May 13, 2025 8:46am |

CLAassistant commented May 7, 2025

CLA assistant check
All committers have signed the CLA.

```diff
@@ -3814,14 +3814,15 @@ async def embeddings(  # noqa: PLR0915
     if m["model_name"] == data["model"] and (
         m["litellm_params"]["model"] in litellm.open_ai_embedding_models
         or m["litellm_params"]["model"].startswith("azure/")
+        or m["litellm_params"]["model"].startswith("hosted_vllm/")
```
Contributor:
add a unit test for this in test_proxy_server.py in tests/litellm - so there are no future regressions.
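
A rough sketch of what such a regression test could look like (illustrative only; the test name, helper, and model entries are assumptions, and the check is re-implemented inline so the example is self-contained - this is not the test added in the PR):

```python
# tests/litellm/test_proxy_server.py - illustrative sketch only.
import pytest

TOKEN_ARRAY_PROVIDER_PREFIXES = ("azure/", "hosted_vllm/")  # hypothetical constant


def accepts_token_arrays(requested_model: str, llm_model_list: list) -> bool:
    """True if the deployment backing `requested_model` can take raw token arrays."""
    for m in llm_model_list:
        if m["model_name"] != requested_model:
            continue  # not the deployment the request is targeting
        return m["litellm_params"]["model"].startswith(TOKEN_ARRAY_PROVIDER_PREFIXES)
    return False


@pytest.mark.parametrize(
    "underlying_model, expected",
    [
        ("hosted_vllm/BAAI/bge-small-en-v1.5", True),  # pass token arrays through
        ("cohere/embed-english-v3.0", False),          # fall back to decoding
    ],
)
def test_embeddings_token_array_passthrough(underlying_model, expected):
    llm_model_list = [
        {"model_name": "my-embeddings", "litellm_params": {"model": underlying_model}},
    ]
    assert accepts_token_arrays("my-embeddings", llm_model_list) is expected
```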

Contributor:

ideally - this can be a list in constants.py - stating the list of LLM providers which support an array of tokens as input

constants.py file -

DEFAULT_BATCH_SIZE = 512

example list -

LITELLM_CHAT_PROVIDERS = [

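A sketch of what the suggested constant could look like (the constant name and the exact set of providers are assumptions for illustration, not the constant that ships in litellm/constants.py):

```python
# litellm/constants.py (illustrative sketch; name and contents are hypothetical)

# Providers whose embedding endpoints accept an array (or array of arrays)
# of token IDs as `input`, so the proxy can pass tokens through without
# decoding them with the default gpt-3.5-turbo tokenizer.
EMBEDDING_PROVIDERS_SUPPORTING_INPUT_ARRAY_OF_TOKENS = [
    "openai",
    "azure",
    "hosted_vllm",
]
```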
…accept arrays of tokens as input

When a list of tokens is passed as input, determine the provider of the model by going through the list of models (`llm_model_list`): first match on the model name, then get that deployment's provider and check whether it accepts arrays of tokens. If it does, pass the tokens through; otherwise decode them.
Previously, the provider and the model name were checked in the same condition while looping over `llm_model_list`, so the input could be decoded even when the entry being checked was not the target model. A simplified sketch of the fixed flow is below.
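
A minimal sketch of the described flow (simplified and given its own helper name for illustration; this is not the literal proxy_server.py code):

```python
# Simplified illustration of the two-step check described above.
import litellm


def should_decode_token_input(data: dict, llm_model_list: list) -> bool:
    """Return True if a token-array input must be decoded before calling the model."""
    for m in llm_model_list:
        # Step 1: only consider the deployment whose public model name matches
        # the name in the request.
        if m["model_name"] != data["model"]:
            continue
        # Step 2: look at that deployment's underlying model/provider and decide
        # whether it accepts arrays of tokens directly.
        underlying = m["litellm_params"]["model"]
        accepts_token_arrays = (
            underlying in litellm.open_ai_embedding_models
            or underlying.startswith(("azure/", "hosted_vllm/"))
        )
        return not accepts_token_arrays
    # No matching deployment found: fall back to decoding.
    return True
```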
camfarineau (Contributor, Author):

Screenshot of the new test passing locally
litellm_issue_10113_unit_test_local

camfarineau marked this pull request as ready for review May 13, 2025 08:46
krrishdholakia merged commit b88e56e into BerriAI:main May 15, 2025
6 checks passed
Development

Successfully merging this pull request may close these issues.

[Bug]: Proxy Server - Embeddings - Error while trying to decode tokens from non-OpenAI models using a default gpt-3.5-turbo tokenizer
3 participants