You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
vLLM supports some parameters that are not supported by OpenAI, `top_k` for example.
34
34
You can pass these parameters to vLLM using the OpenAI client in the `extra_body` parameter of your requests, i.e. `extra_body={"top_k": 50}` for `top_k`.
35
35
:::
36
+
36
37
:::{important}
37
38
By default, the server applies `generation_config.json` from the Hugging Face model repository if it exists. This means the default values of certain sampling parameters can be overridden by those recommended by the model creator.
38
39
39
40
To disable this behavior, please pass `--generation-config vllm` when launching the server.
0 commit comments