multiple prompts in a batch is not currently supported

Is this planned? It seems like a good idea to support the full OpenAI behavior, and since vLLM already handles batching well, I would guess this should be relatively easy to add?

https://github.com/vllm-project/vllm/blob/acbed3ef40f015fcf64460e629813922fab90380/vllm/entrypoints/openai/api_server.py#L396-L401

	elif isinstance(first_element, (str, list)):
	    # TODO: handles multiple prompt case in list[list[int]]
	    if len(request.prompt) > 1:
	        return create_error_response(
	            HTTPStatus.BAD_REQUEST,
	            "multiple prompts in a batch is not currently supported")
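
To make the request concrete, here is a rough sketch of how the `prompt` field could be normalized into a list of individual prompts before they are handed to the engine. This is not vLLM's actual code: `normalize_prompts` and `PromptInput` are hypothetical names I made up for illustration, and the only thing assumed is that `prompt` may arrive in the four shapes the OpenAI completions API accepts (a string, a list of strings, a list of token IDs, or a list of token ID lists).

```python
from typing import List, Tuple, Union

# The four prompt shapes accepted by the OpenAI completions API.
PromptInput = Union[str, List[str], List[int], List[List[int]]]


def normalize_prompts(prompt: PromptInput) -> Tuple[bool, list]:
    """Hypothetical helper: return (is_token_ids, prompts).

    `prompts` is always a list with one entry per individual prompt,
    so the caller can loop over it instead of rejecting len > 1.
    """
    if isinstance(prompt, str):
        # Single text prompt.
        return False, [prompt]
    if not isinstance(prompt, list) or len(prompt) == 0:
        raise ValueError("prompt must be a non-empty string or list")

    first = prompt[0]
    if isinstance(first, str):
        # Batch of text prompts.
        if not all(isinstance(p, str) for p in prompt):
            raise ValueError("mixed prompt types are not supported")
        return False, list(prompt)
    if isinstance(first, int):
        # A flat list of ints is ONE prompt given as token IDs.
        if not all(isinstance(t, int) for t in prompt):
            raise ValueError("mixed prompt types are not supported")
        return True, [list(prompt)]
    if isinstance(first, list):
        # Batch of token-ID prompts.
        if not all(isinstance(p, list) for p in prompt):
            raise ValueError("mixed prompt types are not supported")
        return True, [list(p) for p in prompt]
    raise ValueError("unsupported prompt type")
```

With something like this, the server could submit one generation request per entry and merge the results into a single response, with each choice's `index` following the order of the prompts. How that would plug into the existing handler is left open here.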

multiple prompts in a batch is not currently supported #1270
