[vLLM] add vLLM offline inference #47

Merged
aikx merged 6 commits into baaivision:main from zhaoyinglia:inference_vllm
Nov 19, 2025
Conversation


@zhaoyinglia zhaoyinglia commented Nov 19, 2025

PR Description

Add vLLM backend support to enable efficient inference for Emu3.5 AR.
New features include:

  • A batch scheduler that coordinates cond_input and uncond_input requests.
  • A custom logits processor: ClassifierFreeGuidanceLogitsForVisualTokenProcessor.
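
The processor's name suggests standard classifier-free guidance applied to the visual-token logits. As a minimal sketch (the function and argument names here are illustrative, not taken from the PR), the core blend of conditional and unconditional logits looks like this:

```python
def cfg_combine(cond_logits, uncond_logits, guidance_scale):
    """Classifier-free guidance blend:
    guided = uncond + scale * (cond - uncond).

    With scale == 1.0 this reduces to the conditional logits;
    larger scales push further toward the conditional branch.
    Plain lists are used here for clarity; the real processor
    operates on torch tensors inside vLLM's sampling loop.
    """
    return [u + guidance_scale * (c - u)
            for c, u in zip(cond_logits, uncond_logits)]
```

This is why the batch scheduler must keep cond_input and uncond_input in lockstep: both sets of logits are needed at every decoding step before a visual token can be sampled.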

Performance is presented below, as reported in the technical report.

[Screenshot (2025-11-19): performance table from the technical report]
  • Usage
# Requires Python 3.12 or higher.
pip install -r requirements/vllm.txt # vllm==0.11.0, torch==2.8.0+cu128
pip install flash_attn==2.8.3 --no-build-isolation

cd Emu3.5
python src/patch/apply.py # apply all *.patch files based on vllm-0.11.0
# 🖼️ Text-to-Image (T2I) task
CUDA_VISIBLE_DEVICES=0,1 python inference_vllm.py --cfg configs/example_config_t2i.py

# 🔄 Any-to-Image (X2I) task
CUDA_VISIBLE_DEVICES=0,1 python inference_vllm.py --cfg configs/example_config_x2i.py

# 🎯 Visual Guidance task
CUDA_VISIBLE_DEVICES=0,1 python inference_vllm.py --cfg configs/example_config_visual_guidance.py

# 📖 Visual Narrative task
CUDA_VISIBLE_DEVICES=0,1 python inference_vllm.py --cfg configs/example_config_visual_narrative.py
  • Note:
    vLLM's gpu_memory_utilization for kv_cache defaults to 0.7 on an 80GiB device. Adjust as needed for your hardware.
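
For reference, vLLM's engine exposes gpu_memory_utilization as a constructor argument (the fraction of GPU memory reserved for weights plus KV cache). How Emu3.5's example configs surface this knob is not shown in the PR, so the snippet below is only an illustrative config fragment with assumed values:

```python
# Illustrative only: the model path and parallelism values are assumptions.
# gpu_memory_utilization is a real vLLM LLM/EngineArgs parameter; lowering it
# shrinks the KV-cache reservation on smaller GPUs.
from vllm import LLM

llm = LLM(
    model="path/to/emu3.5",      # hypothetical checkpoint path
    tensor_parallel_size=2,      # matches CUDA_VISIBLE_DEVICES=0,1 above
    gpu_memory_utilization=0.5,  # lower than the 0.7 used on 80 GiB devices
)
```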

@aikx aikx merged commit 1e40e5b into baaivision:main Nov 19, 2025