HabanaAI / vllm-fork Public

forked from vllm-project/vllm

Fork 113
Star 76

Pull requests: HabanaAI/vllm-fork

97 Open 1,335 Closed

Pull requests list

remove token_types for non-rerank models

#1522 opened Jul 3, 2025 by gyou2021

Fix proxy with right content type

#1519 opened Jul 3, 2025 by zhenwei-intel

Add multi-image prompt support for benchmark offline test

#1518 opened Jul 3, 2025 by Jianhong-Zhang

add split qkv to gemma3

#1517 opened Jul 2, 2025 by skaulintel

[Misc] Allow AutoWeightsLoader to skip loading weights with specific substr in name

#1514 opened Jul 2, 2025 by kwisniewski98

Readme warmup update

#1512 opened Jul 2, 2025 by adobrzyn

[deepseek_r1] refine _schedule_prefills for prompts with large length range

#1511 opened Jul 2, 2025 by yangulei

Embedding fix: warmup failure in embedding model

#1510 opened Jul 2, 2025 by shepark

[SW-233526]Fix MLA and deepseek modeling for 9.0.1 rebase

#1509 opened Jul 1, 2025 by xuechendi

Rebase 0.9.0.1

#1507 opened Jul 1, 2025 by michalkuligowski

docker vllm: add new configs

#1506 opened Jul 1, 2025 by tthaddey

Add runtime FP8 conversion for gaudi2

#1505 opened Jul 1, 2025 by kwisniewski98

Port high-level profiler to V1 engine

#1501 opened Jul 1, 2025 by jkaniecki

vllm hpu-extension for automatization of long context prompt

#1499 opened Jun 30, 2025 by iboiko-habana

vllm hpu-extension for automatization of long context

#1498 opened Jun 30, 2025 by iboiko-habana

Fix structure output

#1494 opened Jun 30, 2025 by inkcherry

[DeepSeek R1] Initial torch.compile support

#1492 opened Jun 28, 2025 by xinyu-intel

use fused RoPE kernel in DeepseekScalingRotaryEmbedding

#1488 opened Jun 27, 2025 by yangulei

Fix shutdown issue of inc during exit

#1487 opened Jun 27, 2025 by mengniwang95

adding wpyszka to codeowners

#1480 opened Jun 25, 2025 by wpyszka

Added multi-image payload json (from cutomer images) and .sh files with certain # of tokens

#1475 opened Jun 24, 2025 by gilliean • Draft

Remove sync point in warmup

#1472 opened Jun 24, 2025 by kzawora-intel

Enable vision bucketing/warmup for gemma3 model

#1470 opened Jun 23, 2025 by libinta

Sasarkar/jha/sliding window gemma3 1

#1460 opened Jun 23, 2025 by libinta • Draft

Fix the script file typo in README file

#1458 opened Jun 22, 2025 by taotod

