[Track] DeepSeek V3/R1 nextn progress #3472

Closed
@zhyncs

Triton Backend

@ispobock @pankajroark

FlashInfer Backend

@zhyncs @yzh119

  • compatible with MLA disabled

  • support FlashInfer nightly MLA ragged prefill and CUDA Core MLA decoding

  • support FlashInfer v0.2.0.post3 MLA ragged prefill, paged prefill, and decoding (@zhyncs @yzh119)

  • nextn parts can be shared with Triton Backend

EAGLE 2

@zhyncs @Ying1123
