[Model] Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1 #5073

kylehh · 2025-04-04T22:11:47Z

Motivation

Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1

Modifications

Migration nemotron_nas from vllm

Checklist

Format your code according to the Code Formatting with Pre-Commit.
Add unit tests as outlined in the Running Unit Tests.
Update documentation / docstrings / example tutorials as needed, according to Writing Documentation.
Provide throughput / latency benchmark results and accuracy evaluation results as needed, according to Benchmark and Profiling and Accuracy Results.
For reviewers: If you haven't made any contributions to this PR and are only assisting with merging the main branch, please remove yourself as a co-author when merging the PR.
Please feel free to join our Slack channel at https://slack.sglang.ai to discuss your PR.

merrymercy · 2025-04-27T22:28:37Z

python/sglang/srt/utils.py

@@ -411,6 +425,375 @@ class LayerFn(Protocol):
    def __call__(self, layer_id: int, prefix: str) -> torch.nn.Module: ...


+class PPMissingLayer(torch.nn.Identity):


Do not add these first. Can you drop all of them?

Missinglayer class was migrated from vllm implementation, which was created by Nemotron team. Keep it for code consistent and future proof. Do these cause the failure of CI pipeline?

zhyncs · 2025-07-05T06:26:09Z

@kylehh please rebase. thanks.

root and others added 9 commits April 2, 2025 23:09

init commit

c7fa255

import nemotron_nas

948b093

update sampler and compute_logits

3843751

logit_process update

2b6fec9

add test

f4dd95d

config update

44b1607

remove test py

2e927a4

code reformat

52754ad

formatting

6b30430

kylehh requested review from merrymercy, Ying1123, hnyls2002, zhyncs, ispobock and ByronHsu as code owners April 4, 2025 22:11

kylehh mentioned this pull request Apr 5, 2025

[Bug] Testing new Llama-3_3-Nemotron-Super-49B-v1 by Nvidia: "Model architectures ['DeciLMForCausalLM'] are not supported for now." #4689

Open

5 tasks

kylehh and others added 3 commits April 5, 2025 02:12

format

4952a25

Merge branch 'main' into khuang-nemotron

e1cde31

update to support Nvidia Nemotron Ultra

1e63870

merrymercy added the ready-to-merge The PR is ready to merge after the CI is green. label Apr 21, 2025

Merge branch 'main' into khuang-nemotron

7461fb2

merrymercy requested changes Apr 27, 2025

View reviewed changes

zhyncs assigned yizhang2077 Jul 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Model] Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1 #5073

[Model] Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1 #5073

Uh oh!

kylehh commented Apr 4, 2025

Uh oh!

merrymercy Apr 27, 2025

Uh oh!

kylehh Apr 28, 2025

Uh oh!

zhyncs commented Jul 5, 2025

Uh oh!

Uh oh!

		@@ -411,6 +425,375 @@ class LayerFn(Protocol):
		def __call__(self, layer_id: int, prefix: str) -> torch.nn.Module: ...


		class PPMissingLayer(torch.nn.Identity):

[Model] Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1 #5073

Are you sure you want to change the base?

[Model] Add support for nvidia/Llama-3_3-Nemotron-Super-49B-v1 #5073

Uh oh!

Conversation

kylehh commented Apr 4, 2025

Motivation

Modifications

Checklist

Uh oh!

merrymercy Apr 27, 2025

Choose a reason for hiding this comment

Uh oh!

kylehh Apr 28, 2025

Choose a reason for hiding this comment

Uh oh!

zhyncs commented Jul 5, 2025

Uh oh!

Uh oh!