Commit ed238c6

BBuf authored and xwu-intel committed
[doc] add a note for --n-share-experts-fusion args (sgl-project#6154)
1 parent 3668a2f commit ed238c6

1 file changed: +1 -1 lines changed

python/sglang/srt/server_args.py

Lines changed: 1 addition & 1 deletion
@@ -1194,7 +1194,7 @@ def add_cli_args(parser: argparse.ArgumentParser):
             type=int,
             default=0,
             help="The number of shared_experts need to be replicated to fuse with normal experts in deepseek v3/r1, "
-            "set it to tp_size can get best optimized performance.",
+            "set it to tp_size can get best optimized performance. Note that for architectures with SM==90, we have enabled the shared experts fusion optimization by default for DeepSeek V3/R1, with n_share_experts_fusion automatically set to the TP size.",
         )
         parser.add_argument(
             "--disable-chunked-prefix-cache",
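
The defaulting behavior described in the new help text can be sketched roughly as follows. This is an illustration only, not the code in `server_args.py`: the helper `resolve_n_share_experts_fusion`, its parameters (`sm_version`, `is_deepseek_v3_or_r1`), and the `--tp-size` stand-in argument are hypothetical. It also assumes that an explicitly passed non-zero value is kept as-is, which the help text does not state.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a tiny parser that mirrors only the flag touched by this commit."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--n-share-experts-fusion",
        type=int,
        default=0,
        help="Number of shared experts to replicate and fuse with normal experts.",
    )
    # Stand-in for the tensor-parallel size; used here only for illustration.
    parser.add_argument("--tp-size", type=int, default=1)
    return parser


def resolve_n_share_experts_fusion(
    requested: int,
    tp_size: int,
    sm_version: int,
    is_deepseek_v3_or_r1: bool,
) -> int:
    """Hypothetical helper: choose the effective fusion count.

    Assumption: when the flag is left at its default (0) on an SM==90 device
    running DeepSeek V3/R1, the value falls back to the TP size. Any explicit
    non-zero value is returned unchanged.
    """
    if requested == 0 and is_deepseek_v3_or_r1 and sm_version == 90:
        return tp_size
    return requested


# Example: flag not passed, TP size 8, SM 90 device, DeepSeek V3/R1 model.
args = build_parser().parse_args(["--tp-size", "8"])
effective = resolve_n_share_experts_fusion(
    args.n_share_experts_fusion, args.tp_size, sm_version=90, is_deepseek_v3_or_r1=True
)
print(effective)
```

Under these assumptions, the same call with a different `sm_version` or a non-DeepSeek model leaves the value at the parsed default of 0.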
