Commit 7cf3a56

ZhuJiaqi9905 authored and jimoosciuc committed
fix: fix typo of comments in w8a8_fp8.py (sgl-project#4843)
1 parent 01d446c commit 7cf3a56

1 file changed, +1 -1 lines changed

python/sglang/srt/layers/quantization/w8a8_fp8.py

Lines changed: 1 addition & 1 deletion
@@ -37,7 +37,7 @@ class W8A8Fp8Config(QuantizationConfig):
     Note:
     - For models without offline quantization, weights will be quantized during model loading
     - If CUTLASS is supported: Per-channel weight quantization is used
-    - If CUTLASS is not supported: Falls back to per-token weight quantization
+    - If CUTLASS is not supported: Falls back to per-tensor weight quantization
     """

     def __init__(self, is_checkpoint_fp8_serialized: bool = False):
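
Note on the corrected wording: "per-token" normally describes activation scales (one scale per token row), while weights are scaled per tensor or per output channel, which is presumably why the docstring was fixed. The sketch below only illustrates that difference in granularity. It is not the sglang implementation: the helper names `per_tensor_scale` and `per_channel_scales` are hypothetical, the weight matrix is made-up sample data, and it assumes the FP8 E4M3 format, whose largest finite magnitude is 448.

```python
# Illustrative sketch only; not taken from w8a8_fp8.py.
FP8_E4M3_MAX = 448.0  # largest finite magnitude in the float8 E4M3 (fn) format


def per_tensor_scale(weight):
    """Return one scale shared by the whole weight matrix."""
    amax = max(abs(v) for row in weight for v in row)
    return amax / FP8_E4M3_MAX


def per_channel_scales(weight):
    """Return one scale per output channel (one per row of the matrix)."""
    return [max(abs(v) for v in row) / FP8_E4M3_MAX for row in weight]


# Hypothetical 2x3 weight matrix: rows are output channels.
weight = [
    [0.5, -2.0, 1.0],     # channel 0 has the largest magnitudes
    [0.01, 0.02, -0.04],  # channel 1 has much smaller values
]

tensor_scale = per_tensor_scale(weight)      # a single float
channel_scales = per_channel_scales(weight)  # one float per row
```

With per-tensor scaling, the small-valued channel shares the scale set by the largest entry in the matrix, so it uses only a narrow part of the FP8 range. Per-channel scaling gives that row its own, smaller scale.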
