commit 3cdd398 (1 parent: 82d034d)
python/sglang/srt/layers/quantization/fp8.py
@@ -860,7 +860,7 @@ def process_weights_hip_int4(self, layer: Module):
            layer.w13_weight_scale1[expert_id] *= max_w13_scales[expert_id]
            layer.w2_weight_scale1[expert_id] *= layer.w2_weight_scale[expert_id]

-    def process_weights_hip_scale_padding(self, layer: Module, padding_size: int):
+    def process_weights_hip_scale_padding(self, layer: Module):
        from sglang.srt.layers.moe.fused_moe_triton.fused_moe import (
            padding_size,  # Avoid circular import
        )
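The change drops the `padding_size` parameter and instead pulls the constant in with a function-scope import, so `fp8.py` no longer needs a module-level import of `fused_moe` that would be circular. As a point of reference only, below is a minimal, self-contained sketch of that deferred-import pattern; the module names (`pkg_a`, `pkg_b`), the constant value, and the `pad` helper are illustrative assumptions, not the actual sglang layout.

```python
# Minimal sketch of the deferred-import pattern (illustrative names, not the
# real sglang modules). "pkg_a" imports "pkg_b" at module level, while
# "pkg_b" imports a constant from "pkg_a" only inside a function, so the two
# modules can depend on each other without a circular-import error.
import sys
import tempfile
import textwrap
from pathlib import Path

with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "pkg_a.py").write_text(textwrap.dedent("""
        import pkg_b                 # module-level dependency on pkg_b
        padding_size = 128           # illustrative module-level constant
    """))
    (root / "pkg_b.py").write_text(textwrap.dedent("""
        def pad(size):
            # Deferred import: resolving the constant at call time avoids a
            # circular import with pkg_a, which imports pkg_b at load time.
            from pkg_a import padding_size
            return size + padding_size
    """))
    sys.path.insert(0, tmp)
    import pkg_a
    print(pkg_a.pkg_b.pad(0))  # prints 128
```

Had `pkg_b` imported the constant at module top level instead, loading `pkg_a` would fail, because `pkg_a` is still mid-initialization when `pkg_b` runs; that is the failure mode the function-scope import in the diff sidesteps.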