sgl-project · hnyls2002 · Apr 25, 2025 · Mar 17, 2025 · Mar 17, 2025 · Mar 20, 2025
diff --git a/benchmark/mmmu/README.md b/benchmark/mmmu/README.md
@@ -22,9 +22,18 @@ It's recommended to reduce the memory usage by appending something ike `--mem-fr
 python benchmark/mmmu/bench_hf.py --model-path Qwen/Qwen2-VL-7B-Instruct
 ```
 
-Some popular model results:
-
-1. Qwen/Qwen2-VL-2B-Instruct: 0.241
-2. Qwen/Qwen2-VL-7B-Instruct: 0.255
-3. Qwen/Qwen2.5-VL-3B-Instruct: 0.245
-4. Qwen/Qwen2.5-VL-7B-Instruct: 0.242
+Benchmark Results:
+
+| Model                                  | SGLang | HuggingFace |
+|----------------------------------------|--------|-------------|
+| Qwen2-VL-7B-Instruct                   | 0.476  | -           |
+| Qwen2.5-VL-7B-Instruct                 | 0.477  | -           |
+| MiniCPM-V-2.6                          | 0.435  | -           |
+| MiniCPM-O-2_6                          | 0.401  | -           |
+| Deepseek-Janus-Pro-7B                  | 0.373  | -           |
+| Deepseek-VL2                           | 0.405  | -           |
+| Gemma-3-it-4B                          | 0.41   | -           |
+| llama3-llava-next-8b                   | 0.245  | -           |
+| llava-v1.6-mistral-7b-sglang           | 0.338  | -           |
+| llava-onevision-qwen2-7b-ov            | 0.423  | -           |
+| Mllama - Llama-3.2-11B-Vision-Instruct | 0.321  | -           |