Commit b014a9b

BBuf authored and jimoosciuc committed
[doc] Update benchmark_and_profiling.md (sgl-project#5449)
1 parent c4f4e55 commit b014a9b

1 file changed, +2 -0 lines changed

docs/references/benchmark_and_profiling.md

Lines changed: 2 additions & 0 deletions
@@ -64,6 +64,8 @@

 This command sets the number of prompts to 2 with `--num-prompts` argument and limits the length of output sequences to 100 with `--sharegpt-output-len` argument, which can generate a small trace file for browser to open smoothly.

+Additionally, if you want to locate the SGLang Python source code through the cuda kernel in Trace, you need to disable CUDA Graph when starting the service. This can be done by using the `--disable-cuda-graph` parameter in the command to start the service.
+
 ## Profile with Nsight

 [Nsight systems](https://docs.nvidia.com/nsight-systems/) is an advanced tool that exposes more profiling details, such as register and shared memory usage, annotated code regions and low-level CUDA APIs and events.
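
For reference, the two added lines describe passing `--disable-cuda-graph` when the service is started. A minimal sketch of what that launch command might look like follows. The model path is a hypothetical placeholder and is not part of this commit, and the snippet only prints the command so it can be inspected before running.

```shell
# Hypothetical model path, used only for illustration; substitute your own.
MODEL_PATH="meta-llama/Llama-3.1-8B-Instruct"

# Build the launch command with CUDA Graph disabled, as the added doc lines
# describe, so kernels in the trace can be mapped back to Python source.
LAUNCH_CMD="python -m sglang.launch_server --model-path ${MODEL_PATH} --disable-cuda-graph"

# Print the command for review (dry run).
echo "${LAUNCH_CMD}"

# To actually start the server, uncomment the next line:
# eval "${LAUNCH_CMD}"
```

Disabling CUDA Graph usually costs some performance, so this flag is probably best limited to profiling runs rather than benchmarks.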
