Skip to content

python scripts #36

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
49 changes: 43 additions & 6 deletions .github/workflows/manual-workflow.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3,11 +3,42 @@ run-name: "Deploy main code to staging env to benchmark"
on:
workflow_dispatch: # 只允许手动触发
inputs:
concurrency:
prefill_replica:
description: '预填充副本数'
required: false
default: "16"
description: "最大并发数设置"
default: '2'
type: string
prefill_tp:
description: '预填充 TP 值'
required: false
default: '16'
type: string
prefill_dp:
description: '预填充 DP 值'
required: false
default: '4'
type: string
decode_replica:
description: '解码副本数'
required: false
default: '2'
type: string
decode_tp:
description: '解码 TP 值'
required: false
default: '16'
type: string
decode_dp:
description: '解码 DP 值'
required: false
default: '16'
type: string
max_concurrencies:
description: '最大并发数(逗号分隔的多个值)'
required: false
default: '1024'
type: string


concurrency:
group: "merge-pr-workflow"
Expand Down Expand Up @@ -38,9 +69,15 @@ jobs:

- name: Start the benchmark
run: |
ssh staging "bash -c 'export MAX_CONCURRENCIES=${{ github.event.inputs.concurrency }}; cd /root/sglang-auto/benchmark/furion-cn && ./auto_benchmark.sh'"
env:
MAX_CONCURRENCIES: ${{ github.event.inputs.concurrency }}
ssh staging "bash -c 'cd /root/sglang-auto/benchmark/furion-cn && \
python3 auto_benchmark.py run \
--prefill-replica ${{ github.event.inputs.prefill_replica }} \
--prefill-tp ${{ github.event.inputs.prefill_tp }} \
--prefill-dp ${{ github.event.inputs.prefill_dp }} \
--decode-replica ${{ github.event.inputs.decode_replica }} \
--decode-tp ${{ github.event.inputs.decode_tp }} \
--decode-dp ${{ github.event.inputs.decode_dp }} \
--max-concurrencies ${{ github.event.inputs.max_concurrencies }}'"

- name: Clean up
run: |
Expand Down
Loading