We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent a2a2a97 commit fd9062fCopy full SHA for fd9062f
docs/source/getting_started/installation/vke.rst
@@ -53,9 +53,10 @@ Once the model is ready and running, you can test it by running:
53
LB_IP=$(kubectl get svc/envoy-aibrix-system-aibrix-eg-903790dc -n envoy-gateway-system -o=jsonpath='{.status.loadBalancer.ingress[0].ip}')
54
ENDPOINT="${LB_IP}:80"
55
56
+ # To test the disaggregation model, change the routing-strategy header to "pd"
57
curl http://${ENDPOINT}/v1/chat/completions \
58
-H "Content-Type: application/json" \
- -H "routing-strategy: random" \ # change to `pd` if you deployed in disaggregation mode
59
+ -H "routing-strategy: random" \
60
-d '{
61
"model": "deepseek-r1-distill-llama-8b",
62
"messages": [
0 commit comments