Closed
Description
Checklist
- 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.
- 2. Please use English, otherwise it will be closed.
Motivation
Currently, if a Prefill node fails, the Decode side may hang during data transfer if cached information about the failed node is still in use. A mechanism is needed to notify Decode to re-establish the connection with a healthy Prefill node.
Related resources
No response
Metadata
Metadata
Assignees
Labels
No labels