runtime: support resuming a single goroutine under debuggers

[CL 109699](https://golang.org/cl/109699) added support for debugger function call injection, but has an annoying limitation: it requires that the debugger resume the entire Go process after injecting the function call (and, to inject into a runnable but not running goroutine, it requires resuming the entire process even before injecting the call).

@heschik argued that this is a pretty bad experience. E.g., all the user wants to do is call String() to format something, and the entire process moves under them in the meantime. It's also different from what other debuggers do, which could surprise users.

This is tricky to solve. Simply resuming only the thread where the call was injected doesn't work because 1) it could easily lead to runtime-level deadlocks if any other thread is in the runtime, 2) the runtime could just switch to a different goroutine on that thread, and 3) if the GC kicks in it will try to cooperatively stop the other threads and deadlock.

I think solving this requires at least a little help from the runtime to pause all other user goroutines during the injected call. I'm not sure what exact form this should take, but I'm imagining the debugger could use call injection to first inject a runtime call to stop user goroutines, and then inject the real call.

However, even this gets tricky with non-cooperative safe points (e.g., the runtime would need to use the register maps to immediately preempt the other goroutines rather than waiting for them to reach a cooperative safe point) and with goroutines that are at unsafe points (particularly goroutines that are in the runtime). One possibility would be to have the debugger inject this "stop" call on every running thread. Using the call injection mechanism takes care of stopping at non-cooperative points, and would give the debugger the opportunity to step other goroutines past unsafe points and out of the runtime before injecting the stop. This puts some complexity into the debugger, but it should already have most of the core mechanisms necessary to do this (such as single stepping ability). Specifically, I'm picturing a protocol like:

1. For each thread, attempt to inject a `runtime.debugStop` call. Let all threads resume.
2. These calls will notify the debugger when the goroutine is successfully stopped, or the debug call injection will fail.
3. For injection that failed because the thread is in the runtime, unwind the stack and insert a breakpoint at the first return to user code. At that breakpoint attempt another `debugStop`. For injection that failed because the thread is at an unsafe point, single step the thread, attempting to inject `debugStop` at each step.
4. Let the remaining threads continue running. Repeat steps 2 through 4 until all threads are stopped.

This is basically a debugger-assisted non-cooperative stop-the-world. For Go 1.12, I plan to implement non-cooperative preemption directly in the runtime, which may move much of this logic into the runtime itself.

/cc @aarzilli 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

runtime: support resuming a single goroutine under debuggers #25578

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Participants

runtime: support resuming a single goroutine under debuggers #25578

Description

Activity

aarzilli commented on May 28, 2018

aarzilli commented on May 28, 2018

derekparker commented on May 29, 2018

aclements commented on May 29, 2018

gopherbot commented on Sep 11, 2018

derekparker commented on Feb 27, 2019

prattmic commented on Sep 6, 2022

prattmic commented on Sep 6, 2022

aarzilli commented on Sep 12, 2022

prattmic commented on Sep 12, 2022

prattmic commented on Sep 12, 2022

aarzilli commented on Sep 13, 2022

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Participants

Issue actions