src/runtime/HACKING.md

   1 This is a living document and at times it will be out of date. It is
   2 intended to articulate how programming in the Go runtime differs from
   3 writing normal Go. It focuses on pervasive concepts rather than
   4 details of particular interfaces.
   5
   6 Scheduler structures
   7 ====================
   8
   9 The scheduler manages three types of resources that pervade the
  10 runtime: Gs, Ms, and Ps. It's important to understand these even if
  11 you're not working on the scheduler.
  12
  13 Gs, Ms, Ps
  14 ----------
  15
  16 A "G" is simply a goroutine. It's represented by type `g`. When a
  17 goroutine exits, its `g` object is returned to a pool of free `g`s and
  18 can later be reused for some other goroutine.
  19
  20 An "M" is an OS thread that can be executing user Go code, runtime
  21 code, a system call, or be idle. It's represented by type `m`. There
  22 can be any number of Ms at a time since any number of threads may be
  23 blocked in system calls.
  24
  25 Finally, a "P" represents the resources required to execute user Go
  26 code, such as scheduler and memory allocator state. It's represented
  27 by type `p`. There are exactly `GOMAXPROCS` Ps. A P can be thought of
  28 like a CPU in the OS scheduler and the contents of the `p` type like
  29 per-CPU state. This is a good place to put state that needs to be
  30 sharded for efficiency, but doesn't need to be per-thread or
  31 per-goroutine.
  32
  33 The scheduler's job is to match up a G (the code to execute), an M
  34 (where to execute it), and a P (the rights and resources to execute
  35 it). When an M stops executing user Go code, for example by entering a
  36 system call, it returns its P to the idle P pool. In order to resume
  37 executing user Go code, for example on return from a system call, it
  38 must acquire a P from the idle pool.
  39
  40 All `g`, `m`, and `p` objects are heap allocated, but are never freed,
  41 so their memory remains type stable. As a result, the runtime can
  42 avoid write barriers in the depths of the scheduler.
  43
  44 `getg()` and `getg().m.curg`
  45 ----------------------------
  46
  47 To get the current user `g`, use `getg().m.curg`.
  48
  49 `getg()` alone returns the current `g`, but when executing on the
  50 system or signal stacks, this will return the current M's "g0" or
  51 "gsignal", respectively. This is usually not what you want.
  52
  53 To determine if you're running on the user stack or the system stack,
  54 use `getg() == getg().m.curg`.
  55
  56 Stacks
  57 ======
  58
  59 Every non-dead G has a *user stack* associated with it, which is what
  60 user Go code executes on. User stacks start small (e.g., 2K) and grow
  61 or shrink dynamically.
  62
  63 Every M has a *system stack* associated with it (also known as the M's
  64 "g0" stack because it's implemented as a stub G) and, on Unix
  65 platforms, a *signal stack* (also known as the M's "gsignal" stack).
  66 System and signal stacks cannot grow, but are large enough to execute
  67 runtime and cgo code (8K in a pure Go binary; system-allocated in a
  68 cgo binary).
  69
  70 Runtime code often temporarily switches to the system stack using
  71 `systemstack`, `mcall`, or `asmcgocall` to perform tasks that must not
  72 be preempted, that must not grow the user stack, or that switch user
  73 goroutines. Code running on the system stack is implicitly
  74 non-preemptible and the garbage collector does not scan system stacks.
  75 While running on the system stack, the current user stack is not used
  76 for execution.
  77
  78 nosplit functions
  79 -----------------
  80
  81 Most functions start with a prologue that inspects the stack pointer
  82 and the current G's stack bound and calls `morestack` if the stack
  83 needs to grow.
  84
  85 Functions can be marked `//go:nosplit` (or `NOSPLIT` in assembly) to
  86 indicate that they should not get this prologue. This has several
  87 uses:
  88
  89 - Functions that must run on the user stack, but must not call into
  90   stack growth, for example because this would cause a deadlock, or
  91   because they have untyped words on the stack.
  92
  93 - Functions that must not be preempted on entry.
  94
  95 - Functions that may run without a valid G. For example, functions
  96   that run in early runtime start-up, or that may be entered from C
  97   code such as cgo callbacks or the signal handler.
  98
  99 Splittable functions ensure there's some amount of space on the stack
 100 for nosplit functions to run in and the linker checks that any static
 101 chain of nosplit function calls cannot exceed this bound.
 102
 103 Any function with a `//go:nosplit` annotation should explain why it is
 104 nosplit in its documentation comment.
 105
 106 Error handling and reporting
 107 ============================
 108
 109 Errors that can reasonably be recovered from in user code should use
 110 `panic` like usual. However, there are some situations where `panic`
 111 will cause an immediate fatal error, such as when called on the system
 112 stack or when called during `mallocgc`.
 113
 114 Most errors in the runtime are not recoverable. For these, use
 115 `throw`, which dumps the traceback and immediately terminates the
 116 process. In general, `throw` should be passed a string constant to
 117 avoid allocating in perilous situations. By convention, additional
 118 details are printed before `throw` using `print` or `println` and the
 119 messages are prefixed with "runtime:".
 120
 121 For unrecoverable errors where user code is expected to be at fault for the
 122 failure (such as racing map writes), use `fatal`.
 123
 124 For runtime error debugging, it may be useful to run with `GOTRACEBACK=system`
 125 or `GOTRACEBACK=crash`. The output of `panic` and `fatal` is as described by
 126 `GOTRACEBACK`. The output of `throw` always includes runtime frames, metadata
 127 and all goroutines regardless of `GOTRACEBACK` (i.e., equivalent to
 128 `GOTRACEBACK=system`). Whether `throw` crashes or not is still controlled by
 129 `GOTRACEBACK`.
 130
 131 Synchronization
 132 ===============
 133
 134 The runtime has multiple synchronization mechanisms. They differ in
 135 semantics and, in particular, in whether they interact with the
 136 goroutine scheduler or the OS scheduler.
 137
 138 The simplest is `mutex`, which is manipulated using `lock` and
 139 `unlock`. This should be used to protect shared structures for short
 140 periods. Blocking on a `mutex` directly blocks the M, without
 141 interacting with the Go scheduler. This means it is safe to use from
 142 the lowest levels of the runtime, but also prevents any associated G
 143 and P from being rescheduled. `rwmutex` is similar.
 144
 145 For one-shot notifications, use `note`, which provides `notesleep` and
 146 `notewakeup`. Unlike traditional UNIX `sleep`/`wakeup`, `note`s are
 147 race-free, so `notesleep` returns immediately if the `notewakeup` has
 148 already happened. A `note` can be reset after use with `noteclear`,
 149 which must not race with a sleep or wakeup. Like `mutex`, blocking on
 150 a `note` blocks the M. However, there are different ways to sleep on a
 151 `note`:`notesleep` also prevents rescheduling of any associated G and
 152 P, while `notetsleepg` acts like a blocking system call that allows
 153 the P to be reused to run another G. This is still less efficient than
 154 blocking the G directly since it consumes an M.
 155
 156 To interact directly with the goroutine scheduler, use `gopark` and
 157 `goready`. `gopark` parks the current goroutine—putting it in the
 158 "waiting" state and removing it from the scheduler's run queue—and
 159 schedules another goroutine on the current M/P. `goready` puts a
 160 parked goroutine back in the "runnable" state and adds it to the run
 161 queue.
 162
 163 In summary,
 164
 165 <table>
 166 <tr><th></th><th colspan="3">Blocks</th></tr>
 167 <tr><th>Interface</th><th>G</th><th>M</th><th>P</th></tr>
 168 <tr><td>(rw)mutex</td><td>Y</td><td>Y</td><td>Y</td></tr>
 169 <tr><td>note</td><td>Y</td><td>Y</td><td>Y/N</td></tr>
 170 <tr><td>park</td><td>Y</td><td>N</td><td>N</td></tr>
 171 </table>
 172
 173 Atomics
 174 =======
 175
 176 The runtime uses its own atomics package at `runtime/internal/atomic`.
 177 This corresponds to `sync/atomic`, but functions have different names
 178 for historical reasons and there are a few additional functions needed
 179 by the runtime.
 180
 181 In general, we think hard about the uses of atomics in the runtime and
 182 try to avoid unnecessary atomic operations. If access to a variable is
 183 sometimes protected by another synchronization mechanism, the
 184 already-protected accesses generally don't need to be atomic. There
 185 are several reasons for this:
 186
 187 1. Using non-atomic or atomic access where appropriate makes the code
 188    more self-documenting. Atomic access to a variable implies there's
 189    somewhere else that may concurrently access the variable.
 190
 191 2. Non-atomic access allows for automatic race detection. The runtime
 192    doesn't currently have a race detector, but it may in the future.
 193    Atomic access defeats the race detector, while non-atomic access
 194    allows the race detector to check your assumptions.
 195
 196 3. Non-atomic access may improve performance.
 197
 198 Of course, any non-atomic access to a shared variable should be
 199 documented to explain how that access is protected.
 200
 201 Some common patterns that mix atomic and non-atomic access are:
 202
 203 * Read-mostly variables where updates are protected by a lock. Within
 204   the locked region, reads do not need to be atomic, but the write
 205   does. Outside the locked region, reads need to be atomic.
 206
 207 * Reads that only happen during STW, where no writes can happen during
 208   STW, do not need to be atomic.
 209
 210 That said, the advice from the Go memory model stands: "Don't be
 211 [too] clever." The performance of the runtime matters, but its
 212 robustness matters more.
 213
 214 Unmanaged memory
 215 ================
 216
 217 In general, the runtime tries to use regular heap allocation. However,
 218 in some cases the runtime must allocate objects outside of the garbage
 219 collected heap, in *unmanaged memory*. This is necessary if the
 220 objects are part of the memory manager itself or if they must be
 221 allocated in situations where the caller may not have a P.
 222
 223 There are three mechanisms for allocating unmanaged memory:
 224
 225 * sysAlloc obtains memory directly from the OS. This comes in whole
 226   multiples of the system page size, but it can be freed with sysFree.
 227
 228 * persistentalloc combines multiple smaller allocations into a single
 229   sysAlloc to avoid fragmentation. However, there is no way to free
 230   persistentalloced objects (hence the name).
 231
 232 * fixalloc is a SLAB-style allocator that allocates objects of a fixed
 233   size. fixalloced objects can be freed, but this memory can only be
 234   reused by the same fixalloc pool, so it can only be reused for
 235   objects of the same type.
 236
 237 In general, types that are allocated using any of these should be
 238 marked as not in heap by embedding `runtime/internal/sys.NotInHeap`.
 239
 240 Objects that are allocated in unmanaged memory **must not** contain
 241 heap pointers unless the following rules are also obeyed:
 242
 243 1. Any pointers from unmanaged memory to the heap must be garbage
 244    collection roots. More specifically, any pointer must either be
 245    accessible through a global variable or be added as an explicit
 246    garbage collection root in `runtime.markroot`.
 247
 248 2. If the memory is reused, the heap pointers must be zero-initialized
 249    before they become visible as GC roots. Otherwise, the GC may
 250    observe stale heap pointers. See "Zero-initialization versus
 251    zeroing".
 252
 253 Zero-initialization versus zeroing
 254 ==================================
 255
 256 There are two types of zeroing in the runtime, depending on whether
 257 the memory is already initialized to a type-safe state.
 258
 259 If memory is not in a type-safe state, meaning it potentially contains
 260 "garbage" because it was just allocated and it is being initialized
 261 for first use, then it must be *zero-initialized* using
 262 `memclrNoHeapPointers` or non-pointer writes. This does not perform
 263 write barriers.
 264
 265 If memory is already in a type-safe state and is simply being set to
 266 the zero value, this must be done using regular writes, `typedmemclr`,
 267 or `memclrHasPointers`. This performs write barriers.
 268
 269 Runtime-only compiler directives
 270 ================================
 271
 272 In addition to the "//go:" directives documented in "go doc compile",
 273 the compiler supports additional directives only in the runtime.
 274
 275 go:systemstack
 276 --------------
 277
 278 `go:systemstack` indicates that a function must run on the system
 279 stack. This is checked dynamically by a special function prologue.
 280
 281 go:nowritebarrier
 282 -----------------
 283
 284 `go:nowritebarrier` directs the compiler to emit an error if the
 285 following function contains any write barriers. (It *does not*
 286 suppress the generation of write barriers; it is simply an assertion.)
 287
 288 Usually you want `go:nowritebarrierrec`. `go:nowritebarrier` is
 289 primarily useful in situations where it's "nice" not to have write
 290 barriers, but not required for correctness.
 291
 292 go:nowritebarrierrec and go:yeswritebarrierrec
 293 ----------------------------------------------
 294
 295 `go:nowritebarrierrec` directs the compiler to emit an error if the
 296 following function or any function it calls recursively, up to a
 297 `go:yeswritebarrierrec`, contains a write barrier.
 298
 299 Logically, the compiler floods the call graph starting from each
 300 `go:nowritebarrierrec` function and produces an error if it encounters
 301 a function containing a write barrier. This flood stops at
 302 `go:yeswritebarrierrec` functions.
 303
 304 `go:nowritebarrierrec` is used in the implementation of the write
 305 barrier to prevent infinite loops.
 306
 307 Both directives are used in the scheduler. The write barrier requires
 308 an active P (`getg().m.p != nil`) and scheduler code often runs
 309 without an active P. In this case, `go:nowritebarrierrec` is used on
 310 functions that release the P or may run without a P and
 311 `go:yeswritebarrierrec` is used when code re-acquires an active P.
 312 Since these are function-level annotations, code that releases or
 313 acquires a P may need to be split across two functions.
 314
 315 go:uintptrkeepalive
 316 -------------------
 317
 318 The //go:uintptrkeepalive directive must be followed by a function declaration.
 319
 320 It specifies that the function's uintptr arguments may be pointer values that
 321 have been converted to uintptr and must be kept alive for the duration of the
 322 call, even though from the types alone it would appear that the object is no
 323 longer needed during the call.
 324
 325 This directive is similar to //go:uintptrescapes, but it does not force
 326 arguments to escape. Since stack growth does not understand these arguments,
 327 this directive must be used with //go:nosplit (in the marked function and all
 328 transitive calls) to prevent stack growth.
 329
 330 The conversion from pointer to uintptr must appear in the argument list of any
 331 call to this function. This directive is used for some low-level system call
 332 implementations.