#lang scribble/manual @(require "mz.rkt" scribble/bnf) @title[#:tag "memory" #:style 'toc]{Memory Management} @local-table-of-contents[] @section[#:tag "weakbox"]{Weak Boxes} A @deftech{weak box} is similar to a normal box (see @secref["boxes"]), but when the garbage collector (see @secref["gc-model"]) can prove that the content value of a weak box is only reachable via weak references, the content of the weak box is replaced with @racket[#f]. A @defterm{@tech{weak reference}} is a reference through a weak box, through a key reference in a weak hash table (see @secref["hashtables"]), through a value in an @tech{ephemeron} where the value can be replaced by @racket[#f] (see @secref["ephemerons"]), or through a custodian (see @secref["custodians"]). @defproc[(make-weak-box [v any/c]) weak-box?]{ Returns a new weak box that initially contains @racket[v].} @defproc[(weak-box-value [weak-box weak-box?] [gced-v any/c #f]) any/c]{ Returns the value contained in @racket[weak-box]. If the garbage collector has proven that the previous content value of @racket[weak-box] was reachable only through a weak reference, then @racket[gced-v] (which defaults to @racket[#f]) is returned.} @defproc[(weak-box? [v any/c]) boolean?]{ Returns @racket[#t] if @racket[v] is a weak box, @racket[#f] otherwise.} @;------------------------------------------------------------------------ @section[#:tag "ephemerons"]{Ephemerons} An @deftech{ephemeron} @cite{Hayes97} is a generalization of a @tech{weak box} (see @secref["weakbox"]). Instead of just containing one value, an ephemeron holds two values: one that is considered the value of the ephemeron and another that is the ephemeron's key. Like the value in a weak box, the value in an ephemeron may be replaced by @racket[#f], but when the @emph{key} is no longer reachable (except possibly via weak references) instead of when the value is no longer reachable. As long as an ephemeron's value is retained, the reference is considered a non-weak reference. References to the key via the value are treated specially, however, in that the reference does not necessarily count toward the key's reachability. A @tech{weak box} can be seen as a specialization of an ephemeron where the key and value are the same. One particularly common use of ephemerons is to combine them with a weak hash table (see @secref["hashtables"]) to produce a mapping where the memory manager can reclaim key--value pairs even when the value refers to the key. A related use is to retain a reference to a value as long as any value for which it is an @tech{impersonator} is reachable; see @racket[impersonator-ephemeron]. More precisely, @itemize[ @item{the value in an ephemeron is replaced by @racket[#f] when the automatic memory manager can prove that either the ephemeron or the key is reachable only through weak references (see @secref["weakbox"]); and} @item{nothing reachable from the value in an ephemeron counts toward the reachability of an ephemeron key (whether for the same ephemeron or another), unless the same value is reachable through a non-weak reference, or unless the value's ephemeron key is reachable through a non-weak reference (see @secref["weakbox"] for information on weak references).} ] @defproc[(make-ephemeron [key any/c] [v any/c]) ephemeron?]{ Returns a new @tech{ephemeron} whose key is @racket[key] and whose value is initially @racket[v].} @defproc[(ephemeron-value [ephemeron ephemeron?] [gced-v any/c #f]) any/c]{ Returns the value contained in @racket[ephemeron]. If the garbage collector has proven that the key for @racket[ephemeron] is only weakly reachable, then the result is @racket[gced-v] (which defaults to @racket[#f]).} @defproc[(ephemeron? [v any/c]) boolean?]{ Returns @racket[#t] if @racket[v] is an @tech{ephemeron}, @racket[#f] otherwise.} @;------------------------------------------------------------------------ @section[#:tag "willexecutor"]{Wills and Executors} A @deftech{will executor} manages a collection of values and associated @deftech{will} procedures (a.k.a. @deftech{finalizers}). The @tech{will} procedure for each value is ready to be executed when the value has been proven (by the garbage collector) to be unreachable, except through weak references (see @secref["weakbox"]) or as the registrant for other will executors. A @tech{will} is useful for triggering clean-up actions on data associated with an unreachable value, such as closing a port embedded in an object when the object is no longer used. Calling the @racket[will-execute] or @racket[will-try-execute] procedure executes a will that is ready in the specified will executor. A will executor is also a @tech{synchronizable event}, so @racket[sync] or @racket[sync/timeout] can be used to detect when a will executor has ready wills. Wills are not executed automatically, because certain programs need control to avoid race conditions. However, a program can create a thread whose sole job is to execute wills for a particular executor. If a value is registered with multiple wills (in one or multiple executors), the wills are readied in the reverse order of registration. Since readying a will procedure makes the value reachable again, the will must be executed and the value must be proven again unreachable through only weak references before another of the wills is readied or executed. However, wills for distinct unreachable values are readied at the same time, regardless of whether the values are reachable from each other. A will executor's register is held non-weakly until after the corresponding will procedure is executed. Thus, if the content value of a weak box (see @secref["weakbox"]) is registered with a will executor, the weak box's content is not changed to @racket[#f] until all wills have been executed for the value and the value has been proven again reachable through only weak references. A will executor can be used as a @tech{synchronizable event} (see @secref["sync"]). A will executor is @tech{ready for synchronization} when @racket[will-execute] would not block; @resultItself{will executor}. These examples show how to run cleanup actions when no synchronization is necessary. It simply runs the registered executors as they become ready in another thread. @mz-examples[(define an-executor (make-will-executor)) (eval:alts (void (thread (λ () (let loop () (will-execute an-executor) (loop))))) (void)) (define (executor-proc v) (printf "a-box is now garbage\n")) (define a-box-to-track (box #f)) (will-register an-executor a-box-to-track executor-proc) (eval:alts (collect-garbage) (void)) (set! a-box-to-track #f) (eval:alts (collect-garbage) (executor-proc 'random-junk))] @defproc[(make-will-executor) will-executor?]{ Returns a new will executor with no managed values.} @defproc[(will-executor? [v any/c]) boolean?]{ Returns @racket[#t] if @racket[v] is a will executor, @racket[#f] otherwise.} @defproc[(will-register [executor will-executor?] [v any/c] [proc (any/c . -> . any)]) void?]{ Registers the value @racket[v] with the will procedure @racket[proc] in the will executor @racket[executor]. When @racket[v] is proven unreachable, then the procedure @racket[proc] is ready to be called with @racket[v] as its argument via @racket[will-execute] or @racket[will-try-execute]. The @racket[proc] argument is strongly referenced until the will procedure is executed.} @defproc[(will-execute [executor will-executor?]) any]{ Invokes the will procedure for a single ``unreachable'' value registered with the executor @racket[executor]. The values returned by the will procedure are the result of the @racket[will-execute] call. If no will is ready for immediate execution, @racket[will-execute] blocks until one is ready.} @defproc[(will-try-execute [executor any/c]) any]{ Like @racket[will-execute] if a will is ready for immediate execution. Otherwise, @racket[#f] is returned.} @;------------------------------------------------------------------------ @section[#:tag "garbagecollection"]{Garbage Collection} Set the @as-index{@envvar{PLTDISABLEGC}} environment variable (to any value) before Racket starts to disable @tech{garbage collection}. Set the @as-index{@envvar{PLT_INCREMENTAL_GC}} environment variable to a value that starts with @litchar{1}, @litchar{y}, or @litchar{Y} to request incremental mode at all times, but calling @racket[(collect-garbage 'incremental)] in a program with a periodic task is generally a better mechanism for requesting incremental mode. In Racket 3m (the main variant of Racket), each garbage collection logs a message (see @secref["logging"]) at the @racket['debug] level with topic @racket['GC]. The data portion of the message is an instance of a @indexed-racket[gc-info] @tech{prefab} structure type with 10 fields as follows, but future versions of Racket may use a @racket[gc-info] @tech{prefab} structure with additional fields: @racketblock[ (struct gc-info (mode pre-amount pre-admin-amount code-amount post-amount post-admin-amount start-process-time end-process-time start-time end-time) #:prefab) ] @itemlist[ @item{The @racket[mode] field is a symbol @racket['major], @racket['minor], or @racket['incremental]; @racket['major] indicates a collection that inspects all memory, @racket['minor] indicates collection that mostly inspects just recent allocations, and @racket['incremental] indicates a minor collection that performs extra work toward the next major collection. @history[#:changed "6.3.0.7" @elem{Changed first field from a boolean (@racket[#t] for @racket['major], @racket[#f] for @racket['minor]) to a mode symbol.}]} @item{The @racket[pre-amount] field reports place-local memory use (i.e., not counting the memory use of child places) in bytes at the time that the @tech{garbage collection} started. Additional bytes registered via @racket[make-phantom-bytes] are included.} @item{The @racket[pre-admin-amount] is a larger number that includes memory use for the garbage collector's overhead, such as space on memory pages that are mapped but not currently used.} @item{The @racket[code-amount] field reports additional memory use for generated native code (which is the same just before and after a garbage collection, since it is released via finalization).} @item{The @racket[post-amount] and @racket[post-admin-amount] fields correspond to @racket[pre-amount] and @racket[pre-admin-amount], but after garbage collection. In typical configurations, the difference between @racket[post-amount] and @racket[pre-amount] contributes to @racket[post-admin-amount], since reclaimed pages tend to stay in reserve with the expectation that they'll be needed again (but the pages are released if multiple collections pass without need for the pages).} @item{The @racket[start-process-time] and @racket[end-process-time] fields report processor time (in the sense of @racket[current-process-milliseconds]) at the start and end of garbage collection. The difference between the times is the processor time consumed by collection.} @item{The @racket[start-time] and @racket[end-time] fields report real time (in the sense of @racket[current-inexact-milliseconds]) at the start and end of garbage collection. The difference between the times is the real time consumed by garbage collection.} ] The format of the logged message's text is subject to change. Currently, after a prefix that indicates the @tech{place} and collection mode, the text has the format @nested[#:style 'inset]{ @tt{@nonterm{used}(@nonterm{admin})[@nonterm{code}]; @; free @nonterm{reclaimed}(@nonterm{adjust}) @nonterm{elapsed} @"@" @nonterm{timestamp}} @tabular[ #:sep (hspace 1) (list (list @nonterm{used} @elem{Collectable memory in use just prior to garbage collection}) (list @nonterm{admin} @elem{Additional memory used as to manage collectable memory}) (list @nonterm{code} @elem{Additional memory used for generated machine code}) (list @nonterm{reclaimed} @elem{Collectable memory reclaimed by garbage collection}) (list @nonterm{adjust} @elem{Negation of change to administrative memory minus @nonterm{reclaimed}}) (list @nonterm{elapsed} @elem{Processor time used to perform garbage collection}) (list @nonterm{timestamp} @elem{Processor time since startup of garbage collection's start})) ]} @history[#:changed "6.3.0.7" @elem{Added @envvar{PLT_INCREMENTAL_GC}.}] @defproc[(collect-garbage [request (or/c 'major 'minor 'incremental) 'major]) void?]{ Requests an immediate @tech{garbage collection} or requests a garbage-collection mode, depending on @racket[request]: @itemlist[ @item{@racket['major] --- Forces a ``major'' collection, which inspects all memory. Some effectively unreachable data may remain uncollected, because the collector cannot prove that it is unreachable. This mode of @racket[collect-garbage] procedure provides some control over the timing of collections, but garbage will obviously be collected even if this procedure is never called---unless garbage collection is disabled by setting @envvar{PLTDISABLEGC}.} @item{@racket['minor] --- Requests a ``minor'' collection, which mostly inspects only recent allocations. If minor collection is not supported (e.g., when @racket[(system-type 'gc)] returns @racket['cgc]) or if the next collection must be a major collection, no collection is performed. More generally, minor collections triggered by @racket[(collect-garbage 'minor)] do not cause major collections any sooner than they would occur otherwise.} @item{@racket['incremental] --- Requests that each minor collection performs incremental work toward a major collection (but does not request an immediate minor collection). This incremental-mode request expires at the next major collection. The intent of incremental mode is to significantly reduce pause times due to major collections, but incremental mode typically implies longer minor-collection times and higher memory use. If the @envvar{PLT_INCREMENTAL_GC} environment variable's value starts with @litchar{0}, @litchar{n}, or @litchar{N} on start-up, then incremental-mode requests are ignored.} ] @history[#:changed "6.3" @elem{Added the @racket[request] argument.} #:changed "6.3.0.2" @elem{Added @racket['incremental] mode.}]} @defproc[(current-memory-use [mode (or/c #f 'cumulative custodian?) #f]) exact-nonnegative-integer?]{ Returns information about memory use: @itemlist[ @item{If @racket[mode] is @racket[#f] (the default), the result is an estimate of the number of bytes reachable from any custodian.} @item{If @racket[mode] is @racket['cumulative], returns an estimate of the total number of bytes allocated since start up, including bytes that have since been reclaimed by garbage collection.} @item{If @racket[mode] is a custodian, returns an estimate of the number of bytes of memory occupied by reachable data from @racket[mode]. This estimate is calculated by the last garbage collection, and can be 0 if none occurred (or if none occurred since the given custodian was created). The @racket[current-memory-use] function does @italic{not} perform a collection by itself; doing one before the call will generally decrease the result (or increase it from 0 if no collections happened yet). When Racket is compiled without support for memory accounting, the estimate is the same as when @racket[mode] is @racket[#f] (i.e., all memory) for any individual custodian. See also @racket[custodian-memory-accounting-available?].} ] See also @racket[vector-set-performance-stats!]. @history[#:changed "6.6.0.3" @elem{Added @racket['cumulative] mode.}]} @defproc[(dump-memory-stats [v any/c] ...) any]{ Dumps information about memory usage to the low-level error port or console. Various combinations of @racket[v] arguments can control the information in a dump. The information that is available depends on your Racket build; check the end of a dump from a particular build to see if it offers additional information; otherwise, all @racket[v]s are ignored.} @;------------------------------------------------------------------------ @section[#:tag "phantom-bytes"]{Phantom Byte Strings} A @deftech{phantom byte string} is a small Racket value that is treated by the Racket memory manager as having an arbitrary size, which is specified when the @tech{phantom byte string} is created or when it is changed via @racket[set-phantom-bytes!]. A @tech{phantom byte string} acts as a hint to Racket's memory manager that memory is allocated within the process but through a separate allocator, such as through a foreign library that is accessed via @racketmodname[ffi/unsafe]. This hint is used to trigger @tech{garbage collections} or to compute the result of @racket[current-memory-use]. Currently, the hint is used only in Racket 3m (the main variant of Racket). @defproc[(phantom-bytes? [v any/c]) boolean?]{ Returns @racket[#t] if @racket[v] is a @tech{phantom byte string}, @racket[#f] otherwise.} @defproc[(make-phantom-bytes [k exact-nonnegative-integer?]) phantom-bytes?]{ Creates a @tech{phantom byte string} that is treated by the Racket memory manager as being @racket[k] bytes in size. For a large enough @racket[k], the @exnraise[exn:fail:out-of-memory]---either because the size is implausibly large, or because a memory limit has been installed with @racket[custodian-limit-memory].} @defproc[(set-phantom-bytes! [phantom-bstr phantom-bytes?] [k exact-nonnegative-integer?]) phantom-bytes?]{ Adjusts the size of a @tech{phantom byte string} as it is treated by the Racket memory manager. For example, if the memory that @racket[phantom-bstr] represents is released through a foreign library, then @racket[(set-phantom-bytes! phantom-bstr 0)] can reflect the change in memory use. When @racket[k] is larger than the current size of @racket[phantom-bstr], then this function can raise @racket[exn:fail:out-of-memory], like @racket[make-phantom-bytes].}