
This commit merges changes that were developed in the "racket7" repo. See that repo (which is no longer modified) for a more fine-grained change history. The commit includes experimental support for running Racket on Chez Scheme, but that "CS" variant is not built by default.
449 lines
19 KiB
Racket
449 lines
19 KiB
Racket
#lang scribble/manual
|
|
@(require "mz.rkt"
|
|
scribble/bnf)
|
|
|
|
@title[#:tag "memory" #:style 'toc]{Memory Management}
|
|
|
|
@local-table-of-contents[]
|
|
|
|
@section[#:tag "weakbox"]{Weak Boxes}
|
|
|
|
A @deftech{weak box} is similar to a normal box (see
|
|
@secref["boxes"]), but when the garbage collector (see
|
|
@secref["gc-model"]) can prove that the content value of a weak box is
|
|
only reachable via weak references, the content of the weak box is
|
|
replaced with @racket[#f]. A @defterm{@tech{weak reference}} is a
|
|
reference through a weak box, through a key reference in a weak hash
|
|
table (see @secref["hashtables"]), through a value in an @tech{ephemeron}
|
|
where the value can be replaced by @racket[#f] (see
|
|
@secref["ephemerons"]), or through a custodian (see
|
|
@secref["custodians"]).
|
|
|
|
@defproc[(make-weak-box [v any/c]) weak-box?]{
|
|
|
|
Returns a new weak box that initially contains @racket[v].}
|
|
|
|
|
|
@defproc[(weak-box-value [weak-box weak-box?] [gced-v any/c #f]) any/c]{
|
|
|
|
Returns the value contained in @racket[weak-box]. If the garbage
|
|
collector has proven that the previous content value of
|
|
@racket[weak-box] was reachable only through a weak reference, then
|
|
@racket[gced-v] (which defaults to @racket[#f]) is returned.}
|
|
|
|
@defproc[(weak-box? [v any/c]) boolean?]{
|
|
|
|
Returns @racket[#t] if @racket[v] is a weak box, @racket[#f] otherwise.}
|
|
|
|
@;------------------------------------------------------------------------
|
|
@section[#:tag "ephemerons"]{Ephemerons}
|
|
|
|
An @deftech{ephemeron} @cite{Hayes97} is a generalization of a
|
|
@tech{weak box} (see @secref["weakbox"]). Instead of just containing
|
|
one value, an ephemeron holds two values: one that is considered the
|
|
value of the ephemeron and another that is the ephemeron's key. Like
|
|
the value in a weak box, the value in an ephemeron may be replaced by
|
|
@racket[#f], but when the @emph{key} is no longer reachable (except
|
|
possibly via weak references) instead of when the value is no longer
|
|
reachable.
|
|
|
|
As long as an ephemeron's value is retained, the reference is
|
|
considered a non-weak reference. References to the key via the value
|
|
are treated specially, however, in that the reference does not
|
|
necessarily count toward the key's reachability. A @tech{weak box} can
|
|
be seen as a specialization of an ephemeron where the key and value
|
|
are the same.
|
|
|
|
One particularly common use of ephemerons is to combine them with a
|
|
weak hash table (see @secref["hashtables"]) to produce a mapping where
|
|
the memory manager can reclaim key--value pairs even when the value
|
|
refers to the key. A related use is to retain a reference to a value
|
|
as long as any value for which it is an @tech{impersonator} is
|
|
reachable; see @racket[impersonator-ephemeron].
|
|
|
|
More precisely,
|
|
@itemize[
|
|
|
|
@item{the value in an ephemeron is replaced
|
|
by @racket[#f] when the automatic memory manager can prove that
|
|
either the ephemeron or the key is reachable only through weak
|
|
references (see @secref["weakbox"]); and}
|
|
|
|
@item{nothing reachable from the value in an ephemeron counts toward
|
|
the reachability of an ephemeron key (whether for the same ephemeron
|
|
or another), unless the same value is reachable through a non-weak
|
|
reference, or unless the value's ephemeron key is reachable through a
|
|
non-weak reference (see @secref["weakbox"] for information on weak
|
|
references).}
|
|
|
|
]
|
|
|
|
|
|
@defproc[(make-ephemeron [key any/c] [v any/c]) ephemeron?]{
|
|
|
|
Returns a new @tech{ephemeron} whose key is @racket[key] and whose
|
|
value is initially @racket[v].}
|
|
|
|
|
|
@defproc[(ephemeron-value [ephemeron ephemeron?] [gced-v any/c #f]) any/c]{
|
|
|
|
Returns the value contained in @racket[ephemeron]. If the garbage
|
|
collector has proven that the key for @racket[ephemeron] is only
|
|
weakly reachable, then the result is @racket[gced-v] (which defaults to @racket[#f]).}
|
|
|
|
|
|
@defproc[(ephemeron? [v any/c]) boolean?]{
|
|
|
|
Returns @racket[#t] if @racket[v] is an @tech{ephemeron}, @racket[#f]
|
|
otherwise.}
|
|
|
|
@;------------------------------------------------------------------------
|
|
@section[#:tag "willexecutor"]{Wills and Executors}
|
|
|
|
A @deftech{will executor} manages a collection of values and
|
|
associated @deftech{will} procedures
|
|
(a.k.a. @deftech{finalizers}). The @tech{will} procedure for each
|
|
value is ready to be executed when the value has been proven (by the
|
|
garbage collector) to be unreachable, except through weak references
|
|
(see @secref["weakbox"]) or as the registrant for other will
|
|
executors. A @tech{will} is useful for triggering clean-up actions on
|
|
data associated with an unreachable value, such as closing a port
|
|
embedded in an object when the object is no longer used.
|
|
|
|
Calling the @racket[will-execute] or @racket[will-try-execute]
|
|
procedure executes a will that is ready in the specified will
|
|
executor. A will executor is also a @tech{synchronizable event}, so @racket[sync]
|
|
or @racket[sync/timeout] can be used to detect when a will executor has
|
|
ready wills. Wills are not executed automatically, because certain
|
|
programs need control to avoid race conditions. However, a program can
|
|
create a thread whose sole job is to execute wills for a particular
|
|
executor.
|
|
|
|
If a value is registered with multiple wills (in one or multiple
|
|
executors), the wills are readied in the reverse order of
|
|
registration. Since readying a will procedure makes the value
|
|
reachable again, the will must be executed and the value must be
|
|
proven again unreachable through only weak references before another
|
|
of the wills is readied or executed. However, wills for distinct
|
|
unreachable values are readied at the same time, regardless of whether
|
|
the values are reachable from each other.
|
|
|
|
A will executor's registrant is held non-weakly until after the
|
|
corresponding will procedure is executed. Thus, if the content value
|
|
of a weak box (see @secref["weakbox"]) is registered with a will
|
|
executor, the weak box's content is not changed to @racket[#f] until
|
|
all wills have been executed for the value and the value has been
|
|
proven again reachable through only weak references.
|
|
|
|
A will executor can be used as a @tech{synchronizable event} (see @secref["sync"]).
|
|
A will executor is @tech{ready for synchronization} when
|
|
@racket[will-execute] would not block; @resultItself{will executor}.
|
|
|
|
|
|
These examples show how to run cleanup actions when
|
|
no synchronization is necessary. It simply runs the registered
|
|
executors as they become ready in another thread.
|
|
@mz-examples[(define an-executor (make-will-executor))
|
|
(eval:alts (void
|
|
(thread
|
|
(λ ()
|
|
(let loop ()
|
|
(will-execute an-executor)
|
|
(loop)))))
|
|
(void))
|
|
(define (executor-proc v) (printf "a-box is now garbage\n"))
|
|
(define a-box-to-track (box #f))
|
|
(will-register an-executor a-box-to-track executor-proc)
|
|
(eval:alts (collect-garbage) (void))
|
|
(set! a-box-to-track #f)
|
|
(eval:alts (collect-garbage) (executor-proc 'random-junk))]
|
|
|
|
|
|
@defproc[(make-will-executor) will-executor?]{
|
|
|
|
Returns a new will executor with no managed values.}
|
|
|
|
|
|
@defproc[(will-executor? [v any/c]) boolean?]{
|
|
|
|
Returns @racket[#t] if @racket[v] is a will executor, @racket[#f]
|
|
otherwise.}
|
|
|
|
|
|
@defproc[(will-register [executor will-executor?] [v any/c] [proc (any/c . -> . any)])
|
|
void?]{
|
|
|
|
Registers the value @racket[v] with the will procedure @racket[proc]
|
|
in the will executor @racket[executor]. When @racket[v] is proven
|
|
unreachable, then the procedure @racket[proc] is ready to be called
|
|
with @racket[v] as its argument via @racket[will-execute] or
|
|
@racket[will-try-execute]. The @racket[proc] argument is strongly
|
|
referenced until the will procedure is executed.}
|
|
|
|
|
|
@defproc[(will-execute [executor will-executor?]) any]{
|
|
|
|
Invokes the will procedure for a single ``unreachable'' value
|
|
registered with the executor @racket[executor]. The values returned
|
|
by the will procedure are the result of the @racket[will-execute]
|
|
call. If no will is ready for immediate execution,
|
|
@racket[will-execute] blocks until one is ready.}
|
|
|
|
|
|
@defproc[(will-try-execute [executor any/c]) any]{
|
|
|
|
Like @racket[will-execute] if a will is ready for immediate
|
|
execution. Otherwise, @racket[v] is returned.
|
|
|
|
@history[#:changed "6.90.0.4" @elem{Added the @racket[v] argument.}]}
|
|
|
|
@;------------------------------------------------------------------------
|
|
@section[#:tag "garbagecollection"]{Garbage Collection}
|
|
|
|
Set the @as-index{@envvar{PLTDISABLEGC}} environment variable (to any
|
|
value) before Racket starts to disable @tech{garbage collection}. Set
|
|
the @as-index{@envvar{PLT_INCREMENTAL_GC}} environment variable
|
|
to a value that starts with @litchar{1}, @litchar{y}, or @litchar{Y} to
|
|
request incremental mode at all times, but calling
|
|
@racket[(collect-garbage 'incremental)] in a program with a periodic
|
|
task is generally a better mechanism for requesting incremental mode.
|
|
|
|
In Racket 3m (the main variant of Racket), each garbage collection
|
|
logs a message (see @secref["logging"]) at the @racket['debug] level with topic @racket['GC].
|
|
The data portion of the message is an instance of a @indexed-racket[gc-info]
|
|
@tech{prefab} structure type with 10 fields as follows, but future
|
|
versions of Racket may use a @racket[gc-info] @tech{prefab} structure
|
|
with additional fields:
|
|
|
|
@racketblock[
|
|
(struct gc-info (mode pre-amount pre-admin-amount code-amount
|
|
post-amount post-admin-amount
|
|
start-process-time end-process-time
|
|
start-time end-time)
|
|
#:prefab)
|
|
]
|
|
|
|
@itemlist[
|
|
|
|
@item{The @racket[mode] field is a symbol @racket['major],
|
|
@racket['minor], or @racket['incremental]; @racket['major]
|
|
indicates a collection that inspects all memory,
|
|
@racket['minor] indicates collection that mostly inspects just
|
|
recent allocations, and @racket['incremental] indicates a minor
|
|
collection that performs extra work toward the next major
|
|
collection.
|
|
|
|
@history[#:changed "6.3.0.7" @elem{Changed first field from a
|
|
boolean (@racket[#t] for
|
|
@racket['major], @racket[#f]
|
|
for @racket['minor]) to a
|
|
mode symbol.}]}
|
|
|
|
@item{The @racket[pre-amount] field reports place-local memory use
|
|
(i.e., not counting the memory use of child places) in bytes at
|
|
the time that the @tech{garbage collection} started. Additional
|
|
bytes registered via @racket[make-phantom-bytes] are included.}
|
|
|
|
@item{The @racket[pre-admin-amount] is a larger number that includes
|
|
memory use for the garbage collector's overhead, such as space
|
|
on memory pages that are mapped but not currently used.}
|
|
|
|
@item{The @racket[code-amount] field reports additional memory use
|
|
for generated native code (which is the same just before and
|
|
after a garbage collection, since it is released via
|
|
finalization).}
|
|
|
|
@item{The @racket[post-amount] and @racket[post-admin-amount] fields
|
|
correspond to @racket[pre-amount] and
|
|
@racket[pre-admin-amount], but after garbage collection. In
|
|
typical configurations, the difference between
|
|
@racket[post-amount] and @racket[pre-amount] contributes to
|
|
@racket[post-admin-amount], since reclaimed pages tend to stay
|
|
in reserve with the expectation that they'll be needed again
|
|
(but the pages are released if multiple collections pass
|
|
without need for the pages).}
|
|
|
|
@item{The @racket[start-process-time] and @racket[end-process-time]
|
|
fields report processor time (in the sense of
|
|
@racket[current-process-milliseconds]) at the start and end of
|
|
garbage collection. The difference between the times is the
|
|
processor time consumed by collection.}
|
|
|
|
@item{The @racket[start-time] and @racket[end-time] fields report
|
|
real time (in the sense of
|
|
@racket[current-inexact-milliseconds]) at the start and end of
|
|
garbage collection. The difference between the times is the
|
|
real time consumed by garbage collection.}
|
|
|
|
]
|
|
|
|
The format of the logged message's text is subject to change.
|
|
Currently, after a prefix that indicates the @tech{place} and
|
|
collection mode, the text has the format
|
|
|
|
@nested[#:style 'inset]{
|
|
@tt{@nonterm{used}(@nonterm{admin})[@nonterm{code}]; @;
|
|
free @nonterm{reclaimed}(@nonterm{adjust}) @nonterm{elapsed} @"@" @nonterm{timestamp}}
|
|
|
|
@tabular[
|
|
#:sep (hspace 1)
|
|
(list (list @nonterm{used}
|
|
@elem{Collectable memory in use just prior to garbage collection})
|
|
(list @nonterm{admin}
|
|
@elem{Additional memory used as to manage collectable memory})
|
|
(list @nonterm{code}
|
|
@elem{Additional memory used for generated machine code})
|
|
(list @nonterm{reclaimed}
|
|
@elem{Collectable memory reclaimed by garbage collection})
|
|
(list @nonterm{adjust}
|
|
@elem{Negation of change to administrative memory minus @nonterm{reclaimed}})
|
|
(list @nonterm{elapsed}
|
|
@elem{Processor time used to perform garbage collection})
|
|
(list @nonterm{timestamp}
|
|
@elem{Processor time since startup of garbage collection's start}))
|
|
]}
|
|
|
|
@history[#:changed "6.3.0.7" @elem{Added @envvar{PLT_INCREMENTAL_GC}.}]
|
|
|
|
|
|
@defproc[(collect-garbage [request (or/c 'major 'minor 'incremental) 'major]) void?]{
|
|
|
|
Requests an immediate @tech{garbage collection} or requests a
|
|
garbage-collection mode, depending on @racket[request]:
|
|
|
|
@itemlist[
|
|
|
|
@item{@racket['major] --- Forces a ``major'' collection, which
|
|
inspects all memory. Some effectively unreachable data may
|
|
remain uncollected, because the collector cannot prove that it
|
|
is unreachable.
|
|
|
|
This mode of @racket[collect-garbage] procedure provides some
|
|
control over the timing of collections, but garbage will
|
|
obviously be collected even if this procedure is never
|
|
called---unless garbage collection is disabled by setting
|
|
@envvar{PLTDISABLEGC}.}
|
|
|
|
@item{@racket['minor] --- Requests a ``minor'' collection, which
|
|
mostly inspects only recent allocations. If minor collection is
|
|
not supported (e.g., when @racket[(system-type 'gc)] returns
|
|
@racket['cgc]) or if the next collection must be a major
|
|
collection, no collection is performed. More generally, minor collections
|
|
triggered by @racket[(collect-garbage 'minor)] do not cause
|
|
major collections any sooner than they would occur otherwise.}
|
|
|
|
@item{@racket['incremental] --- Requests that each minor
|
|
collection performs incremental work toward a major collection
|
|
(but does not request an immediate minor collection).
|
|
This incremental-mode request expires at the next major
|
|
collection.
|
|
|
|
The intent of incremental mode is to significantly reduce pause
|
|
times due to major collections, but incremental mode typically
|
|
implies longer minor-collection times and higher memory use.
|
|
|
|
If the @envvar{PLT_INCREMENTAL_GC} environment variable's value
|
|
starts with @litchar{0}, @litchar{n}, or @litchar{N} on
|
|
start-up, then incremental-mode requests are ignored.}
|
|
|
|
]
|
|
|
|
@history[#:changed "6.3" @elem{Added the @racket[request] argument.}
|
|
#:changed "6.3.0.2" @elem{Added @racket['incremental] mode.}]}
|
|
|
|
|
|
@defproc[(current-memory-use [mode (or/c #f 'cumulative custodian?) #f])
|
|
exact-nonnegative-integer?]{
|
|
|
|
Returns information about memory use:
|
|
|
|
@itemlist[
|
|
|
|
@item{If @racket[mode] is @racket[#f] (the default), the result is an
|
|
estimate of the number of bytes reachable from any custodian.}
|
|
|
|
@item{If @racket[mode] is @racket['cumulative], returns an estimate
|
|
of the total number of bytes allocated since start up,
|
|
including bytes that have since been reclaimed by garbage
|
|
collection.}
|
|
|
|
@item{If @racket[mode] is a custodian, returns an estimate of the
|
|
number of bytes of memory occupied by reachable data from
|
|
@racket[mode]. This estimate is calculated by the last garbage
|
|
collection, and can be 0 if none occurred (or if none occurred
|
|
since the given custodian was created). The
|
|
@racket[current-memory-use] function does @italic{not} perform
|
|
a collection by itself; doing one before the call will
|
|
generally decrease the result (or increase it from 0 if no
|
|
collections happened yet).
|
|
|
|
When Racket is compiled without support for memory accounting,
|
|
the estimate is the same as when @racket[mode] is @racket[#f]
|
|
(i.e., all memory) for any individual custodian. See also
|
|
@racket[custodian-memory-accounting-available?].}
|
|
|
|
]
|
|
|
|
See also @racket[vector-set-performance-stats!].
|
|
|
|
@history[#:changed "6.6.0.3" @elem{Added @racket['cumulative] mode.}]}
|
|
|
|
|
|
@defproc[(dump-memory-stats [v any/c] ...) any]{
|
|
|
|
Dumps information about memory usage to the low-level error port
|
|
or console.
|
|
|
|
Various combinations of @racket[v] arguments can control the
|
|
information in a dump. The information that is available depends on
|
|
your Racket build; check the end of a dump from a particular build to
|
|
see if it offers additional information; otherwise, all @racket[v]s are
|
|
ignored.}
|
|
|
|
@;------------------------------------------------------------------------
|
|
@section[#:tag "phantom-bytes"]{Phantom Byte Strings}
|
|
|
|
A @deftech{phantom byte string} is a small Racket value that is
|
|
treated by the Racket memory manager as having an arbitrary size,
|
|
which is specified when the @tech{phantom byte string} is created or
|
|
when it is changed via @racket[set-phantom-bytes!].
|
|
|
|
A @tech{phantom byte string} acts as a hint to Racket's memory
|
|
manager that memory is allocated within the process but through a
|
|
separate allocator, such as through a foreign library that is accessed
|
|
via @racketmodname[ffi/unsafe]. This hint is used to trigger
|
|
@tech{garbage collections} or to compute the result of
|
|
@racket[current-memory-use]. Currently, the hint is used only in
|
|
Racket 3m (the main variant of Racket).
|
|
|
|
@defproc[(phantom-bytes? [v any/c]) boolean?]{
|
|
|
|
Returns @racket[#t] if @racket[v] is a @tech{phantom byte string},
|
|
@racket[#f] otherwise.}
|
|
|
|
|
|
@defproc[(make-phantom-bytes [k exact-nonnegative-integer?])
|
|
phantom-bytes?]{
|
|
|
|
Creates a @tech{phantom byte string} that is treated by the Racket
|
|
memory manager as being @racket[k] bytes in size. For a large enough
|
|
@racket[k], the @exnraise[exn:fail:out-of-memory]---either because the
|
|
size is implausibly large, or because a memory limit has been
|
|
installed with @racket[custodian-limit-memory].}
|
|
|
|
|
|
@defproc[(set-phantom-bytes! [phantom-bstr phantom-bytes?]
|
|
[k exact-nonnegative-integer?])
|
|
phantom-bytes?]{
|
|
|
|
Adjusts the size of a @tech{phantom byte string} as it is treated by
|
|
the Racket memory manager.
|
|
|
|
For example, if the memory that @racket[phantom-bstr] represents is
|
|
released through a foreign library, then @racket[(set-phantom-bytes!
|
|
phantom-bstr 0)] can reflect the change in memory use.
|
|
|
|
When @racket[k] is larger than the current size of
|
|
@racket[phantom-bstr], then this function can raise
|
|
@racket[exn:fail:out-of-memory], like @racket[make-phantom-bytes].}
|