add missing doc files

svn: r18401
2010-03-01 01:45:49 +00:00 · 2010-03-01 01:45:49 +00:00 · 2ddfa89a7a
commit 2ddfa89a7a
parent 57ab0dee65
2 changed files with 267 additions and 0 deletions
--- a/collects/scribblings/guide/futures.scrbl
+++ b/collects/scribblings/guide/futures.scrbl
@ -0,0 +1,182 @@
 #lang scribble/doc
@(require scribble/manual
          "guide-utils.ss"
          (for-label scheme/flonum scheme/future))
@title[#:tag "effective-futures"]{Parallelism with Futures}
 The @schememodname[scheme/future] library provides support for
 performance improvement through parallelism with the @scheme[future]
 and @scheme[touch] functions. The level of parallelism available from
 those constructs, however, is limited by several factors, and the
 current implementation is best suited to numerical tasks.
@margin-note{Other functions, such as @scheme[thread], support the
 creation of reliably concurrent tasks. However, thread never run truly
 in parallel, even if the hardware and operating system support
 parallelism.}
 As a starting example, the @scheme[any-double?] function below takes a
 list of numbers and determines whether any number in the list has a
 double that is also in the list:
@schemeblock[
 (define (any-double? l)
  (for/or ([i (in-list l)])
    (for/or ([i2 (in-list l)])
      (= i2 (* 2 i)))))
 ]
 This function runs in quadratic time, so it can take a long time (on
 the order of a second) on large lists like @scheme[l1] and
@scheme[l2]:
@schemeblock[
 (define l1 (for/list ([i (in-range 5000)]) 
             (+ (* 2 i) 1)))
 (define l2 (for/list ([i (in-range 5000)]) 
             (- (* 2 i) 1)))
 (or (any-double? l1)
    (any-double? l2))
 ]
 The best way to speed up @scheme[any-double?]  is to use a different
 algorithm. However, on a machine that offers at least two processing
 units, the example above can run in about half the time using
@scheme[future] and @scheme[touch]:
@schemeblock[
 (let ([f (future (lambda () (any-double? l2)))])
  (or (any-double? l1)
      (touch f)))
 ]
 The future @scheme[f] runs @scheme[(any-double? l2)] in parallel to
@scheme[(any-double? l1)], and the result for @scheme[(any-double?
 l2)] becomes available about the same time that it is demanded by
@scheme[(touch f)].
 Futures run in parallel as long as they can do so safely, but the
 notion of ``safe'' for parallelism is inherently tied to the system
 implementation. The distinction between ``safe'' and ``unsafe''
 operations may be far from apparent at the level of a Scheme program.
 Consider the following core of a Mandelbrot-set computation:
@schemeblock[
 (define (mandelbrot iterations x y n)
  (let ((ci (- (/ (* 2.0 y) n) 1.0))
        (cr (- (/ (* 2.0 x) n) 1.5)))
    (let loop ((i 0) (zr 0.0) (zi 0.0))
      (if (> i iterations)
          i
          (let ((zrq (* zr zr)) 
                (ziq (* zi zi)))
            (cond
             ((> (+ zrq ziq) 4.0) i)
             (else (loop (add1 i) 
                         (+ (- zrq ziq) cr) 
                         (+ (* 2.0 zr zi) ci)))))))))
 ]
 The expressions @scheme[(mandelbrot 10000000 62 500 1000)] and
@scheme[(mandelbrot 10000000 62 501 1000)] each take a while to
 produce an answer. Computing them both, of course, takes twice as
 long:
@schemeblock[
 (list (mandelbrot 10000000 62 500 1000)
      (mandelbrot 10000000 62 501 1000))
 ]
 Unfortunately, attempting to run the two computations in parallel with
@scheme[future] does not improve performance:
@schemeblock[
 (let ([f (future (lambda () (mandelbrot 10000000 62 501 1000)))])
   (list (mandelbrot 10000000 62 500 1000)
         (touch f)))
 ]
 One problem is that the @scheme[*] and @scheme[/] operations in the
 first two lines of @scheme[mandelbrot] involve a mixture of exact and
 inexact real numbers. Such mixtures typically trigger a slow path in
 execution, and the general slow path is not safe for
 parallelism. Consequently, the future created in this example is
 almost immediately suspended, and it cannot resume until
@scheme[touch] is called.
 Changing the first two lines of @scheme[mandelbrot] addresses that
 first the problem:
@schemeblock[
 (define (mandelbrot iterations x y n)
  (let ((ci (- (/ (* 2.0 (->fl y)) (->fl n)) 1.0))
        (cr (- (/ (* 2.0 (->fl x)) (->fl n)) 1.5)))
    ....))
 ]
 With that change, @scheme[mandelbrot] computations can run in
 parallel. Nevertheless, performance still does not improve. The
 problem is that most every arithmetic operation in this example
 produces an inexact number whose storage must be allocated. Especially
 frequent allocation triggers communication between parallel tasks that
 defeats any performance improvement.
 By using @tech{flonum}-specific operations (see
@secref["fixnums+flonums"]), we can re-write @scheme[mandelbot] to use
 much less allocation:
@schemeblock[
 (define (mandelbrot iterations x y n)
  (let ((ci (fl- (fl/ (* 2.0 (->fl y)) (->fl n)) 1.0))
        (cr (fl- (fl/ (* 2.0 (->fl x)) (->fl n)) 1.5)))
    (let loop ((i 0) (zr 0.0) (zi 0.0))
      (if (> i iterations)
          i
          (let ((zrq (fl* zr zr)) 
                (ziq (fl* zi zi)))
            (cond
             ((fl> (fl+ zrq ziq) 4.0) i)
             (else (loop (add1 i) 
                         (fl+ (fl- zrq ziq) cr) 
                         (fl+ (fl* 2.0 (fl* zr zi)) ci)))))))))
 ]
 This conversion can speed @scheme[mandelbrot] by a factor of 8, even
 in sequential mode, but avoiding allocation also allows
@scheme[mandelbrot] to run usefully faster in parallel.
 As a general guideline, any operation that is inlined by the
@tech{JIT} compiler runs safely in parallel, while other operations
 that are not inlined (including all operations if the JIT compiler is
 disabled) are considered unsafe. The @exec{mzc} decompiler tool
 annotates operations that can be inlined by the compiler (see
@secref[#:doc '(lib "scribblings/mzc/mzc.scrbl") "decompile"]), so the
 decompiler can be used to help predict parallel performance.
 To more directly report what is happening in a program that uses
@scheme[future] and @scheme[touch], operations are logged when they
 suspend a computation or synchronize with the main computation.  For
 example, running the original @scheme[mandelbrot] in a future produces
 the following output in the @scheme['debug] log level:
@margin-note{To see @scheme['debug] logging output on stderr, set the
@envvar{PLTSTDERR} environment variable to @tt{debug} or start
@exec{mzscheme} with @Flag{W} @tt{debug}.}
@verbatim[#:indent 2]|{
  future: 0 waiting for runtime at 1267392979341.989: *
 }|
 The message indicates which internal future-running task became
 blocked on an unsafe operation, the time it blocked (in terms of
@scheme[current-inexact-miliseconds]), and the operation that caused
 the computation it to block.
 The first revision to @scheme[mandelbrot] avoids suspending at
@scheme[*], but produces many log entries of the form
@verbatim[#:indent 2]|{
  future: 0 waiting for runtime at 1267392980465.066: [acquire_gc_page]
 }|
--- a/collects/scribblings/reference/futures.scrbl
+++ b/collects/scribblings/reference/futures.scrbl
@ -0,0 +1,85 @@
 #lang scribble/doc
@(require "mz.ss"
          (for-label scheme
                     scheme/base
                     scheme/contract
                     scheme/future))
@(define future-eval (make-base-eval))
@(interaction-eval #:eval future-eval (require scheme/future))
@title[#:tag "futures"]{Futures for Parallelism}
@note-lib[scheme/future]
@margin-note{Currently, parallel support for @scheme[future] is
 enabled by default for Windows, Linux x86/x86_64, and Mac OS X
 x86/x86_64. To enable support for other platforms, use
@DFlag{enable-futures} with @exec{configure} when building PLT
 Scheme.}
 The @scheme[future] and @scheme[touch] functions from
@schememodname[scheme/future] provide access to parallelism as
 supported by the hardware and operation system.
 In contrast to @scheme[thread], which provides concurrency for
 arbitrary computations without parallelism, @scheme[future] provides
 parallelism for limited computations. A future executes its work in
 parallel (assuming that support for parallelism is available) until it
 detects an attempt to perform an operation that is too complex for the
 system to run safely in parallel. Similarly, work in a future is
 suspended if it depends in some way on the current continuation, such
 as raising an exception. A suspended computation for a future is
 resumed when @scheme[touch] is applied to the future descriptor.
 ``Safe'' parallel execution of a future means that all operations
 provided by the system must be able to enforce contracts and produce
 results as documented. ``Safe'' does not preclude concurrent access to
 mutable data that is visible in the program.  For example, a
 computation in a future might use @scheme[set!] to modify a shared
 variable, in which case concurrent assignment to the variable can be
 visible in other futures and threads. Furthermore, guarantees about
 the visibility of effects and ordering are determined by the operating
 system and hardware---which rarely support, for example, the guarantee
 of sequential consistency that is provided for @scheme[thread]-based
 concurrency. At the same time, operations that seem obviously safe may
 have a complex enough implementation internally that they cannot run
 in parallel. See also @guidesecref["effective-futures"].
@deftogether[(
@defproc[(future [thunk (-> any)]) future?]
@defproc[(touch [f future?]) any]
 )]{
 The @scheme[future] procedure returns a future-descriptor value that
 encapsulates @scheme[thunk]. The @scheme[touch] function forces the
 evaluation of the @scheme[thunk] inside the given future, returning
 the values produced by @scheme[thunk]. After @scheme[touch] forces
 the evaluation of a @scheme[thunk], the resulting values are retained
 by the future descriptor in place of @scheme[thunk], and additional
 @scheme[touch]es of the future descriptor return those values.
 Between a call to @scheme[future] and @scheme[touch] for a given
 future, the given @scheme[thunk] may run speculatively in parallel to
 other computations, as described above.
@interaction[
 #:eval future-eval
 (let ([f (future (lambda () (+ 1 2)))])
  (list (+ 3 4) (touch f)))
 ]}
@defproc[(future? [v any/c]) boolean?]{
  Returns @scheme[#t] if @scheme[v] is a future-descriptor value,
  @scheme[#f] otherwise.
 }
@defproc[(processor-count) exact-positive-integer?]{
  Returns the number of parallel computations units (e.g., processors
  or cores) that are available on the current machine.
 }
@; ----------------------------------------------------------------------
@close-eval[future-eval]