racket/collects/scribblings/guide/io.scrbl
Eli Barzilay ac26fe7554 A ton of @scheme*' -> @racket*' and related updates.
Also, updates some of the mzlib files to point at `racket/*' libraries
rather than to `scheme/*' ones.
2011-06-25 04:08:47 -04:00

444 lines
14 KiB
Racket

#lang scribble/doc
@(require scribble/manual scribble/struct scribble/eval racket/system
"guide-utils.rkt"
(for-label racket/tcp racket/serialize racket/port))
@(define io-eval (make-base-eval))
@(define (threecolumn a b c)
(make-table #f
(list (list (make-flow (list a))
(make-flow (list (make-paragraph (list (hspace 1)))))
(make-flow (list b))
(make-flow (list (make-paragraph (list (hspace 1)))))
(make-flow (list c))))))
@(interaction-eval #:eval io-eval (print-hash-table #t))
@title[#:tag "i/o" #:style 'toc]{Input and Output}
A Racket @deftech{port} represents an input or output stream, such as
a file, a terminal, a TCP connection, or an in-memory string. More
specifically, an @defterm{input port} represents a stream from which a
program can read data, and an @defterm{output port} represents a
stream for writing data.
@local-table-of-contents[]
@;------------------------------------------------------------------------
@section[#:tag "ports"]{Varieties of Ports}
Various functions create various kinds of ports. Here are a few
examples:
@itemize[
@;- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
@item{@bold{Files:} The @racket[open-output-file] function opens a
file for writing, and @racket[open-input-file] opens a file for
reading.
@(interaction-eval #:eval io-eval (define old-dir (current-directory)))
@(interaction-eval #:eval io-eval (current-directory (find-system-path 'temp-dir)))
@(interaction-eval #:eval io-eval (when (file-exists? "data") (delete-file "data")))
@examples[
#:eval io-eval
(define out (open-output-file "data"))
(display "hello" out)
(close-output-port out)
(define in (open-input-file "data"))
(read-line in)
(close-input-port in)
]
If a file exists already, then @racket[open-output-file] raises an
exception by default. Supply an option like @racket[#:exists
'truncate] or @racket[#:exists 'update] to re-write or update the
file:
@examples[
#:eval io-eval
(define out (open-output-file "data" #:exists 'truncate))
(display "howdy" out)
(close-output-port out)
]
Instead of having to match @racket[open-input-file] and
@racket[open-output-file] calls, most Racket programmers will instead
use @racket[call-with-output-file], which takes a function to call
with the output port; when the function returns, the port is closed.
@examples[
#:eval io-eval
(call-with-output-file "data"
#:exists 'truncate
(lambda (out)
(display "hello" out)))
(call-with-input-file "data"
(lambda (in)
(read-line in)))
]
@(interaction-eval #:eval io-eval (when (file-exists? "data") (delete-file "data")))
@(interaction-eval #:eval io-eval (current-directory old-dir))}
@;- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
@item{@bold{Strings:} The @racket[open-output-string] function creates
a port that accumulates data into a string, and @racket[get-output-string]
extracts the accumulated string. The @racket[open-input-string] function
creates a port to read from a string.
@examples[
#:eval io-eval
(define p (open-output-string))
(display "hello" p)
(get-output-string p)
(read-line (open-input-string "goodbye\nfarewell"))
]}
@;- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
@item{@bold{TCP Connections:} The @racket[tcp-connect] function
creates both an input port and an output port for the client side of
a TCP communication. The @racket[tcp-listen] function creates a
server, which accepts connections via @racket[tcp-accept].
@examples[
#:eval io-eval
(eval:alts (define server (tcp-listen 12345)) (void))
(eval:alts (define-values (c-in c-out) (tcp-connect "localhost" 12345)) (void))
(eval:alts (define-values (s-in s-out) (tcp-accept server))
(begin (define-values (s-in c-out) (make-pipe))
(define-values (c-in s-out) (make-pipe))))
(display "hello\n" c-out)
(close-output-port c-out)
(read-line s-in)
(read-line s-in)
]}
@;- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
@item{@bold{Process Pipes:} The @racket[subprocess] function runs a new
process at the OS level and returns ports that correspond to the
subprocess's stdin, stdout, and stderr. (The first three arguments
can be certain kinds of existing ports to connect directly to the
subprocess, instead of creating new ports.)
@examples[
#:eval io-eval
(eval:alts
(define-values (p stdout stdin stderr)
(subprocess #f #f #f "/usr/bin/wc" "-w"))
(define-values (p stdout stdin stderr)
(values #f (open-input-string " 3") (open-output-string) (open-input-string ""))))
(display "a b c\n" stdin)
(close-output-port stdin)
(read-line stdout)
(close-input-port stdout)
(close-input-port stderr)
]}
@;- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
@item{@bold{Internal Pipes:} The @racket[make-pipe] function returns
two ports that are ends of a pipe. This kind of pipe is internal to
Racket, and not related to OS-level pipes for communicating between
different processes.
@examples[
#:eval io-eval
(define-values (in out) (make-pipe))
(display "garbage" out)
(close-output-port out)
(read-line in)
]}
]
@;------------------------------------------------------------------------
@section[#:tag "default-ports"]{Default Ports}
For most simple I/O functions, the target port is an optional
argument, and the default is the @defterm{current input port} or
@defterm{current output port}. Furthermore, error messages are written
to the @defterm{current error port}, which is an output port. The
@racket[current-input-port], @racket[current-output-port], and
@racket[current-error-port] functions return the corresponding current
ports.
@examples[
#:eval io-eval
(display "Hi")
(code:line (display "Hi" (current-output-port)) (code:comment @#,t{the same}))
]
If you start the @exec{racket} program in a terminal, then the
current input, output, and error ports are all connected to the
terminal. More generally, they are connected to the OS-level stdin,
stdout, and stderr. In this guide, the examples show output written to
stdout in purple, and output written to stderr in red italics.
@defexamples[
#:eval io-eval
(define (swing-hammer)
(display "Ouch!" (current-error-port)))
(swing-hammer)
]
The current-port functions are actually @tech{parameters}, which means
that their values can be set with @racket[parameterize].
@margin-note{See @secref["parameterize"] for an introduction to parameters.}
@examples[
#:eval io-eval
(let ([s (open-output-string)])
(parameterize ([current-error-port s])
(swing-hammer)
(swing-hammer)
(swing-hammer))
(get-output-string s))
]
@; ----------------------------------------------------------------------
@section[#:tag "read-write"]{Reading and Writing Racket Data}
As noted throughout @secref["datatypes"], Racket provides three
ways to print an instance of a built-in value:
@itemize[
@item{@racket[print], which prints a value in the same way that is it
printed for a @tech{REPL} result; and }
@item{@racket[write], which prints a value in such a way that
@racket[read] on the output produces the value back; and }
@item{@racket[display], which tends to reduce a value to just its
character or byte content---at least for those datatypes that
are primarily about characters or bytes, otherwise it falls
back to the same output as @racket[write].}
]
Here are some examples using each:
@threecolumn[
@interaction[
(print 1/2)
(print #\x)
(print "hello")
(print #"goodbye")
(print '|pea pod|)
(print '("i" pod))
(print write)
]
@interaction[
(write 1/2)
(write #\x)
(write "hello")
(write #"goodbye")
(write '|pea pod|)
(write '("i" pod))
(write write)
]
@interaction[
(display 1/2)
(display #\x)
(display "hello")
(display #"goodbye")
(display '|pea pod|)
(display '("i" pod))
(display write)
]
]
Overall, @racket[print] corresponds to the expression layer of
Racket syntax, @racket[write] corresponds to the reader layer, and
@racket[display] roughly corresponds to the character layer.
The @racket[printf] function supports simple formatting of data and
text. In the format string supplied to @racket[printf], @litchar{~a}
@racket[display]s the next argument, @litchar{~s}
@racket[write]s the next argument, and @litchar{~v}
@racket[print]s the next argument.
@defexamples[
#:eval io-eval
(define (deliver who when what)
(printf "Items ~a for shopper ~s: ~v" who when what))
(deliver '("list") '("John") '("milk"))
]
After using @racket[write], as opposed to @racket[display] or
@racket[print], many forms of data can be read back in using
@racket[read]. The same values @racket[print]ed can also be parsed by
@racket[read], but the result may have extra quote forms, since a
@racket[print]ed form is meant to be read like an expression.
@examples[
#:eval io-eval
(define-values (in out) (make-pipe))
(write "hello" out)
(read in)
(write '("alphabet" soup) out)
(read in)
(write #hash((a . "apple") (b . "banana")) out)
(read in)
(print '("alphabet" soup) out)
(read in)
(display '("alphabet" soup) out)
(read in)
]
@; ----------------------------------------------------------------------
@section[#:tag "serialization"]{Datatypes and Serialization}
@tech{Prefab} structure types (see @secref["prefab-struct"])
automatically support @deftech{serialization}: they can be written to
an output stream, and a copy can be read back in from an input stream:
@interaction[
(define-values (in out) (make-pipe))
(write #s(sprout bean) out)
(read in)
]
Other structure types created by @racket[struct], which offer
more abstraction than @tech{prefab} structure types, normally
@racket[write] either using @racketresultfont{#<....>} notation (for
opaque structure types) or using @racketresultfont{#(....)} vector
notation (for transparent structure types). In neither can the
result be read back in as an instance of the structure type:
@interaction[
(struct posn (x y))
(write (posn 1 2))
(define-values (in out) (make-pipe))
(write (posn 1 2) out)
(read in)
]
@interaction[
(struct posn (x y) #:transparent)
(write (posn 1 2))
(define-values (in out) (make-pipe))
(write (posn 1 2) out)
(define v (read in))
v
(posn? v)
(vector? v)
]
The @racket[serializable-struct] form defines a structure type
that can be @racket[serialize]d to a value that can be printed using
@racket[write] and restored via @racket[read]. The @racket[serialize]d
result can be @racket[deserialize]d to get back an instance of the
original structure type. The serialization form and functions are
provided by the @racketmodname[racket/serialize] library.
@examples[
(require racket/serialize)
(serializable-struct posn (x y) #:transparent)
(deserialize (serialize (posn 1 2)))
(write (serialize (posn 1 2)))
(define-values (in out) (make-pipe))
(write (serialize (posn 1 2)) out)
(deserialize (read in))
]
In addition to the names bound by @racket[struct],
@racket[serializable-struct] binds an identifier with deserialization
information, and it automatically @racket[provide]s the
deserialization identifier from a module context. This deserialization
identifier is accessed reflectively when a value is deserialized.
@; ----------------------------------------------------------------------
@section[#:tag "encodings"]{Bytes, Characters, and Encodings}
Functions like @racket[read-line], @racket[read], @racket[display],
and @racket[write] all work in terms of @tech{characters} (which
correspond to Unicode scalar values). Conceptually, they are
implemented in terms of @racket[read-char] and @racket[write-char].
More primitively, ports read and write @tech{bytes}, instead of
@tech{characters}. The functions @racket[read-byte] and
@racket[write-byte] read and write raw bytes. Other functions, such as
@racket[read-bytes-line], build on top of byte operations instead of
character operations.
In fact, the @racket[read-char] and @racket[write-char] functions are
conceptually implemented in terms of @racket[read-byte] and
@racket[write-byte]. When a single byte's value is less than 128, then
it corresponds to an ASCII character. Any other byte is treated as
part of a UTF-8 sequence, where UTF-8 is a particular standard way of
encoding Unicode scalar values in bytes (which has the nice property
that ASCII characters are encoded as themselves). Thus, a single
@racket[read-char] may call @racket[read-byte] multiple times, and a
single @racket[write-char] may generate multiple output bytes.
The @racket[read-char] and @racket[write-char] operations
@emph{always} use a UTF-8 encoding. If you have a text stream that
uses a different encoding, or if you want to generate a text stream in
a different encoding, use @racket[reencode-input-port] or
@racket[reencode-output-port]. The @racket[reencode-input-port]
function converts an input stream from an encoding that you specify
into a UTF-8 stream; that way, @racket[read-char] sees UTF-8
encodings, even though the original used a different encoding. Beware,
however, that @racket[read-byte] also sees the re-encoded data,
instead of the original byte stream.
@; ----------------------------------------------------------------------
@section[#:tag "io-patterns"]{I/O Patterns}
@(begin
(define port-eval (make-base-eval))
(interaction-eval #:eval port-eval (require racket/port)))
If you want to process individual lines of a file, then you can use
@racket[for] with @racket[in-lines]:
@interaction[
(define (upcase-all in)
(for ([l (in-lines in)])
(display (string-upcase l))
(newline)))
(upcase-all (open-input-string
(string-append
"Hello, World!\n"
"Can you hear me, now?")))
]
If you want to determine whether ``hello'' appears in a file, then you
could search separate lines, but it's even easier to simply apply a
regular expression (see @secref["regexp"]) to the stream:
@interaction[
(define (has-hello? in)
(regexp-match? #rx"hello" in))
(has-hello? (open-input-string "hello"))
(has-hello? (open-input-string "goodbye"))
]
If you want to copy one port into another, use @racket[copy-port] from
@racketmodname[racket/port], which efficiently transfers large blocks
when lots of data is available, but also transfers small blocks
immediately if that's all that is available:
@interaction[
#:eval port-eval
(define o (open-output-string))
(copy-port (open-input-string "broom") o)
(get-output-string o)
]
@close-eval[port-eval]
@; ----------------------------------------------------------------------
@close-eval[io-eval]