662 lines
19 KiB
Racket
662 lines
19 KiB
Racket
#reader(lib "docreader.ss" "scribble")
|
|
@require[(lib "manual.ss" "scribble")]
|
|
@require[(lib "bnf.ss" "scribble")]
|
|
@require["utils.ss"]
|
|
|
|
@title[#:tag "reader"]{The Scribble Reader}
|
|
|
|
The Scribble @"@"-reader is designed to be a convenient facility for
|
|
using free-form text in Scheme code, where ``@"@"'' is chosen as one of
|
|
the least-used characters in Scheme code.
|
|
|
|
You can use the reader via MzScheme's @schemefont{#reader} form:
|
|
|
|
@schemeblock[
|
|
#, @schemefont|{
|
|
#reader(lib "reader.ss" "scribble")@{This is free-form text!}
|
|
}|]
|
|
|
|
Note that the reader will only read @"@"-forms as S-expressions. The
|
|
meaning of these S-expressions depends on the rest of your own code.
|
|
|
|
A PLT Scheme manual more likely starts with
|
|
|
|
@schemeblock[
|
|
#, @schemefont{#reader(lib "docreader.ss" "scribble")}
|
|
]
|
|
|
|
which installs a reader, wraps the file content afterward into a
|
|
MzScheme module, and parses the body into a document using
|
|
@file{decode.ss}. See @secref["docreader"] for more information.
|
|
|
|
Another way to use the reader is to use the @scheme[use-at-readtable]
|
|
function to switch the current readtable to a readtable that parses
|
|
@"@"-forms. You can do this in a single command line:
|
|
|
|
@commandline{mzscheme -Le reader.ss scribble "(use-at-readtable)"}
|
|
|
|
In addition to @scheme[read] and @scheme[read-syntax], which are used
|
|
by @schemefont{#reader}, the @file{reader.ss} library provides the
|
|
procedures @scheme[read-inside] and @scheme[read-inside-syntax]; these
|
|
@schemeid[-inner] variants parse as if starting inside a
|
|
@litchar["@{"]...@litchar["}"], and they return a (syntactic) list.
|
|
|
|
@section{Concrete Syntax}
|
|
|
|
Informally, the concrete syntax of @"@"-forms is
|
|
|
|
@schemeblock[
|
|
#, @BNF-seq[@litchar["@"]
|
|
@nonterm{cmd}
|
|
@litchar{[} @kleenestar{@nonterm{datum}} @litchar{]}
|
|
@litchar["{"] @kleenestar{@nonterm{text-body}} @litchar["}"]]
|
|
]
|
|
|
|
where all three parts after @litchar["@"] are optional, but at least
|
|
one should be present. (Note that spaces are not allowed between the
|
|
three parts.) @litchar["@"] is set as a non-terminating reader macro,
|
|
so it can be used as usual in Scheme identifiers unless you want to
|
|
use it as a first character of an identifier; in this case you need to
|
|
quote with a backslash (@schemefont["\\@foo"]) or quote the whole
|
|
identifier with bars (@schemefont["|@foo|"]).
|
|
|
|
@schemeblock[
|
|
#, @schemefont|!{
|
|
(define |@foo| '\@bar@baz)
|
|
}!|]
|
|
|
|
Of course, @litchar["@"] is not treated specially in Scheme strings,
|
|
character constants, etc.
|
|
|
|
Roughly, a form matching the above grammar is read as
|
|
|
|
@schemeblock[
|
|
(#, @nonterm{cmd}
|
|
#, @kleenestar{@nonterm{datum}}
|
|
#, @kleenestar{@nonterm{parsed-body}})
|
|
]
|
|
|
|
where @nonterm{parsed-body} is the translation of each
|
|
@nonterm{text-body} in the input. Thus, the initial @nonterm{cmd}
|
|
determines the Scheme code that the input is translated into. The
|
|
common case is when @nonterm{cmd} is a Scheme identifier, which
|
|
generates a plain Scheme form.
|
|
|
|
A @nonterm{text-body} is made of text, newlines, and nested
|
|
@"@"-forms. Note that the syntax for @"@"-forms is the same in a
|
|
@nonterm{text-body} context as in a Scheme context. A
|
|
@nonterm{text-body} that isn't an @"@"-form is converted to a string
|
|
expression for its @nonterm{parsed-body}, and newlines are converted
|
|
to @scheme["\n"] expressions.
|
|
|
|
@scribble-examples|==={
|
|
@foo{bar baz
|
|
blah}
|
|
@foo{bar @baz[3]
|
|
blah}
|
|
@foo{bar @baz{3}
|
|
blah}
|
|
@foo{bar @baz[2 3]{4 5}
|
|
blah}
|
|
}===|
|
|
|
|
Note that spaces are not allowed before a @litchar["["] or a
|
|
@litchar["{"], or they will be part of the following text (or Scheme
|
|
code). (More on using braces in body texts below.)
|
|
|
|
@scribble-examples|==={
|
|
@foo{bar @baz[2 3] {4 5}}
|
|
}===|
|
|
|
|
When the above @"@"-forms appear in a Scheme expression context, the
|
|
lexical environment must provide bindings for @scheme[foo] (as a procedure or
|
|
a macro).
|
|
|
|
@; FIXME: need to show evaluation here (with the scribble syntax)
|
|
@schemeblock[
|
|
(let* ([formatter (lambda (fmt)
|
|
(lambda args (format fmt (apply string-append args))))]
|
|
[bf (formatter "*~a*")]
|
|
[it (formatter "/~a/")]
|
|
[ul (formatter "_~a_")]
|
|
[text string-append])
|
|
@text{@it{Note}: @bf{This is @ul{not} a pipe}.})
|
|
--> "/Note/: *This is _not_ a pipe*."
|
|
]
|
|
|
|
If you want to see the expression that is actually being read, you can
|
|
use Scheme's @scheme[quote].
|
|
|
|
@scribble-examples|==={
|
|
'@foo{bar}
|
|
}===|
|
|
|
|
@; - - - - - - - - - - - - - - - - - - - - - - - -
|
|
@subsection{The Command Part}
|
|
|
|
Besides being a Scheme identifier, the @nonterm{cmd} part of an
|
|
@"@"-form can have Scheme punctuation prefixes, which will end up
|
|
wrapping the @italic{whole} expression.
|
|
|
|
@scribble-examples|==={
|
|
@`',@foo{blah}
|
|
}===|
|
|
|
|
When writing Scheme code, this means that @litchar["@`',@foo{blah}"]
|
|
is exactly the same as @litchar["`@',@foo{blah}"] and
|
|
@litchar["`',@@foo{blah}"], but unlike the latter two, the first
|
|
construct can appear in body texts with the same meaning, whereas the
|
|
other two would not work (see below).
|
|
|
|
After the optional punctuation prefix, the @nonterm{cmd} itself is not
|
|
limited to identifiers; it can be @italic{any} Scheme expression.
|
|
|
|
@scribble-examples|==={
|
|
@(lambda (x) x){blah}
|
|
@`(unquote foo){blah}
|
|
}===|
|
|
|
|
In addition, the command can be omitted altogether, which will omit it
|
|
from the translation, resulting in an S-expression that usually
|
|
contains, say, just strings:
|
|
|
|
@scribble-examples|==={
|
|
@{foo bar
|
|
baz}
|
|
@'{foo bar
|
|
baz}
|
|
}===|
|
|
|
|
If the command part begins with a @litchar{;} (with no newline between
|
|
the @litchar["@"] and the @litchar{;}), then the construct is a
|
|
comment. There are two comment forms, one for arbitrary-text and
|
|
possibly nested comments, and another one for line comments:
|
|
|
|
@schemeblock[
|
|
#, @BNF-seq[@litchar["@;{"] @kleenestar{@nonterm{any}} @litchar["}"]]
|
|
|
|
#, @BNF-seq[@litchar["@;"] @kleenestar{@nonterm{anything-else-without-newline}}]
|
|
]
|
|
|
|
In the first form, the commented body must still parse correctly; see
|
|
the description of the body syntax below. In the second form, all
|
|
text from the @litchar["@;"] to the end of the line @italic{and} all
|
|
following spaces (or tabs) are part of the comment (similar to
|
|
@litchar["%"] comments in TeX).
|
|
|
|
@scribble-examples|==={
|
|
@foo{bar @; comment
|
|
baz@;
|
|
blah}
|
|
}===|
|
|
|
|
Tip: if you're editing in a Scheme-aware editor (like DrScheme or
|
|
Emacs), it is useful to comment out blocks like this:
|
|
|
|
@verbatim["
|
|
@;{
|
|
...
|
|
;}
|
|
"]
|
|
|
|
so the editor does not treat the file as having unbalanced
|
|
parenthesis.
|
|
|
|
If only the @nonterm{cmd} part of an @"@"-form is specified, then the
|
|
result is the command part only, without an extra set of parenthesis.
|
|
This makes it suitable for Scheme escapes in body texts. (More on this
|
|
below, in the description of the body part.)
|
|
|
|
@scribble-examples|==={
|
|
@foo{x @y z}
|
|
@foo{x @(* y 2) z}
|
|
@{@foo bar}
|
|
}===|
|
|
|
|
Finally, note that there are currently no special rules for using
|
|
@litchar["@"] in the command itself, which can lead to things like:
|
|
|
|
@scribble-examples|==={
|
|
@@foo{bar}{baz}
|
|
}===|
|
|
|
|
@subsection{The Datum Part}
|
|
|
|
The datum part can contains arbitrary Scheme expressions, which
|
|
are simply stacked before the body text arguments:
|
|
|
|
@scribble-examples|==={
|
|
@foo[1 (* 2 3)]{bar}
|
|
@foo[@bar{...}]{blah}
|
|
}===|
|
|
|
|
The body part can still be omitted, which is essentially an
|
|
alternative syntax for plain (non-textual) S-expressions:
|
|
|
|
@scribble-examples|==={
|
|
@foo[bar]
|
|
@foo{bar @f[x] baz}
|
|
}===|
|
|
|
|
The datum part can be empty, which makes no difference, except when
|
|
the body is omitted. It is more common, however, to use an empty body
|
|
for the same purpose.
|
|
|
|
@scribble-examples|==={
|
|
@foo[]{bar}
|
|
@foo[]
|
|
@foo
|
|
@foo{}
|
|
}===|
|
|
|
|
The most common use of the datum part is for Scheme forms that expect
|
|
keyword-value arguments that precede the body of text arguments.
|
|
|
|
@scribble-examples|==={
|
|
@foo[#:style 'big]{bar}
|
|
}===|
|
|
|
|
@subsection{The Body Part}
|
|
|
|
The syntax of the body part is intended to be as convenient as
|
|
possible for free text. It can contain almost any text---the only
|
|
characters with special meaning is @litchar["@"] for sub-@"@"-forms,
|
|
and @litchar["}"] for the end of the text. In addition, a
|
|
@litchar["{"] is allowed as part of the text, and it makes the
|
|
matching @litchar["}"] be part of the text too---so balanced braces
|
|
are valid text.
|
|
|
|
@scribble-examples|==={
|
|
@foo{f{o}o}
|
|
@foo{{{}}{}}
|
|
}===|
|
|
|
|
As described above, the text turns to a sequence of string arguments
|
|
for the resulting form. Spaces at the beginning and end of lines are
|
|
discarded, and newlines turn to individual @scheme["\n"] strings
|
|
(i.e., they are not merged with other body parts). (See also the
|
|
information about newlines and indentation below.) Spaces are
|
|
@italic{not} discarded if they appear after the open @litchar["{"]
|
|
(before the closing @litchar["}"]) when there is also text that
|
|
follows (precedes) it; specifically, they are preserved in a
|
|
single-line body.
|
|
|
|
@scribble-examples|==={
|
|
@foo{bar}
|
|
@foo{ bar }
|
|
@foo[1]{ bar }
|
|
}===|
|
|
|
|
If @litchar["@"] appears in a body, then it is interpreted as Scheme
|
|
code, which means that the @"@"-reader is applied recursively, and the
|
|
resulting syntax appears as part of the S-expression, among other
|
|
string contents.
|
|
|
|
@scribble-examples|==={
|
|
@foo{a @bar{b} c}
|
|
}===|
|
|
|
|
If the nested @"@" construct has only a command---no body or datum
|
|
parts---it will not appear in a subform. Given that the command part
|
|
can be any Scheme expression, this makes @"@" a general escape to
|
|
arbitrary Scheme code.
|
|
|
|
@scribble-examples|==={
|
|
@foo{a @bar c}
|
|
@foo{a @(bar 2) c}
|
|
}===|
|
|
|
|
This is particularly useful with strings, which can be used to include
|
|
arbitrary text.
|
|
|
|
@scribble-examples|==={
|
|
@foo{This @"}" is a closing brace}
|
|
}===|
|
|
|
|
Note that the escaped string is (intentionally) merged with the rest
|
|
of the text. This works for @litchar["@"] too:
|
|
|
|
@scribble-examples|==={
|
|
@foo{The command prefix is @"@".}
|
|
@foo{@"@foo{bar}" reads as (foo "bar")}
|
|
}===|
|
|
|
|
@subsubsub*section{Alternative Body Syntax}
|
|
|
|
In addition to the above, there is an alternative syntax for the body,
|
|
one that specifies a new marker for its end: use @litchar["|{"] for
|
|
the opening marker to have the text terminated by a @litchar["}|"].
|
|
|
|
@scribble-examples|==={
|
|
@foo|{...}|
|
|
@foo|{"}" closes, "{" opens}|
|
|
@foo|{Nesting |{is}| ok}|
|
|
}===|
|
|
|
|
This applies to sub-@"@"-forms too---the @litchar["@"] must be
|
|
prefixed with a @litchar["|"]:
|
|
|
|
@scribble-examples|==={
|
|
@foo|{Maze
|
|
|@bar{is}
|
|
Life!}|
|
|
@foo|{Works for |@bar|{subforms}| too}|
|
|
}===|
|
|
|
|
Note that the subform uses its own delimiters, @litchar["{...}"] or
|
|
@litchar["|{...}|"]. This means that you can copy and paste Scribble
|
|
text with @"@"-forms freely, just prefix the @litchar["@"] if the
|
|
immediate surrounding text has a prefix.
|
|
|
|
For even better control, you can add characters in the opening
|
|
delimiter, between the @litchar["|"] and the @litchar["{"].
|
|
Characters that are put there (non alphanumeric ASCII characters only,
|
|
excluding @litchar["{"] and @litchar["@"]) should also be used for
|
|
sub-@"@"-forms, and the end-of-body marker should have these characters
|
|
in reverse order with paren-like characters (@litchar["("],
|
|
@litchar["["], @litchar["<"]) mirrored.
|
|
|
|
@scribble-examples|==={
|
|
@foo|<<<{Some @x{more} |@{text}|.}>>>|
|
|
@foo|!!{Blah |!!@bold{blah}...}!!|
|
|
}===|
|
|
|
|
Finally, remember that you can use an expression escape with a Scheme
|
|
string for confusing situations. This works well when you only need
|
|
to quote short pieces, and the above works well when you have larger
|
|
multi-line body texts.
|
|
|
|
@subsubsub*section{Scheme Expression Escapes}
|
|
|
|
In some cases, you may want to use a Scheme identifier (or a number or
|
|
a boolean etc.) in a position that touches the following text; in
|
|
these situations you should surround the escaped Scheme expression by
|
|
a pair of @litchar["|"] characters. The text inside the bars is
|
|
parsed as a Scheme expression.
|
|
|
|
@scribble-examples|==={
|
|
@foo{foo@bar.}
|
|
@foo{foo@|bar|.}
|
|
@foo{foo@3.}
|
|
@foo{foo@|3|.}
|
|
}===|
|
|
|
|
This form is a generic Scheme expression escape, there is no body text
|
|
or datum part when you use this form.
|
|
|
|
@scribble-examples|==={
|
|
@foo{foo@|(f 1)|{bar}}
|
|
@foo{foo@|bar|[1]{baz}}
|
|
}===|
|
|
|
|
This works for string expressions too, but note that unlike the above,
|
|
the string is (intentionally) not merged with the rest of the text:
|
|
|
|
@scribble-examples|==={
|
|
@foo{x@"y"z}
|
|
@foo{x@|"y"|z}
|
|
}===|
|
|
|
|
Expression escapes also work with @italic{any} number of expressions,
|
|
|
|
@scribble-examples|==={
|
|
@foo{x@|1 (+ 2 3) 4|y}
|
|
@foo{x@|*
|
|
*|y}
|
|
}===|
|
|
|
|
It seems that @litchar["@||"] has no purpose---but remember that these escapes
|
|
are never merged with the surrounding text, which can be useful when
|
|
you want to control the sub expressions in the form.
|
|
|
|
@scribble-examples|==={
|
|
@foo{Alice@||Bob@|
|
|
|Carol}
|
|
}===|
|
|
|
|
Note that @litchar["@|{...}|"] can be parsed as either an escape expression or
|
|
as a no-command @"@"-form. The latter is used in this case (since there
|
|
is little point in Scheme code that uses braces.
|
|
|
|
@scribble-examples|==={
|
|
@|{blah}|
|
|
}===|
|
|
|
|
@subsubsub*section{Comments}
|
|
|
|
As noted above, there are two kinds of Scribble comments: @litchar["@;{...}"] is
|
|
a (nestable) comment for a whole body of text (following the same
|
|
rules for @"@"-forms), and @litchar["@;..."] is a line-comment.
|
|
|
|
@scribble-examples|==={
|
|
@foo{First line@;{there is still a
|
|
newline at this point;}
|
|
Second line}
|
|
}===|
|
|
|
|
One useful property of line-comments is that they continue to the end
|
|
of the line @italic{and} all following spaces (or tabs). Using this,
|
|
you can get further control of the subforms.
|
|
|
|
@scribble-examples|==={
|
|
@foo{This is @;
|
|
a pretty long @;
|
|
single string-@;
|
|
argument.}
|
|
}===|
|
|
|
|
Note how this is different from using @litchar["@||"]s in that strings
|
|
around it are not merged.
|
|
|
|
@subsubsub*section{Spaces, Newlines, and Indentation}
|
|
|
|
The Scribble syntax treats spaces and newlines in a special way is
|
|
meant to be sensible for dealing with text. As mentioned above,
|
|
spaces at the beginning and end of body lines are discarded, except
|
|
for spaces between a @litchar["{"] and text, or between text and a
|
|
@litchar["}"].
|
|
|
|
@scribble-examples|==={
|
|
@foo{bar}
|
|
@foo{ bar }
|
|
@foo{ bar
|
|
baz }
|
|
}===|
|
|
|
|
A single newline that follows an open brace or precedes a closing
|
|
brace is discarded, unless there are only newlines in the body; other
|
|
newlines are read as a @scheme["\n"] string
|
|
|
|
@;FIXME empty lines are ignored in generated HTML output
|
|
@scribble-examples|==={
|
|
@foo{bar
|
|
}
|
|
@foo{
|
|
bar
|
|
}
|
|
@foo{
|
|
|
|
bar
|
|
|
|
}
|
|
@foo{
|
|
bar
|
|
|
|
baz
|
|
}
|
|
@foo{
|
|
}
|
|
@foo{
|
|
|
|
}
|
|
@foo{ bar
|
|
baz }
|
|
}===|
|
|
|
|
In the parsed S-expression syntax, a single newline string is used for
|
|
all newlines; you can use @scheme[eq?] to identify this line. This
|
|
can be used to identify newlines in the original @nonterm{text-body}.
|
|
|
|
@; FIXME: need to show printout here (with the scribble syntax)
|
|
@schemeblock[
|
|
(let ([nl (car @'{
|
|
})])
|
|
(for-each (lambda (x) (display (if (eq? x nl) "\n... " x)))
|
|
@`{foo
|
|
@,@(list "bar" "\n" "baz")
|
|
blah})
|
|
(newline))
|
|
--prints-->
|
|
foo
|
|
... bar
|
|
baz
|
|
... blah
|
|
]
|
|
|
|
Spaces at the beginning of body lines do not appear in the resulting
|
|
S-expressions, but the column of each line is noticed, and all-space
|
|
indentation strings are added so the result has the same indentation.
|
|
A indentation string is added to each line according to its distance
|
|
from the leftmost syntax object (except for empty lines). (Note: if
|
|
you try these examples on a mzscheme REPL, you should be aware that
|
|
the reader does not know about the "@litchar["> "]" prompt.)
|
|
|
|
@scribble-examples|==={
|
|
@foo{
|
|
bar
|
|
baz
|
|
blah
|
|
}
|
|
@foo{
|
|
begin
|
|
x++;
|
|
end}
|
|
@foo{
|
|
a
|
|
b
|
|
c}
|
|
}===|
|
|
|
|
If the first string came from the openning @litchar["{"] line, it is
|
|
not prepended with an indentation (but it can affect the leftmost
|
|
syntax object used for indentation). This makes sense when formatting
|
|
structured code as well as text (see the last example in the following
|
|
block).
|
|
|
|
@;FIXME: last example too long, messes up output
|
|
@scribble-examples|==={
|
|
@foo{bar
|
|
baz
|
|
bbb}
|
|
@foo{ bar
|
|
baz
|
|
bbb}
|
|
@foo{bar
|
|
baz
|
|
bbb}
|
|
@foo{ bar
|
|
baz
|
|
bbb}
|
|
@foo{ bar
|
|
baz
|
|
bbb}
|
|
@text{Some text@footnote{And a
|
|
footnote comment.}. More text.}
|
|
}===|
|
|
|
|
Note that each @"@"-form is parsed to an S-expression that has its own
|
|
indentation. This means that Scribble source can be indented like
|
|
code, but if indentation matters then you may need to apply
|
|
indentation of the outer item to all lines of the inner one. For
|
|
example, in
|
|
|
|
@litchar/lines|==={
|
|
@code{
|
|
begin
|
|
i = 1, r = 1
|
|
@bold{while i < n do
|
|
r *= i++
|
|
done}
|
|
end
|
|
}
|
|
}===|
|
|
|
|
a formatter will need to apply the 2-space indentation to the
|
|
rendering of the @scheme[bold] body.
|
|
|
|
Note that to get a first-line text to be counted as a leftmost line,
|
|
line and column accounting should be on for the input port
|
|
(@scheme[use-at-readtable] turns them on for the current input port).
|
|
Without this,
|
|
|
|
@litchar/lines|==={
|
|
@foo{x1
|
|
x2
|
|
x3}
|
|
}===|
|
|
|
|
will not have 2-space indentations in the parsed S-expression if
|
|
source accounting is not on, but
|
|
|
|
@litchar/lines|==={
|
|
@foo{x1
|
|
x2
|
|
x3}
|
|
}===|
|
|
|
|
will (due to the last line). Pay attention to this, as it can be a
|
|
problem with Scheme code, for example:
|
|
|
|
@litchar/lines|==={
|
|
@code{(define (foo x)
|
|
(+ x 1))}
|
|
}===|
|
|
|
|
For rare situations where spaces at the beginning (or end) of lines
|
|
matter, you can begin (or end) a line with a "@||".
|
|
|
|
@scribble-examples|==={
|
|
@foo{
|
|
@|| bar @||
|
|
@|| baz}
|
|
}===|
|
|
|
|
Finally, you might be need a verbatim-like environment, where the
|
|
parsed body matches exactly the textual source. To make this
|
|
possible, the @"@"-parser uses syntax properties on the resulting
|
|
syntax values. All items that are not physically in the Scribble
|
|
source---newlines and indentation-spaces---have a 'scribble property.
|
|
An indentation string will have @scheme['indentation] as the value of
|
|
this property, and a newline will have a @scheme['(newline S)] value
|
|
where S is the original newline string including spaces that precede
|
|
and follow it (which includes the indentation for the following item).
|
|
To implement a verbatim environment you need to drop indentation
|
|
strings, and use the original newline strings instead of the
|
|
single-newline string. Here is an example of this.
|
|
|
|
@; FIXME: need to show evaluation here (with the scribble syntax)
|
|
@schemeblock[
|
|
(define-syntax (verb stx)
|
|
(syntax-case stx ()
|
|
[(_ cmd item ...)
|
|
;;FIXME: show a "#`" in the rendering of the following line
|
|
#`(cmd .
|
|
;;FIXME: the next line should begin with a #,
|
|
,(let loop ([items (syntax->list #'(item ...))])
|
|
(if (null? items)
|
|
'()
|
|
(let* ([fst (car items)]
|
|
[prop (syntax-property fst 'scribble)]
|
|
[rst (loop (cdr items))])
|
|
(cond [(not prop) (cons fst rst)]
|
|
[(eq? prop 'indentation) rst]
|
|
[else (cons (datum->syntax-object
|
|
fst (cadr prop) fst)
|
|
rst)])))))]))
|
|
@verb[string-append]{
|
|
foo
|
|
bar
|
|
}
|
|
--> "foo\n bar"
|
|
]
|