Add an example to the Redex typesetting docs that discusses how

macros screw up syntax locations and what to do about it Thanks to Lindsey Kuper for the inspiration to do this
2013-06-05 09:54:36 -05:00 · 2013-06-05 09:54:36 -05:00 · 5ea3a1ce6d
commit 5ea3a1ce6d
parent 25c982ad37
4 changed files with 179 additions and 1 deletions
--- a/collects/redex/scribblings/dynamic-typesetting-and-macros.scrbl
+++ b/collects/redex/scribblings/dynamic-typesetting-and-macros.scrbl
@ -0,0 +1,9 @@
+#lang racket/base
+(require racket/runtime-path)
+;; this uses a dynamic require here so that
+;; the zo compiler does not see a connection 
+;; between this file and typesetting-and-macros.scrbl
+;; so that we can avoid compiling that file. 
+(define-runtime-path typesetting-and-macros.scrbl "typesetting-and-macros.scrbl")
+(define doc (dynamic-require typesetting-and-macros.scrbl 'doc))
+(provide doc)
--- a/collects/redex/scribblings/info.rkt
+++ b/collects/redex/scribblings/info.rkt
@ -0,0 +1,6 @@
+#lang setup/infotab
+;; skip this file because we want to preserve the source
+;; location information in the syntax templates so the
+;; redex examples typset properly
+(define compile-omit-paths '("typesetting-and-macros.scrbl"))
+
--- a/collects/redex/scribblings/ref.scrbl
+++ b/collects/redex/scribblings/ref.scrbl
@ -3142,6 +3142,8 @@ would have been just outside the sequence is replaced with an
                                  expression)]{
 Shorthand for nested @racket[with-compound-rewriter] expressions.}

+@subsection{LWs}
+                                              
@defstruct[lw ([e (or/c string?
                        symbol?
                        pict? 
@ -3317,7 +3319,6 @@ the empty string and the @racket[x] in the typeset output.
@defproc[(just-after [stuff (or/c pict? string? symbol?)]
                     [lw lw?])
                    lw?]{}]]{
-
 These two helper functions build new lws whose contents are
 the first argument, and whose line and column are based on
 the second argument, making the new loc wrapper be either
@ -3325,5 +3326,6 @@ just before or just after that argument. The line-span and
 column-span of the new lw is always zero.
 }

+@include-section["dynamic-typesetting-and-macros.scrbl"]

@close-eval[redex-eval]
--- a/collects/redex/scribblings/typesetting-and-macros.scrbl
+++ b/collects/redex/scribblings/typesetting-and-macros.scrbl
@ -0,0 +1,161 @@
+#lang scribble/base
+
+@(require scribble/manual
+          scribble/bnf
+          scribble/struct
+          scribble/eval
+          scribble/racket
+          (for-syntax racket/base)
+          (for-label racket/base
+                     (except-in racket/gui make-color)
+                     racket/pretty
+                     racket/contract
+                     mrlib/graph
+                     (except-in 2htdp/image make-pen text)
+                     (only-in pict pict? text dc-for-text-size text-style/c
+                              vc-append)
+                     redex))
+
+@(define redex-macros-eval (make-base-eval))
+@(interaction-eval #:eval redex-macros-eval (require redex/reduction-semantics
+                                                     redex/pict
+                                                     (for-syntax racket/base)))
+
+@title{Macros and Typesetting}
+
+When you have a macro that abstracts over variations in 
+Redex programs, then typesetting is unlikely to work
+without some help from your macros.
+
+To see the issue, consider this macro abstraction over
+a Redex grammar:
+@interaction[#:eval 
+             redex-macros-eval
+             (define-syntax-rule
+               (def-my-lang L prim ...)
+               (define-language L
+                 (e ::=
+                    (λ (x) e)
+                    (e e)
+                    prim ...
+                    x)
+                 (x ::= variable-not-otherwise-mentioned)))
+
+             (def-my-lang L + - *)
+             (render-language L)]
+Redex thinks that the grammar is going ``backwards'' because 
+of the way macro expansion synthesizes source locations.
+In particular, in the result of the macro expansion, the
+third production for @racket[_e] appears to come later
+in the file than the fourth production and this confuses
+Redex, making it unable to typeset this language.
+
+One simple, not-very-general work-around is to just avoid 
+typesetting the parts that come from the macro arguments. For
+example if you move the primitives into their own non-terminal
+and then just avoid typesetting that, Redex can cope:
+
+@(define exp
+   (quote-syntax
+    (define-syntax-rule
+      (def-my-lang L prim ...)
+      (define-language L
+        (e ::=
+           (λ (x) e)
+           (e e)
+           prims
+           x)
+        (prims ::= prim ...)
+        (x ::= variable-not-otherwise-mentioned)))))
+
+@(redex-macros-eval exp)
+@(to-paragraph exp)
+
+@interaction[#:eval 
+             redex-macros-eval
+             (def-my-lang L + - *)
+             (render-language L #:nts '(e x))]
+
+You can also, however, exploit Racket's macro system to rewrite the
+source locations in a way that tells Redex where the macro-introduced
+parts of the language are supposed to be, and then typesetting
+will work normally. For example, here is one way to do this with
+the original language:
+
+@(require redex/reduction-semantics (for-syntax racket/base))
+@(define fancy-exp
+  (quote-syntax
+   (define-syntax (def-my-lang stx)
+      (syntax-case stx ()
+        [(_ L a ...)
+         (let ()
+           (define template
+             #'(define-language L
+                 (e (λ (x) e)
+                    (e e)
+                    HERE
+                    x)
+                 (x variable-not-otherwise-mentioned)))
+           (car
+            (let loop ([stx template])
+              (syntax-case stx (HERE)
+                [HERE 
+                 (let loop ([as (syntax->list #'(a ...))]
+                            [pos (syntax-position stx)]
+                            [col (syntax-column stx)])
+                   (cond
+                     [(null? as) '()]
+                     [else 
+                      (define a (car as))
+                      (define span 
+                        (string-length
+                         (symbol->string (syntax-e a))))
+                      (define srcloc
+                        (vector (syntax-source stx)
+                                (syntax-line stx)
+                                col
+                                pos
+                                span))
+                      (cons
+                       (datum->syntax a 
+                                      (syntax-e a)
+                                      srcloc
+                                      a)
+                       (loop (cdr as)
+                             (+ pos span 1)
+                             (+ col span 1)))]))]
+                [(a ...)
+                 (list
+                  (datum->syntax
+                   stx
+                   (apply append (map loop (syntax->list #'(a ...))))
+                   stx
+                   stx))]
+                [a
+                 (list stx)]))))]))))
+
+@(redex-macros-eval fancy-exp)
+@(to-paragraph fancy-exp)
+
+@interaction[#:eval redex-macros-eval
+                    (def-my-lang L + - *)]
+@interaction[#:eval redex-macros-eval
+             (render-language L)]
+
+
+And one final caveat: when Racket compiles source files to bytecode format, 
+it discards source location information in syntax constants, 
+which means that if a file containing
+a macro like the one above is compiled to bytecode, then it cannot properly
+adjust the source locations and the typeset language will not look right 
+(the important constant whose source locations are lost is @racket[template], above).
+
+The simplest way to avoid this problem is to just avoid creating bytecode for
+these files.
+
+It is possible to write the constant in the source code, however, and then
+process it so the compiled version of the file contains different data structures
+that record the source locations of the expressions. Redex does this internally so that 
+compiled files that use, e.g., @racket[define-language], still correctly typeset.
+But when you write macros that expand into @racket[define-language], you must
+also take this step yourself (or avoid @tt{.zo} files).