|Version 5 (modified by simonpj, 3 years ago) (diff)|
Add MetaML-style quotes
In the case of terms (only), we know from MetaML that we can have typed quotations. These are rather useful as the metaocaml exploration of the Shonan Challenge report and code (amongst others) shows. Right now, however, all this work is done in (the recently reborn) metaocaml rather than in (Template) Haskell because of the availability of typed quotes and splices for increased correctness.
- Add a new abstract type of typed expressions TExp a. Internally, this can be just an AST (possibly re-using Exp or something internal to GHC) and its type.
- Add a new term quotation form [|| e ||], called a typed quote; the type of the quote is TExp ty, where ty is the type of e. In the type-system jargon, this is the "introduction form" for TExp. The value of this quote is the (typed) AST for e.
- Add a new splice form $$e, called a typed splice. The term e must have type TExp ty, and the splice $$e then has type ty. This is the "elimination form" for TExp. It can, of course, only be used in a context which is expecting a TExp ty.
- Add a constant which takes a typed quote and returns an untyped one: unType :: TExp a -> Q Exp
- Run these new typed splices in the typechecker, not the renamer.
- Use renaming of binding forms to insure capture-avoidance (a la MetaML).
- Cross-stage persistence will remain unchanged. To be able to use an identifier at future stages, it must be fully available, which means that it needs to be defined in a previous compilation unit if it will be spliced (by name) into a term.
The justification for (2) and (3) are classical. For ensuring type-safety (at the current state of knowledge), it is important that TExp be abstract, as sound typed-expression manipulation is very hard to achieve, especially in the presence of binders. A future extension may open this up, whenever this particular tough nut is cracked, but for now we must have (1). In theory, unType (4) is not needed; in practice, it probably will be, but mainly for top-level splices. (5) is obvious: type information needs to be available for typed splices, and this is not available in the renamer. (6) comes from MetaML, and basically just means that the renamer will be applied to typed splices as well. (7) documents a non-change [which is a little awkward in code, was not present in the old metaocaml, but is actually in the new metaocaml -- see the data constructor restriction).
As for syntax, it would obviously be best if one could simply use [| |] for typed quotes as, in the Haskell world, one would want to have as many things as possible be typed. But this is entirely unrealistic: surely there will be "correct" TH code out there which will not type. It would be quite interesting to see what these codes are, but it would also be unfair to inflict such random pain on TH users. Furthermore, as [| |] would be typed only for terms (at least until a lot more research is done, to provide "types" for the other syntactic categories of Haskell), this would likely end up more frustrating than anything else. Thus, at present, a new syntactic category is best. As was documented in the proposal, there are a lot of options here, but it is probably simplest to just double-up, aka [|| e ||] and $$e, rather than steal another sort of generalized bracket for this purpose.Of course, it would be nice to also use Unicode for something nicer; 〈 e 〉 is certainly tempting. ⟪ e ⟫ too. Using ''denotation brackets'' would definitely be wrong though. But, to go back to logic, most tempting is actually ⸢ e ⸣ which logicians have used for quite some time. Then splice could be denoted by ⸤ e ⸥ . The official suggestion is for the latter two, aka unicode points 2e22, 2e23, 2e24 and 2e25 respectively.
Quite a number of other items have already been described in detail in the Proposal, so there is no point in repeating them here.
Scope Extrusion and effects
Since effects in Haskell are typed quite differently than in ocaml, scope extrusion is unlikely to be much of a problem. However, these items should be tested.
Note that there should be no restriction on a to be a pure type, i.e. it could be m a for some Monad m. Normal Haskell typing rules would just apply. Since top-level declarations cannot be generated as (typed) terms, all such effects would have to be present in the environment already for things to be typeable.
It is implicit in the above, but should be made explicit: untyped quotes/splices/quasiquotes should continue to work as they are -- which in detail means that they should be run in the renamer. Typed quotes/splices/quasiquotes, should be run by the typechecker.