---
title: The Amulet Programming Language
date: January 18, 2018
---
As you might have noticed, I like designing and implementing programming
languages. This is another of these projects. Amulet is a
strictly-evaluated, statically typed impure roughly functional
programming language with support for parametric data types and rank-1
polymorphism _à la_ Hindley-Milner (but [no
let-generalization](#letgen)), along with row-polymorphic records. While
syntactically inspired by the ML family, it's a disservice to those
languages to group Amulet with them, mostly because of the (present)
lack of modules.

Planned features (that I haven't even started working on, as of writing
this post) include generalized algebraic data types, modules and modular
implicits, a reworked type inference engine based on _OutsideIn(X)_[^4]
to support the other features, and, perhaps most importantly, a back-end
that's not a placeholder (i.e. something that generates either C or LLVM
and can be compiled to a standalone executable).

The compiler is still very much a work in progress, and is actively
being improved in several ways: rewriting the parser for efficiency
concerns (see [Lexing and Parsing](#parser)), improving the quality of
generated code by introducing more intermediate representations, and
introducing several optimisations on the one intermediate language we
_do_ have.
## The Technical Bits

In this section, I'm going to describe the implementation of the
compiler as it exists at the time of writing - warts and all.
Unfortunately, we have a bit too much code for all of it to fit in this
blag post, so I'm only going to include the horribly broken bits here,
and leave the rest out. Of course, the compiler is open source, and is
available on my [GitHub][2].
### Lexing and Parsing {#parser}

To call what we have a _lexer_ is a bit of an overstatement: the
`Parser.Lexer` module, which underpins the actual parser, contains only
a handful of imports and some definitions for use with [Parsec's][3]
[`Text.Parsec.Token`][4] module; everything else is boilerplate, namely,
declaring, at top-level, the functions generated by `makeTokenParser`.

Our parser is then built on top of this infrastructure (and the other
combinators provided by Parsec) in a monadic style. Despite having
chosen to use strict `Text`s, many of the Parsec combinators return
`Char`s, and using the `Alternative`{.haskell} type class's ability to
repeat actions makes linked lists of these - the dreaded `String` type.
Due to this, and other inefficiencies, the parser is ridiculously bad
at memory management.
However, it does have some cute hacks. For example, the pattern parser
has to account for being used in the parsing of both `match`{.ml} and
`fun`{.ml} - in the former, destructuring patterns may appear without
parentheses, but in the latter, they _must_ be properly parenthesised:
since `fun`{.ml} may take multiple patterns, it would be ambiguous
whether `fun Foo x -> ...`{.ml} is destructuring a `Foo` or takes two
arguments. Instead of duplicating the pattern parser - one for
`match`{.ml}es and one for function arguments - we _parametrised_ the
parser over needing parentheses or not by adding a rank-2 polymorphic
continuation argument.
```haskell
patternP :: (forall a. Parser a -> Parser a) -> Parser Pattern'
patternP cont = wildcard <|> {- some bits omitted -} try destructure where
  destructure = withPos . cont $ do
    ps <- constrName
    Destructure ps <$> optionMaybe (patternP id)
```
When we're parsing a pattern `match`{.ml}-style, the continuation given
is `id`, and when we're parsing an argument, the continuation is
`parens`.

For the aforementioned efficiency concerns, however, we've decided to
scrap the Parsec-based parser and move to an Alex/Happy-based solution,
which is not only going to be more maintainable and more easily hackable
in the future, but will also be more efficient overall. Of course, for
a toy compiler such as this one, efficiency doesn't matter that much,
but using _one and a half gigabytes_ to compile a 20-line file is really
bad.
### Renaming {#renamer}

To simplify scope handling in both the type checker and optimiser, after
parsing, each variable is tagged with a globally unique integer that is
enough to compare variables. This also lets us use more efficient data
structures later in the compiler, such as `VarSet`, which stores only the
integer identifier of a variable in a big-endian Patricia tree[^1].

Our approach, described in _[Secrets of the Glasgow Haskell Compiler
inliner][5]_ as "the Sledgehammer", consists of duplicating _every_
bound variable to avoid name capture problems. However, while the first
of the listed disadvantages surely does apply, by doing all of the
_renaming_ in one go, we mostly avoid the latter. Of course, since then,
the Haskell ecosystem has evolved significantly, and the plumbing
required is a lot less intrusive.
In our compiler, we use MTL-style classes instead of concrete monad
transformer stacks. We also run every phase after parsing in a single
`GenT`{.haskell} monad, which provides a fresh supply of integers for
names. "Plumbing" the fresh name supply, then, only involves adding a
`MonadGen Int m` constraint to the context of functions that need it.

Since the string component of parsed names is not thrown away, we also
have to make up strings themselves. This is where another cute hack
comes in: we generate, lazily, an infinite stream of names that goes
`["a" .. "z", "aa" .. "az", "ba" .. "bz", ..]`, then use the
`MonadGen`{.haskell} counter as an index into that stream.
```haskell
alpha :: [Text]
alpha = map T.pack $ [1..] >>= flip replicateM ['a'..'z']
```
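As a sanity check, here's a self-contained sketch of how the counter
indexes into that stream - the `nameFor` helper is made up for
illustration; in the compiler the index would come from the
`MonadGen`{.haskell} counter:

```haskell
import Control.Monad (replicateM)
import qualified Data.Text as T

-- The same stream as above: all length-1 names, then length-2, ...
alpha :: [T.Text]
alpha = map T.pack $ [1..] >>= flip replicateM ['a'..'z']

-- Hypothetical helper: the display name for the n-th fresh integer.
nameFor :: Int -> T.Text
nameFor n = alpha !! n
```

So `nameFor 0` is `"a"`, `nameFor 25` is `"z"`, and `nameFor 26` rolls
over to `"aa"`.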
### Desugaring

The desugarer is a very simple piece of code which, through use of _Scrap
Your Boilerplate_-style generic programming, traverses the syntax tree
and rewrites nodes representing syntax sugar to their more explicit
versions.

Currently, the desugarer only expands _sections_: that is, expressions
of the form `(+ e)` become `fun x -> x + e` (where `x` is a fresh name),
expressions like `(e +)` become `fun x -> e + x`, and expressions like
`.foo` become `fun x -> x.foo`.

This is the only component of the compiler that I can reasonably
include, in its entirety, in this post.
```haskell
desugarProgram = everywhereM (mkM defaults) where
  defaults :: Expr Parsed -> m (Expr Parsed)
  defaults (BothSection op an) = do
    (ap, ar) <- fresh an
    (bp, br) <- fresh an
    pure (Fun ap (Fun bp (BinOp ar op br an) an) an)
  defaults (LeftSection op vl an) = do
    (cap, ref) <- fresh an
    pure (Fun cap (BinOp ref op vl an) an)
  defaults (RightSection op vl an) = do
    (cap, ref) <- fresh an
    pure (Fun cap (BinOp vl op ref an) an)
  defaults (AccessSection key an) = do
    (cap, ref) <- fresh an
    pure (Fun cap (Access ref key an) an)
  defaults x = pure x
```
### Type Checking

By far the most complicated stage of the compiler pipeline, our
inference algorithm is modelled after Algorithm W (extended with kinds
and kind inference), with constraint generation and solving being two
separate steps.

We first traverse the syntax tree, in order, making up constraints and
fresh type variables as needed, then invoke a unification algorithm to
produce a substitution, then apply that over both the generated type (a
skeleton of the actual result) and the syntax tree (which is explicitly
annotated with types everywhere).

The type inference code also generates and inserts explicit type
applications when instancing polymorphic types, since we internally
lower Amulet into a System F core language with explicit type
abstraction and application. We have `TypeApp` nodes in the syntax tree
that never get parsed or renamed, and are generated by the type checker
before lowering happens.
Our constraint solver is quite rudimentary, but it does the job nicely.
We operate in a State monad carrying the current substitution. When we
unify a variable with another type, it is added to the current
substitution. Everything else is just zipping the types together. When
we try to unify, say, a function type with a constructor, that's an
error. If a variable that has already been added to the current
substitution is encountered again, the new type is unified with the
previously recorded one.
```haskell
unify :: Type Typed -> Type Typed -> SolveM ()
unify (TyVar a) b = bind a b
unify a (TyVar b) = bind b a
unify (TyArr a b) (TyArr a' b') = unify a a' *> unify b b'
unify (TyApp a b) (TyApp a' b') = unify a a' *> unify b b'
unify ta@(TyCon a) tb@(TyCon b)
  | a == b = pure ()
  | otherwise = throwError (NotEqual ta tb)
```
This is only an excerpt, because we have very complicated types.
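The `bind` half, which the excerpt above calls into, can be sketched
roughly as follows - this is an illustration on made-up types (`Type`,
`Subst`) with a simplified occurs check, not the compiler's actual code:

```haskell
import qualified Data.Map as Map
import Control.Monad.State

-- Toy versions of the solver's types, for illustration only.
data Type = TyVar String | TyCon String | TyArr Type Type
  deriving (Eq, Show)

type Subst = Map.Map String Type

-- Does the variable occur in the type? Binding 'a := t when t
-- mentions 'a would produce an infinite type.
occurs :: String -> Type -> Bool
occurs v (TyVar v')  = v == v'
occurs v (TyArr a b) = occurs v a || occurs v b
occurs _ (TyCon _)   = False

-- Record v := t in the substitution, returning any previous binding
-- so the caller can unify the new type against it.
bind :: String -> Type -> State Subst (Maybe Type)
bind v (TyVar v') | v == v' = pure Nothing  -- trivial binding
bind v t
  | occurs v t = error "occurs check: infinite type"
  | otherwise  = do
      previous <- gets (Map.lookup v)
      case previous of
        Just old -> pure (Just old)  -- caller unifies t with old
        Nothing  -> Nothing <$ modify (Map.insert v t)
```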
#### Polymorphic Records

One of Amulet's selling points (if one could call it that) is its support
for row-polymorphic records. We have two types of first-class record
types: _closed_ record types (the type of literals) and _open_ record
types (the type inferred by record patterns and field getters). Open
record types have the shape `{ 'p | x_1 : t_1 ... x_n : t_n }`{.ml},
while closed records lack the type variable `'p`{.ml}.
Unification of records has three cases, but in all three it is checked
that fields present in both records have unifiable types.

- When unifying an open record with a closed one, the fields present in
  both records must have unifiable types, and the open record's type
  variable is instanced to contain the extra fields.
- When unifying two closed records, they must have exactly the same
  shape and unifiable types for common fields.
- When unifying two open record types, a new fresh type variable is
  created to use as the "hole" and tack the extra fields together.
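These three cases can be sketched on a toy representation of rows -
everything here (`RecTy`, the string-typed fields, returning variable
instantiations as a list) is invented for illustration, and the pairwise
unification of the common fields is elided:

```haskell
import qualified Data.Map as Map
import qualified Data.Set as Set

type Row = Map.Map String String  -- field name -> type, as a string for brevity

-- Closed records are just a row; open ones also carry a hole variable.
data RecTy = Closed Row | Open String Row
  deriving (Eq, Show)

-- Returns the (variable, leftover fields) instantiations, or Nothing
-- on a shape mismatch. Unifying the common fields is elided.
unifyRec :: RecTy -> RecTy -> Maybe [(String, Row)]
unifyRec (Closed a) (Closed b)
  | Map.keysSet a == Map.keysSet b = Just []  -- same shape required
  | otherwise                      = Nothing
unifyRec (Open p a) (Closed b)
  | Map.keysSet a `Set.isSubsetOf` Map.keysSet b =
      Just [(p, b `Map.difference` a)]  -- 'p picks up the extra fields
  | otherwise = Nothing
unifyRec c@(Closed _) o@(Open _ _) = unifyRec o c
unifyRec (Open p a) (Open q b) =
  -- In the real solver both leftovers would share a fresh hole variable.
  Just [(p, b `Map.difference` a), (q, a `Map.difference` b)]
```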
As an example, `{ x = 1 }` has type `{ x : int }`{.ml}, the function
`fun x -> x.foo` has type `{ 'p | foo : 'a } -> 'a`{.ml}, and
`(fun r -> r.x) { y = 2 }` is a type error[^2].
#### No Let Generalisation {#letgen}

Vytiniotis, Peyton Jones and Schrijvers argue[^5] that HM-style
`let`{.ml} generalisation interacts badly with complex type system
extensions such as GADTs and type families, and should therefore be
omitted from such systems. In a deviation from the paper, GHC 7.2
reintroduces `let`{.ml} generalisation for local definitions that meet
some criteria[^3].

> Here's the rule. With `-XMonoLocalBinds` (the default), a binding
> without a type signature is **generalised only if all its free variables
> are closed.**
>
> A binding is **closed** if and only if
>
> - It has a type signature, and the type signature has no free variables; or
> - It has no type signature, and all its free variables are closed, and it
>   is unaffected by the monomorphism restriction. And hence it is fully
>   generalised.

We, however, have chosen to follow that paper to a tee. Despite not
(yet!) having any of those fancy type system features that interact
poorly with let generalisation, we do not generalise _any_ local
bindings.
### Lowering

After type checking is done (and, conveniently, type applications have
been left in the correct places for us by the type checker), Amulet code
is converted into an explicitly-typed intermediate representation, in
direct style, which is used for (local) program optimisation. The AST is
simplified considerably: from 19 constructors to 9.

Type inference is no longer needed: the representation of core is packed
with all the information we need to check that programs are
type-correct. This includes types in every binder (lambda abstractions,
`let`{.ml}s, pattern bindings in `match`{.ml}), big-lambda abstractions
around polymorphic values (a $\lambda$ binds a value, while a $\Lambda$
binds a type), along with the already mentioned type applications.

Here, code also gets the error branches for non-exhaustive `match`{.ml}
expressions, and, as a general rule, gets a lot uglier.
```ocaml
let main _ = (fun r -> r.x) { x = 2 }
(* Is elaborated into *)
let main : ∀ 'e. 'e -> int =
  Λe : *. λk : 'e. match k {
    (p : 'e) : 'e -> (λl : { 'g | x : int }. match l {
      (r : { 'g | x : int }) : { 'g | x : int } -> match r {
        { (n : { 'g | x : int }) | x = (m : int) } : { 'g | x : int } -> m
      };
      (o : { 'g | x : int }) : { 'g | x : int } ->
        error @int "<test>[1:15 .. 1:27]"
    }) ({ {} | x : int = 2 });
    (q : 'e) : 'e -> error @int "<test>[1:14 .. 1:38]"
  }
```
### Optimisation

The code we initially get from lowering is ugly and inefficient: along
with being full of the abstractions functional programs have by nature,
it is full of redundant matches, created e.g. by the fact that functions
cannot pattern-match directly and that field access gets reduced to
pattern matching. The optimiser's job is to make it prettier, and more
efficient.
The optimiser works by applying, in order, a series of local
transformations operating on individual sub-terms to produce an efficient
program, 25 times over. The idea of applying them several times is that,
when a simplification pass kicks in, more simplification opportunities
might arise.
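The driver can be sketched like this - `optimise` and the pass list are
illustrative stand-ins, not the compiler's real pipeline:

```haskell
-- Run every pass, in order, over the program, and repeat the whole
-- pipeline 25 times so passes can feed each other opportunities.
optimise :: [a -> a] -> a -> a
optimise passes = (!! 25) . iterate round1
  where round1 program = foldl (flip ($)) program passes
```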
#### `dropBranches`, `foldExpr`, `dropUselessLets`

These trivial passes remove similarly trivial pieces of code that only
add noise to the program. `dropBranches` will do its best to remove
redundant arms from a `match`{.ml} expression, such as those that
appear after an irrefutable pattern. `foldExpr` reduces uses of
operators where both sides are known, e.g. `2 + 2` (replaced by the
literal `4`) or `"foo " ^ "bar"` (replaced by the literal `"foo
bar"`). `dropUselessLets` removes `let`{.ml}s that bind unused variables
whose right-hand sides are pure expressions.
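A minimal sketch of `foldExpr`-style constant folding, on a toy
expression type invented here (the compiler's real IR is much richer):

```haskell
-- A tiny expression type with just enough cases to demonstrate folding.
data Expr = IntLit Int | StrLit String
          | Add Expr Expr | Concat Expr Expr
  deriving (Eq, Show)

-- Fold operators whose operands are, after folding, both literals.
foldExpr :: Expr -> Expr
foldExpr (Add a b) = case (foldExpr a, foldExpr b) of
  (IntLit x, IntLit y) -> IntLit (x + y)
  (a', b')             -> Add a' b'
foldExpr (Concat a b) = case (foldExpr a, foldExpr b) of
  (StrLit x, StrLit y) -> StrLit (x ++ y)
  (a', b')             -> Concat a' b'
foldExpr e = e
```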
#### `trivialPropag`, `constrPropag`

The Amulet optimiser makes inlining decisions in two (well, three)
separate phases: one is called _propagation_, in which a `let` decides
to propagate its bound values into the expression, and the other is the
more traditional _inlining_, where variables get their values from the
context.

Propagation is by far the easier of the two: the compiler can see both
the definitions and all of the use sites, and could in theory decide if
propagating is beneficial or not. Right now, we propagate all literals
(and records made up solely of other trivial expressions), and do a
round of propagation that is best described as a rule.
```ocaml
let { v = C e } in ... v ...
(* becomes *)
let { v' = e } in ... C v' ...
```
This _constructor propagation_ allows the `match`{.ml} optimisations to
kick in more often, and is semantics-preserving.
#### `match`{.ml}-of-known-constructor

This pass identifies `match`{.ml} expressions where we can statically
determine the expression being analysed and, therefore, decide which
branch is going to be taken.

```ocaml
match C x with
| C e -> ... e ...
...
(* becomes *)
... x ...
```
#### `match`{.ml}-of-bottom

It is always safe to turn a `match`{.ml} where the term being matched is
a diverging expression into just that diverging expression, which can
reduce code size considerably.

```ocaml
match (error @int "message") with ...
(* becomes *)
error @int "message"
```
As a special case, when one of the arms is itself a diverging
expression, we use the type mentioned in that application to `error` to
fix up the type of the value being scrutinized.

```ocaml
match (error @foo "message") with
| _ -> error @bar "message 2"
...
(* becomes *)
error @bar "message"
```
#### `match`{.ml}-of-`match`{.ml}

This transformation turns `match`{.ml} expressions where the expression
being dissected is itself another `match`{.ml} "inside-out": we push the
branches of the _outer_ `match`{.ml} "into" the _inner_ `match`{.ml}
(what used to be the expression being scrutinized). In doing so,
sometimes, new opportunities for match-of-known-constructor arise, and
the code ends up simpler.
```ocaml
match (match x with
       | A -> B
       | C -> D) with
| B -> e
| D -> f
(* becomes *)
match x with
| A -> match B with
       | B -> e
       | D -> f
| C -> match D with
       | B -> e
       | D -> f
```
A clear area of improvement here is extracting the outer branches into
local `let`{.ml}-bound lambda abstractions to avoid an explosion in code
size.
#### `inlineVariable`, `betaReduce`

In this pass, the use of a variable is replaced with the definition of
that variable, if it meets the following conditions:

- The variable is bound to a lambda abstraction; and
- The lambda abstraction's body is not too _expensive_. Computing the
  cost of a term boils down to computing the depth of the tree
  representing that term, with some extra cost added for some specific
  types of expression.
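The cost heuristic might look something like this - the `Term` type, the
weights, and the threshold are all made up for illustration:

```haskell
-- A toy term type; the real IR has many more cases.
data Term = Var String | Lam String Term | App Term Term
          | Match Term [(String, Term)]
  deriving (Eq, Show)

-- Cost is essentially tree depth, with extra weight on matches.
cost :: Term -> Int
cost (Var _)      = 0
cost (Lam _ b)    = 1 + cost b
cost (App f x)    = 1 + max (cost f) (cost x)
cost (Match s as) = 3 + max (cost s) (maximum (0 : map (cost . snd) as))

-- Inline only cheap lambda abstractions; 10 is an arbitrary threshold.
inlineable :: Term -> Bool
inlineable t@(Lam _ _) = cost t <= 10
inlineable _           = False
```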
In doing this, however, we end up with pathological terms of the form
`(fun x -> e) y`{.ml}. The `betaReduce` pass turns this into `let x = y in
e`{.ml}. We generate `let`{.ml} bindings instead of substituting the
variable with the parameter to maintain the same evaluation order and
observable effects of the original code. This does mean that, often,
propagation kicks in and gives rise to new simplification opportunities.
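On a toy term type (again invented here, not the compiler's IR),
`betaReduce` is essentially a one-liner:

```haskell
-- Minimal term type for the sketch.
data Term = Var String | Lam String Term | App Term Term
          | Let String Term Term
  deriving (Eq, Show)

-- A redex becomes a let, preserving the argument's evaluation order
-- (Amulet is strict, so we must not simply substitute).
betaReduce :: Term -> Term
betaReduce (App (Lam x body) arg) = Let x arg body
betaReduce t                      = t
```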
## Epilogue

I was planning to write a section with a formalisation of the language's
semantics and type system, but it turns out I'm no mathematician, no
matter how hard I pretend. Maybe in the future.

Our code generator is wholly uninteresting, and, most of all, a
placeholder: this is why it is not described in detail (that is, at all)
in this post. I plan to write a follow-up when we actually finish the
native code generator.

As previously mentioned, the compiler _is_ open source: the code is
[here][2]. I recommend using the [Nix package manager][9] to acquire the
Haskell dependencies, but Cabal should work too. Current work in
rewriting the parser is happening in the `feature/alex-happy` branch.
[^1]: This sounds fancy, but in practice, it boils down to using
`Data.IntSet`{.haskell} instead of `Data.Set`{.haskell}.

[^2]: As shown [here][6]. Yes, the error messages need improvement.

[^3]: As explained in [this blog post][8].

[^4]: Dimitrios Vytiniotis, Simon Peyton Jones, Tom Schrijvers, and
Martin Sulzmann. 2011. [OutsideIn(X): Modular Type Inference With Local
Assumptions][1]. _Note that, although the paper has been published in
the Journal of Functional Programming, the version linked to here is a
preprint._

[^5]: Dimitrios Vytiniotis, Simon Peyton Jones, and Tom Schrijvers.
2010. [Let Should Not Be Generalised][7].
[1]: <https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/jfp-outsidein.pdf>
[2]: <https://github.com/zardyh/amulet/tree/66a4143af32c3e261af51b74f975fc48c0155dc8>
[3]: <https://hackage.haskell.org/package/parsec-3.1.11>
[4]: <https://hackage.haskell.org/package/parsec-3.1.11/docs/Text-Parsec-Token.html>
[5]: <https://www.microsoft.com/en-us/research/wp-content/uploads/2002/07/inline.pdf>
[6]: </snip/sel.b0e94.txt>
[7]: <https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/tldi10-vytiniotis.pdf>
[8]: <https://ghc.haskell.org/trac/ghc/blog/LetGeneralisationInGhc7>
[9]: <https://nixos.org/nix/>