GrammarLab: Foundations for a Grammar Laboratory

• Organisation • Future • Activities • Technicalities • Publications •

Organisation

GrammarLab was an NWO-sponsored project in 2010–2013.
GrammarLab was designed and implemented by the SWAT team at CWI.
The lead programmer of GrammarLab is Vadim Zaytsev.
Other members of the GrammarLab project were Paul Klint, Jurgen Vinju and Tijs van der Storm.

Future

Currently there are several projects available to continue the life of GrammarLab, with key publications related to them:

GA: Grammar Analytics [SMR2004] [IWPC2000] [SLE2013]: There are many measurements that can be taken on a grammar, metrics that can be used to analyse them, assess them, visualise them, even compare them. In this extension, you start with a clean grammar, perform calculations based on its contents and present them in a way that makes sense to the user. Advanced execution of this project will involve some form of fact extraction and smell detection.
GI: Grammar Inference [SAC2003] [ASN2005] [CSMR2005] [CSMR2005] [SAC2006] [ICGI2006] [SC2008] [IETS2008] [SCP2014] [UvA2014]: Grammars are not always written by humans: they can be inferred by an algorithm: completely or partially, in a form of an automaton or directly of grammar production rules. Research shows that flat out inference of grammars based on just samples is slow and unreliable, but it works realistically well if inference is done partially or based on trees or token models. There is no current component of GrammarLab that does this.
GN: Grammar Notation [CACM1977] [VU2010] [SAC2012] [LDTA2012]: Grammars are written in a variety of notations, and translations from one to another are frequently needed when a grammar engineer finds a suitable grammar in one notation but wants to use it with a toolkit that requires another. Advanced execution of this project will involve notation analysis and feature modelling.
GM: Grammar Mutation [BX2012] [SQM2014] [SCP2014]: More often than not, changes to a grammar can be expressed in an abstract policy or convention: lowercase nonterminal names, lack of left recursion, etc. Automatically inferred large scale adjustments like these are called grammar mutations and are implemented in a component called SLEIR. Advanced execution of this project will involve empirical evaluation based on improving quality of some extracted grammars of the Grammar Zoo.
GR: Grammar Recovery [CSMR2000] [IEEES2001] [SPE2001] [SLE2008] [SQJ2011] [SAC2012] [LDTA2012] [SCP2014]: Grammars found in books and other non-executable sources could be a nice source of information, if they did not contain so many errors. There are heuristics that help extract a proper grammar from a lexically unreliable source, implemented in a Python component called Grammar Hunter. Advanced execution of this project will involve empirical evaluation based on turning some fetched grammars of the Grammar Zoo into extracted ones.
GT: Grammar Testing [FASE2001] [ICTSS2006] [SLE2011] [UvA2014]: Grammars can be used as guides to generate test data (each test case is a program) representing a language. Such functionality was implemented a couple of times outside GrammarLab in Java+ANTLR, source code is available. Advanced execution of this project will involve generation of negative test cases.
GX: Grammar Transformations [FME2001] [LDTA2001] [LDTA2002] [ATEM2004] [SQJ2011]: Unlike grammar mutations where changes are inferred based on the grammar, there are some transformations that are needed to be specified manually and precisely. Typical uses of these include expressing differences between two conceptually different implementations of the same language, or expressing changes between language versions. GrammarLab includes a suite of such transformation operators called XBGF, a previous implementation of most of them is available in Prolog as well.

Activities

A half-day tutorial about GrammarLab was given at MoDELS 2013 in Miami: if you missed it, you can still have a look at the slides.
Each of the papers published about GrammarLab, except for journal ones, had their corresponding talks with slides available.
A number of PEM Colloquium presentations concerned GrammarLab or research fragments that later became its components:
- Recovery, Convergence and Documentation of Languages, December 2010.
- Grammar Investigation, February 2011.
- Cheating on the Undecidability of Language Equivalence, April 2011.
- Toward an Engineering Discipline for Grammar Recovery, August 2011.
- Bidirectional Transformations and Grammarware, February 2012.
- Tolerance in Grammarware, May 2012.
- Negotiated Transformations, January 2013.
- Modeling Software Structures with GrammarLab, May 2013.

Technicalities

You can inspect the git repo of GrammarLab.
A lot of experimental code that ended up deployed at GrammarLab, comes from a sibling project SLPS.
GrammarLab GGrammar ADT is based on BGF, a BNF-like Grammar Format.
GrammarLab grammar transformation engine is based on XBGF, an operator suite for transforming grammars.
GrammarLab grammar mutation module SLEIR is a systematic intentional generalisation of XBGF.
Grammar Zoo is being extended and maintained with the help of GrammarLab.

Publications

Brian Malloy, James Power, Deriving Grammar Transformations for Developing and Maintaining Multiple Parser Versions, Parsing @ SLE, 2016.
Vadim Zaytsev, Evolution of Metaprograms: XSLT as a Metaprogramming Language, META @ SPLASH, 2016.
Vadim Zaytsev, Evolution of Metaprograms, or How to Transform XSLT to Rascal, SATToSE, 2015.
Vadim Zaytsev, Grammar Zoo: A Repository of Experimental Grammarware, SCP EST5, 2014.
Vadim Zaytsev, Grammar Maturity Model, ME @ MoDELS, 2014.
Vadim Zaytsev, Negotiated Grammar Evolution, JOT, 2014.
Vadim Zaytsev, Software Language Engineering by Intentional Rewriting, SQM @ CSMR-WCRE, EC-EASST, 2014.
Vadim Zaytsev, Formal Foundations for Semi-parsing, ERA @ CSMR-WCRE, 2014.
Vadim Zaytsev, Pending Evolution of Grammars, XM @ MoDELS, 2013.
Ralf Lämmel, Vadim Zaytsev, Language Support for Megamodel Renarration, XM @ MoDELS, 2013.
Vadim Zaytsev, Micropatterns in Grammars, SLE, 2013.
Vadim Zaytsev, Guided Grammar Convergence, SLE poster, 2013.
Vadim Zaytsev, Modelling Robustness with Conjunctive Grammars, SATToSE, 2013.
Vadim Zaytsev, The Grammar Hammer of 2012, CoRR 1212.4446, 2012.
Vadim Zaytsev, Negotiated Grammar Transformation, XM @ MoDELS, 2012.
Vadim Zaytsev, Renarrating Linguistic Architecture: A Case Study, MPM @ MoDELS, 2012.
Vadim Zaytsev, Guided Grammar Convergence. Full Case Study Report. Generated by converge::Guided, CoRR 1207.6541, 2012.
Vadim Zaytsev, Notation-Parametric Grammar Recovery, LDTA @ ETAPS, 2012.
Vadim Zaytsev, BNF WAS HERE: What Have We Done About the Unnecessary Diversity of Notation for Syntactic Definitions, PL @ SAC, 2012.
Vadim Zaytsev, Language Evolution, Metasyntactically, BX @ ETAPS, EC-EASST, 2012.
Vadim Zaytsev, Wiki Migration, Wikimania, 2011.
Vadim Zaytsev, MediaWiki Grammar Recovery, CoRR 1107.4661, 2011.
Paul Klint, Ralf Lämmel, Chris Verhoef, Toward an Engineering Discipline for Grammarware, ACM ToSEM, 2005.

The page is maintained by Dr. Vadim Zaytsev a.k.a. @grammarware. Last updated: December 2016.