Pruning the Search Space of the Wolof LFG Grammar Using a Probabilistic and a Constraint Grammar Parser
This paper presents a method for greatly reducing parse times in LFG by integrating a Constraint Grammar (CG) parser into a probabilistic context-free grammar. The CG parser is used in the pre-processing phase to reduce morphological and lexical ambiguity. Similarly, the c-structure pruning mechanism of XLE is used in the parsing phase to discard low-probability c-structures before f-annotations are solved. The experimental results show a considerable increase in parsing efficiency and robustness in the annotation of Wolof running text. The Wolof CG parser achieved an F-score of 90% for morphological disambiguation and a speedup of ca. 40%, while the c-structure pruning method increased the speed of the Wolof grammar by over 36%. On a small amount of data, combining CG disambiguation and c-structure pruning yielded a speedup of 58%, albeit with a substantial drop in parse accuracy of 3.62.