Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian

LREC 2014 · Anton Karl Ingason, Hrafn Loftsson, Eir{\'\i}kur R{\"o}gnvaldsson, Einar Freyr Sigur{\dh}sson, Joel C. Wallenberg ·

This paper presents ongoing work that aims to improve machine parsing of Faroese using a combination of Faroese and Icelandic training data. We show that even if we only have a relatively small parsed corpus of one language, namely 53,000 words of Faroese, we can obtain better results by adding information about phrase structure from a closely related language which has a similar syntax. Our experiment uses the Berkeley parser. We demonstrate that the addition of Icelandic data without any other modification to the experimental setup results in an f-measure improvement from 75.44{\%} to 78.05{\%} in Faroese and an improvement in part-of-speech tagging accuracy from 88.86{\%} to 90.40{\%}.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Part-Of-Speech Tagging

Datasets

Add Datasets introduced or used in this paper

Results from the Paper

Add Remove

Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Rapid Deployment of Phrase Structure Parsing for Related Languages: A Case Study of Insular Scandinavian

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove