no code implementations • NAACL 2022 • Arturo Oncevay, Duygu Ataman, Niels van Berkel, Barry Haddow, Alexandra Birch, Johannes Bjerva
In this work, we propose to reduce the rigidity of such claims, by quantifying morphological typology at the word and segment level.
no code implementations • EACL 2021 • Johannes Bjerva, Isabelle Augenstein
Our hypothesis is that a model trained in a cross-lingual setting will pick up on typological cues from the input data, thus overshadowing the utility of explicitly using such features.
no code implementations • EMNLP (SIGTYP) 2020 • Johannes Bjerva, Elizabeth Salesky, Sabrina J. Mielke, Aditi Chaudhary, Giuseppe G. A. Celano, Edoardo M. Ponti, Ekaterina Vylomova, Ryan Cotterell, Isabelle Augenstein
Typological knowledge bases (KBs) such as WALS (Dryer and Haspelmath, 2013) contain information about linguistic properties of the world's languages.
no code implementations • EMNLP (BlackboxNLP) 2020 • Lukas Muttenthaler, Isabelle Augenstein, Johannes Bjerva
We observe a consistent pattern in the answer representations, which we show can be used to automatically evaluate whether or not a predicted answer span is correct.
1 code implementation • Joint Conference on Lexical and Computational Semantics 2021 • Wei Zhao, Steffen Eger, Johannes Bjerva, Isabelle Augenstein
Cross-lingual representations have the potential to make NLP techniques available to the vast majority of languages in the world.
1 code implementation • EMNLP 2020 • Johannes Bjerva, Nikita Bhutani, Behzad Golshan, Wang-Chiew Tan, Isabelle Augenstein
We find that subjectivity is also an important feature in the case of QA, albeit with more intricate interactions between subjectivity and QA performance.
1 code implementation • EMNLP 2020 • Farhad Nooralahzadeh, Giannis Bekoulis, Johannes Bjerva, Isabelle Augenstein
We show that this challenging setup can be approached using meta-learning, where, in addition to training a source language model, another model learns to select which training instances are the most beneficial to the first.
1 code implementation • 8 Sep 2019 • Johannes Bjerva, Wouter Kouw, Isabelle Augenstein
In particular, language evolution causes data drift between time-steps in sequential decision-making tasks.
no code implementations • WS 2019 • Johannes Bjerva, Katharina Kann, Isabelle Augenstein
Multi-task learning and self-training are two common ways to improve a machine learning model's performance in settings with limited training data.
no code implementations • ACL 2019 • Johannes Bjerva, Yova Kementchedjhieva, Ryan Cotterell, Isabelle Augenstein
The study of linguistic typology is rooted in the implications we find between linguistic features, such as the fact that languages with object-verb word ordering tend to have post-positions.
1 code implementation • NAACL 2019 • Johannes Bjerva, Yova Kementchedjhieva, Ryan Cotterell, Isabelle Augenstein
In the principles-and-parameters framework, the structural features of languages depend on parameters that may be toggled on or off, with a single parameter often dictating the status of multiple features.
no code implementations • CL 2019 • Johannes Bjerva, Robert Östling, Maria Han Veiga, Jörg Tiedemann, Isabelle Augenstein
If the corpus is multilingual, the same model can be used to learn distributed representations of languages, such that similar languages end up with similar representations.
no code implementations • 7 Sep 2018 • Johannes Bjerva
In Natural Language Processing (NLP), one traditionally considers a single task (e. g. part-of-speech tagging) for a single language (e. g. English) at a time.
no code implementations • CONLL 2018 • Yova Kementchedjhieva, Johannes Bjerva, Isabelle Augenstein
This paper documents the Team Copenhagen system which placed first in the CoNLL--SIGMORPHON 2018 shared task on universal morphological reinflection, Task 2 with an overall accuracy of 49. 87.
1 code implementation • EMNLP 2018 • Miryam de Lhoneux, Johannes Bjerva, Isabelle Augenstein, Anders Søgaard
We find that sharing transition classifier parameters always helps, whereas the usefulness of sharing word and/or character LSTM parameters varies.
no code implementations • WS 2018 • Katharina Kann, Johannes Bjerva, Isabelle Augenstein, Barbara Plank, Anders S{\o}gaard
Neural part-of-speech (POS) taggers are known to not perform well with little training data.
no code implementations • SEMEVAL 2018 • Thomas Nyegaard-Signori, Casper Veistrup Helms, Johannes Bjerva, Isabelle Augenstein
We take a multi-task learning approach to the shared Task 1 at SemEval-2018.
no code implementations • WS 2018 • Joachim Bingel, Johannes Bjerva
We approach the 2018 Shared Task on Complex Word Identification by leveraging a cross-lingual multitask learning approach.
no code implementations • NAACL 2018 • Johannes Bjerva, Isabelle Augenstein
A core part of linguistic typology is the classification of languages according to linguistic properties, such as those detailed in the World Atlas of Language Structure (WALS).
no code implementations • WS 2018 • Johannes Bjerva, Isabelle Augenstein
Although linguistic typology has a long history, computational approaches have only recently gained popularity.
no code implementations • 3 Nov 2017 • Johannes Bjerva
For instance, if you are a skilled violinist, you will likely have an easier time learning to play cello.
no code implementations • WS 2017 • Johannes Bjerva, Gintar{\.e} Grigonyt{\.e}, Robert {\"O}stling, Barbara Plank
We present the RUG-SU team{'}s submission at the Native Language Identification Shared Task 2017.
no code implementations • WS 2017 • Artur Kulmizev, Bo Blankers, Johannes Bjerva, Malvina Nissim, Gertjan van Noord, Barbara Plank, Martijn Wieling
In this paper, we explore the performance of a linear SVM trained on language independent character features for the NLI Shared Task 2017.
no code implementations • SEMEVAL 2017 • Johannes Bjerva, Robert {\"O}stling
Shared Task 1 at SemEval-2017 deals with assessing the semantic similarity between sentences, either in the same or in different languages.
no code implementations • CONLL 2017 • Robert Östling, Johannes Bjerva
This paper describes the Stockholm University/University of Groningen (SU-RUG) system for the SIGMORPHON 2017 shared task on morphological inflection.
no code implementations • 10 Jun 2017 • Johan Sjons, Thomas Hörberg, Robert Östling, Johannes Bjerva
In earlier work, we have shown that articulation rate in Swedish child-directed speech (CDS) increases as a function of the age of the child, even when utterance length and differences in articulation rate between subjects are controlled for.
1 code implementation • EACL 2017 • Lasha Abzianidze, Johannes Bjerva, Kilian Evang, Hessel Haagsma, Rik van Noord, Pierre Ludmann, Duc-Duy Nguyen, Johan Bos
The Parallel Meaning Bank is a corpus of translations annotated with shared, formal meaning representations comprising over 11 million words divided over four languages (English, German, Italian, and Dutch).
no code implementations • WS 2016 • Johannes Bjerva, Carl B{\"o}rstell
Computational linguistic approaches to sign languages could benefit from investigating how complexity influences structure.
no code implementations • WS 2016 • Johannes Bjerva
The system, named ResIdent, is trained only on the data released with the task (closed training).
1 code implementation • COLING 2016 • Johannes Bjerva, Barbara Plank, Johan Bos
We propose a novel semantic tagging task, sem-tagging, tailored for the purpose of multilingual semantic parsing, and present the first tagger using deep residual networks (ResNets).