no code implementations • NIDCP (LREC) 2022 • Barbara Heinisch
In the field of citizen linguistics, various initiatives are aimed at the creation of language resources by members of the public.
1 code implementation • NeurIPS Data-Centric AI Workshop 2021 • Christian Lang, Lennart Wachowiak, Barbara Heinisch, Dagmar Gromann
Predicting lexical-semantic relations between word pairs has successfully been accomplished by pre-trained neural language models.
1 code implementation • 3rd Conference on Language, Data and Knowledge 2021 • Lennart Wachowiak, Christian Lang, Barbara Heinisch, Dagmar Gromann
Terminological Concept Systems (TCS) provide a means of organizing, structuring and representing domain-specific multilingual information and are important to ensure terminological consistency in many tasks, such as translation and cross-border communication.
1 code implementation • 12 Dec 2020 • Lennart Wachowiak, Christian Lang, Barbara Heinisch, Dagmar Gromann
We describe our submission to the CogALex-VI shared task on the identification of multilingual paradigmatic relations building on XLM-RoBERTa (XLM-R), a robustly optimized and multilingual BERT model.
no code implementations • LREC 2020 • Barbara Heinisch
Citizen linguistics can help to create language resources and annotate language resources, not only for the improvement of language technologies, such as machine translation but also for the advancement of linguistic research.
no code implementations • LREC 2020 • Barbara Heinisch, Vesna Lu{\v{s}}icky
Therefore, the Austrian Language Resource Portal stresses the importance of language resources specific to a language variety, thus paving the way for the re-use of variety-specific language data for human language technology, such as machine translation training, for the Austrian standard variety.