no code implementations • Joint Conference on Lexical and Computational Semantics 2021 • Anna H{\"a}tty, Julia Bettinger, Michael Dorna, Jonas Kuhn, Sabine Schulte im Walde
Predicting the difficulty of domain-specific vocabulary is an important task towards a better understanding of a domain, and to enhance the communication between lay people and experts.
no code implementations • ACL 2020 • Anna H{\"a}tty, Dominik Schlechtweg, Michael Dorna, Sabine Schulte im Walde
While automatic term extraction is a well-researched area, computational approaches to distinguish between degrees of technicality are still understudied.
no code implementations • LREC 2020 • Julia Bettinger, Anna H{\"a}tty, Michael Dorna, Sabine Schulte im Walde
We present a dataset with difficulty ratings for 1, 030 German closed noun compounds extracted from domain-specific texts for do-it-ourself (DIY), cooking and automotive.
no code implementations • LREC 2020 • Anurag Nigam, Anna H{\"a}tty, Sabine Schulte im Walde
We perform a comparative study for automatic term extraction from domain-specific language using a PageRank model with different edge-weighting methods.
no code implementations • SEMEVAL 2019 • Anna H{\"a}tty, Dominik Schlechtweg, Sabine Schulte im Walde
We introduce SURel, a novel dataset with human-annotated meaning shifts between general-language and domain-specific contexts.
no code implementations • COLING 2018 • Anna H{\"a}tty, Sabine Schulte im Walde
Automatic term identification and investigating the understandability of terms in a specialized domain are often treated as two separate lines of research.
no code implementations • NAACL 2018 • Anna H{\"a}tty, Sabine Schulte im Walde
This paper introduces a new dataset of term annotation.
no code implementations • EACL 2017 • Anna H{\"a}tty, Michael Dorna, Sabine Schulte im Walde
Feature design and selection is a crucial aspect when treating terminology extraction as a machine learning classification problem.
no code implementations • LREC 2016 • Sabine Schulte im Walde, Anna H{\"a}tty, Stefan Bott, Nana Khvtisavrishvili
This paper presents a novel gold standard of German noun-noun compounds (Ghost-NN) including 868 compounds annotated with corpus frequencies of the compounds and their constituents, productivity and ambiguity of the constituents, semantic relations between the constituents, and compositionality ratings of compound-constituent pairs.