1 code implementation • 20 Oct 2022 • Elisa Bassignana, Max Müller-Eberstein, Mike Zhang, Barbara Plank
With the increase in availability of large pre-trained language models (LMs) in Natural Language Processing (NLP), it becomes critical to assess their fit for a specific target task a priori - as fine-tuning the entire space of available LMs is computationally prohibitive and unsustainable.
1 code implementation • 16 Sep 2022 • Mike Zhang, Kristian Nørgaard Jensen, Rob van der Goot, Barbara Plank
Aggregated data obtained from job postings provide powerful insights into labor market demands, and emerging skills, and aid job matching.
2 code implementations • LREC 2022 • Mike Zhang, Kristian Nørgaard Jensen, Barbara Plank
Skill Classification (SC) is the task of classifying job competences from job postings.
2 code implementations • NAACL 2022 • Mike Zhang, Kristian Nørgaard Jensen, Sif Dam Sonniks, Barbara Plank
We introduce a BERT baseline (Devlin et al., 2019).
1 code implementation • 13 Apr 2022 • Dennis Ulmer, Elisa Bassignana, Max Müller-Eberstein, Daniel Varab, Mike Zhang, Rob van der Goot, Christian Hardmeier, Barbara Plank
The field of Deep Learning (DL) has undergone explosive growth during the last decade, with a substantial impact on Natural Language Processing (NLP) as well.
2 code implementations • Findings (EMNLP) 2021 • Mike Zhang, Barbara Plank
We propose Cartography Active Learning (CAL), a novel Active Learning (AL) algorithm that exploits the behavior of the model on individual instances during training as a proxy to find the most informative instances for labeling.
1 code implementation • NoDaLiDa 2021 • Kristian Nørgaard Jensen, Mike Zhang, Barbara Plank
We present JobStack, a new corpus for de-identification of personal data in job vacancies on Stackoverflow.
1 code implementation • WS 2019 • Mike Zhang, Antonio Toral
The effect of translationese has been studied in the field of machine translation (MT), mostly with respect to training data.
no code implementations • SEMEVAL 2019 • Mike Zhang, Roy David, Leon Graumans, Gerben Timmerman
The first task (A) is to decide whether a given tweet contains hate against immigrants or women, in a multilingual perspective, for English and Spanish.