Search Results for author: Barbara Plank

Found 143 papers, 53 papers with code

Paper
Add Code

Learning part-of-speech taggers with inter-annotator agreement loss

no code implementations • EACL 2014 • Barbara Plank, Dirk Hovy, Anders S{\o}gaard

Paper
Add Code

SenTube: A Corpus for Sentiment Analysis on YouTube Social Media

no code implementations • LREC 2014 • Olga Uryupina, Barbara Plank, Aliaksei Severyn, Agata Rotondi, Aless Moschitti, ro

In this paper we present SenTube -- a dataset of user-generated comments on YouTube videos annotated for information content and sentiment polarity.

Document Classification Informativeness +3

Paper
Add Code

When POS data sets don't add up: Combatting sample bias

no code implementations • LREC 2014 • Dirk Hovy, Barbara Plank, Anders S{\o}gaard

We present a systematic study of several Twitter POS data sets, the problems of label and data bias, discuss their effects on model performance, and show how to overcome them to learn models that perform well on various test sets, achieving relative error reduction of up to 21{\%}.

POS TAG

Paper
Add Code

What's in a p-value in NLP?

no code implementations • WS 2014 • Anders S{\o}gaard, Anders Johannsen, Barbara Plank, Dirk Hovy, Hector Mart{\'\i}nez Alonso

Paper
Add Code

Linguistically debatable or just plain wrong?

no code implementations • ACL 2014 • Barbara Plank, Dirk Hovy, Anders S{\o}gaard

Part-Of-Speech Tagging

Paper
Add Code

Robust Cross-Domain Sentiment Analysis for Low-Resource Languages

no code implementations • WS 2014 • Jakob Elming, Barbara Plank, Dirk Hovy

Domain Adaptation Sentiment Analysis

Paper
Add Code

Experiments with crowdsourced re-annotation of a POS tagging data set

no code implementations • ACL 2014 • Dirk Hovy, Barbara Plank, Anders S{\o}gaard

Document Classification Named Entity Recognition (NER) +4

Paper
Add Code

Opinion Mining on YouTube

no code implementations • ACL 2014 • Aliaksei Severyn, Aless Moschitti, ro, Olga Uryupina, Barbara Plank, Katja Filippova

Opinion Mining

Paper
Add Code

Adapting taggers to Twitter with not-so-distant supervision

1 code implementation • COLING 2014 • Barbara Plank, Dirk Hovy, Ryan Mcdonald, Anders S{\o}gaard

Paper
Code

Selection Bias, Label Bias, and Bias in Ground Truth

no code implementations • COLING 2014 • Anders S{\o}gaard, Barbara Plank, Dirk Hovy

Dependency Parsing Domain Adaptation +1

Paper
Add Code

Copenhagen-Malm\"o: Tree Approximations of Semantic Parsing Problems

no code implementations • SEMEVAL 2014 • Natalie Schluter, Anders S{\o}gaard, Jakob Elming, Dirk Hovy, Barbara Plank, H{\'e}ctor Mart{\'\i}nez Alonso, Anders Johanssen, Sigrid Klerke

Semantic Parsing Semantic Role Labeling

Paper
Add Code

More or less supervised supersense tagging of Twitter

no code implementations • SEMEVAL 2014 • Anders Johannsen, Dirk Hovy, H{\'e}ctor Mart{\'\i}nez Alonso, Barbara Plank, Anders S{\o}gaard

Domain Adaptation Named Entity Recognition (NER) +2

Paper
Add Code

Importance weighting and unsupervised domain adaptation of POS taggers: a negative result

no code implementations • EMNLP 2014 • Barbara Plank, Anders Johannsen, Anders S{\o}gaard

POS Relation Extraction +1

Paper
Add Code

Active learning for sense annotation

no code implementations • WS 2015 • H{\'e}ctor Mart{\'\i}nez Alonso, Barbara Plank, Anders Johannsen, Anders S{\o}gaard

Active Learning

Paper
Add Code

Mining for unambiguous instances to adapt part-of-speech taggers to new domains

no code implementations • HLT 2015 • Anders Søgaard, Dirk Hovy, Barbara Plank, Héctor Martínez Alonso

POS TAG

Paper
Add Code

Learning to parse with IAA-weighted loss

no code implementations • HLT 2015 • Anders Søgaard, Arne Skjærholt, Barbara Plank, Héctor Martínez Alonso

POS POS Tagging

Paper
Add Code

CPH: Sentiment analysis of Figurative Language on Twitter \#easypeasy \#not

no code implementations • SEMEVAL 2015 • Sarah McGillion, H{\'e}ctor Mart{\'\i}nez Alonso, Barbara Plank

Sentiment Analysis

Paper
Add Code

Non-canonical language is not harder to annotate than canonical language

no code implementations • WS 2015 • Barbara Plank, H{\'e}ctor Mart{\'\i}nez Alonso, Anders S{\o}gaard

Paper
Add Code

Do dependency parsing metrics correlate with human judgments?

no code implementations • CONLL 2015 • Barbara Plank, H{\'e}ctor Mart{\'\i}nez Alonso, {\v{Z}}eljko Agi{\'c}, Danijela Merkler, Anders S{\o}gaard

Dependency Parsing Machine Translation +1

Paper
Add Code

Inverted indexing for cross-lingual NLP

no code implementations • IJCNLP 2015 • Anders S{\o}gaard, {\v{Z}}eljko Agi{\'c}, H{\'e}ctor Mart{\'\i}nez Alonso, Barbara Plank, Bernd Bohnet, Anders Johannsen

Cross-Lingual Transfer Dependency Parsing +2

Paper
Add Code

Semantic Representations for Domain Adaptation: A Case Study on the Tree Kernel-based Method for Relation Extraction

no code implementations • IJCNLP 2015 • Thien Huu Nguyen, Barbara Plank, Ralph Grishman

Relation Extraction Unsupervised Domain Adaptation +1

Paper
Add Code

Personality Traits on Twitter---or---How to Get 1,500 Personality Tests in a Week

no code implementations • WS 2015 • Barbara Plank, Dirk Hovy

Paper
Add Code

Multilingual Projection for Parsing Truly Low-Resource Languages

no code implementations • TACL 2016 • {\v{Z}}eljko Agi{\'c}, Anders Johannsen, Barbara Plank, H{\'e}ctor Mart{\'\i}nez Alonso, Natalie Schluter, Anders S{\o}gaard

We propose a novel approach to cross-lingual part-of-speech tagging and dependency parsing for truly low-resource languages.

Cross-Lingual Transfer Dependency Parsing +2

Paper
Add Code

Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures

no code implementations • 15 Jan 2016 • Raffaella Bernardi, Ruket Cakici, Desmond Elliott, Aykut Erdem, Erkut Erdem, Nazli Ikizler-Cinbis, Frank Keller, Adrian Muscat, Barbara Plank

Automatic description generation from natural images is a challenging problem that has recently received a large amount of interest from the computer vision and natural language processing communities.

Retrieval

Paper
Add Code

Multilingual Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Models and Auxiliary Loss

3 code implementations • ACL 2016 • Barbara Plank, Anders Søgaard, Yoav Goldberg

Bidirectional long short-term memory (bi-LSTM) networks have recently proven successful for various NLP sequence modeling tasks, but little is known about their reliance to input representations, target languages, data set size, and label noise.

Ranked #4 on Part-Of-Speech Tagging on UD

Part-Of-Speech Tagging POS +1

148

Paper
Code

TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling

no code implementations • LREC 2016 • Ben Verhoeven, Walter Daelemans, Barbara Plank

Personality profiling is the task of detecting personality traits of authors based on writing style.

Gender Prediction

Paper
Add Code

LiMoSINe Pipeline: Multilingual UIMA-based NLP Platform

no code implementations • ACL 2016 • Olga Uryupina, Barbara Plank, Gianni Barlacchi, Francisco J. Valverde Albacete, Manos Tsagkias, Antonio Uva, Aless Moschitti, ro

Paper
Add Code

Supersense tagging with inter-annotator disagreement

no code implementations • WS 2016 • H{\'e}ctor Mart{\'\i}nez Alonso, Anders Johannsen, Barbara Plank

Paper
Add Code

What to do about non-standard (or non-canonical) language in NLP

no code implementations • 28 Aug 2016 • Barbara Plank

The solution is not obvious: we cannot control for all factors, and it is not clear how to best go beyond the current practice of training on homogeneous data from a single domain and language.

Sentence

Paper
Add Code

Semantic Tagging with Deep Residual Networks

1 code implementation • COLING 2016 • Johannes Bjerva, Barbara Plank, Johan Bos

We propose a novel semantic tagging task, sem-tagging, tailored for the purpose of multilingual semantic parsing, and present the first tagger using deep residual networks (ResNets).

Part-Of-Speech Tagging POS +2

Paper
Code

Keystroke dynamics as signal for shallow syntactic parsing

1 code implementation • COLING 2016 • Barbara Plank

Keystroke dynamics have been extensively used in psycholinguistic and writing research to gain insights into cognitive processing.

CCG Supertagging Chunking

Paper
Code

When silver glitters more than gold: Bootstrapping an Italian part-of-speech tagger for Twitter

no code implementations • 9 Nov 2016 • Barbara Plank, Malvina Nissim

We bootstrap a state-of-the-art part-of-speech tagger to tag Italian Twitter data, in the context of the Evalita 2016 PoSTWITA shared task.

TAG

Paper
Add Code

Processing non-canonical or noisy text: fortuitous data to the rescue

no code implementations • WS 2016 • Barbara Plank

on which texts can differ from the standard.

Sentence

Paper
Add Code

Multi-view and multi-task training of RST discourse parsers

1 code implementation • COLING 2016 • Chlo{\'e} Braud, Barbara Plank, Anders S{\o}gaard

We experiment with different ways of training LSTM networks to predict RST discourse trees.

Ranked #10 on Discourse Parsing on RST-DT

Discourse Parsing

Paper
Code

When is multitask learning effective? Semantic sequence prediction under varying data conditions

no code implementations • EACL 2017 • Héctor Martínez Alonso, Barbara Plank

Multitask learning has been applied successfully to a range of tasks, mostly morphosyntactic.

Paper
Add Code

Parsing Universal Dependencies without training

1 code implementation • EACL 2017 • Héctor Martínez Alonso, Željko Agić, Barbara Plank, Anders Søgaard

We propose UDP, the first training-free parser for Universal Dependencies (UD).

Paper
Code

When Sparse Traditional Models Outperform Dense Neural Networks: the Curious Case of Discriminating between Similar Languages

no code implementations • WS 2017 • Maria Medvedeva, Martin Kroon, Barbara Plank

We present the results of our participation in the VarDial 4 shared task on discriminating closely related languages.

Language Identification

Paper
Add Code

Cross-lingual tagger evaluation without test data

no code implementations • EACL 2017 • {\v{Z}}eljko Agi{\'c}, Barbara Plank, Anders S{\o}gaard

We address the challenge of cross-lingual POS tagger evaluation in absence of manually annotated test data.

POS

Paper
Add Code

Learning to select data for transfer learning with Bayesian Optimization

1 code implementation • EMNLP 2017 • Sebastian Ruder, Barbara Plank

Domain similarity measures can be used to gauge adaptability and select suitable data for transfer learning, but existing approaches define ad hoc measures that are deemed suitable for respective tasks.

Bayesian Optimization Part-Of-Speech Tagging +2

169

Paper
Code

To Normalize, or Not to Normalize: The Impact of Normalization on Part-of-Speech Tagging

1 code implementation • WS 2017 • Rob van der Goot, Barbara Plank, Malvina Nissim

Does normalization help Part-of-Speech (POS) tagging accuracy on noisy, non-canonical data?

Part-Of-Speech Tagging POS +1

Paper
Code

Neural Networks and Spelling Features for Native Language Identification

no code implementations • WS 2017 • Johannes Bjerva, Gintar{\.e} Grigonyt{\.e}, Robert {\"O}stling, Barbara Plank

We present the RUG-SU team{'}s submission at the Native Language Identification Shared Task 2017.

Native Language Identification Word Embeddings

Paper
Add Code

The Power of Character N-grams in Native Language Identification

no code implementations • WS 2017 • Artur Kulmizev, Bo Blankers, Johannes Bjerva, Malvina Nissim, Gertjan van Noord, Barbara Plank, Martijn Wieling

In this paper, we explore the performance of a linear SVM trained on language independent character features for the NLI Shared Task 2017.

Native Language Identification Text Classification

Paper
Add Code

ALL-IN-1: Short Text Classification with One Model for All Languages

1 code implementation • 26 Oct 2017 • Barbara Plank

We present ALL-IN-1, a simple model for multilingual text classification that does not require any parallel data.

General Classification Multilingual text classification +3

Paper
Code

Last Words: Sharing Is Caring: The Future of Shared Tasks

no code implementations • CL 2017 • Malvina Nissim, Lasha Abzianidze, Kilian Evang, Rob van der Goot, Hessel Haagsma, Barbara Plank, Martijn Wieling

Paper
Add Code

All-In-1 at IJCNLP-2017 Task 4: Short Text Classification with One Model for All Languages

no code implementations • IJCNLP 2017 • Barbara Plank

We present All-In-1, a simple model for multilingual text classification that does not require any parallel data.

General Classification Multilingual text classification +3

Paper
Add Code

Strong Baselines for Neural Semi-supervised Learning under Domain Shift

2 code implementations • ACL 2018 • Sebastian Ruder, Barbara Plank

In this paper, we re-evaluate classic general-purpose bootstrapping approaches in the context of neural networks under domain shifts vs. recent neural approaches and propose a novel multi-task tri-training method that reduces the time and space complexity of classic tri-training.

Ranked #3 on Sentiment Analysis on Multi-Domain Sentiment Dataset

Domain Adaptation Multi-Task Learning +2

Paper
Code

Bleaching Text: Abstract Features for Cross-lingual Gender Prediction

1 code implementation • ACL 2018 • Rob van der Goot, Nikola Ljubešić, Ian Matroos, Malvina Nissim, Barbara Plank

Gender prediction has typically focused on lexical and social network features, yielding good performance, but making systems highly language-, topic-, and platform-dependent.

Gender Prediction

Paper
Code

Grotoco@SLAM: Second Language Acquisition Modeling with Simple Features, Learners and Task-wise Models

no code implementations • WS 2018 • Sigrid Klerke, H{\'e}ctor Mart{\'\i}nez Alonso, Barbara Plank

We present our submission to the 2018 Duolingo Shared Task on Second Language Acquisition Modeling (SLAM).

Language Acquisition

Paper
Add Code

Predicting Authorship and Author Traits from Keystroke Dynamics

no code implementations • WS 2018 • Barbara Plank

Written text transmits a good deal of nonverbal information related to the author{'}s identity and social factors, such as age, gender and personality.

Attribute Machine Translation

Paper
Add Code

Character-level Supervision for Low-resource POS Tagging

no code implementations • WS 2018 • Katharina Kann, Johannes Bjerva, Isabelle Augenstein, Barbara Plank, Anders S{\o}gaard

Neural part-of-speech (POS) taggers are known to not perform well with little training data.

Feature Engineering LEMMA +4

Paper
Add Code

When Simple n-gram Models Outperform Syntactic Approaches: Discriminating between Dutch and Flemish

no code implementations • COLING 2018 • Martin Kroon, Masha Medvedeva, Barbara Plank

In this paper we present the results of our participation in the Discriminating between Dutch and Flemish in Subtitles VarDial 2018 shared task.

Paper
Add Code

Distant Supervision from Disparate Sources for Low-Resource Part-of-Speech Tagging

1 code implementation • EMNLP 2018 • Barbara Plank, Željko Agić

We introduce DsDs: a cross-lingual neural part-of-speech tagger that learns from disparate sources of distant supervision, and realistically scales to hundreds of low-resource languages.

Part-Of-Speech Tagging TAG

123

Paper
Code

Beyond task success: A closer look at jointly learning to see, ask, and GuessWhat

3 code implementations • NAACL 2019 • Ravi Shekhar, Aashish Venkatesh, Tim Baumgärtner, Elia Bruni, Barbara Plank, Raffaella Bernardi, Raquel Fernández

We compare our approach to an alternative system which extends the baseline with reinforcement learning.

Multi-Task Learning Visual Grounding

Paper
Code

The Best of Both Worlds: Lexical Resources To Improve Low-Resource Part-of-Speech Tagging

no code implementations • 21 Nov 2018 • Barbara Plank, Sigrid Klerke, Zeljko Agic

In natural language processing, the deep learning revolution has shifted the focus from conventional hand-crafted symbolic representations to dense inputs, which are adequate representations learned automatically from corpora.

Cross-Lingual POS Tagging Part-Of-Speech Tagging +2

Paper
Add Code

Psycholinguistics meets Continual Learning: Measuring Catastrophic Forgetting in Visual Question Answering

no code implementations • ACL 2019 • Claudio Greco, Barbara Plank, Raquel Fernández, Raffaella Bernardi

We study the issue of catastrophic forgetting in the context of neural multimodal approaches to Visual Question Answering (VQA).

Continual Learning Question Answering +1

Paper
Add Code

MoRTy: Unsupervised Learning of Task-specialized Word Embeddings by Autoencoding

1 code implementation • WS 2019 • Nils Rethmeier, Barbara Plank

Word embeddings have undoubtedly revolutionized NLP.

Word Embeddings

Paper
Code

SyntaxFest 2019 Invited talk - Transferring NLP models across languages and domains

no code implementations • WS 2019 • Barbara Plank

Paper
Add Code

At a Glance: The Impact of Gaze Aggregation Views on Syntactic Tagging

no code implementations • WS 2019 • Sigrid Klerke, Barbara Plank

Hence, caution is warranted when using gaze data as signal for NLP, as no single view is robust over tasks, modeling choice and gaze corpus.

Chunking Part-Of-Speech Tagging +2

Paper
Add Code

Neural Cross-Lingual Transfer and Limited Annotated Data for Named Entity Recognition in Danish

1 code implementation • WS (NoDaLiDa) 2019 • Barbara Plank

Named Entity Recognition (NER) has greatly advanced by the introduction of deep neural architectures.

Cross-Lingual Transfer named-entity-recognition +2

Paper
Code

Cross-Domain Evaluation of Edge Detection for Biomedical Event Extraction

no code implementations • LREC 2020 • Alan Ramponi, Barbara Plank, Rosario Lombardo

Biomedical event extraction is a crucial task in order to automatically extract information from the increasingly growing body of biomedical literature.

Domain Adaptation Edge Detection +1

Paper
Add Code

FT Speech: Danish Parliament Speech Corpus

no code implementations • 25 May 2020 • Andreas Kirkedal, Marija Stepanović, Barbara Plank

A combination of FT Speech with in-domain language data provides comparable results to models trained specifically on Spr\r{a}kbanken, showing that FT Speech transfers well to this data set.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP

2 code implementations • EACL 2021 • Rob van der Goot, Ahmet Üstün, Alan Ramponi, Ibrahim Sharaf, Barbara Plank

In this paper we present MaChAmp, a toolkit for easy fine-tuning of contextualized embeddings in multi-task settings.

Dependency Parsing Language Modelling +5

Paper
Code

Neural Unsupervised Domain Adaptation in NLP---A Survey

1 code implementation • COLING 2020 • Alan Ramponi, Barbara Plank

We also revisit the notion of domain, and we uncover a bias in the type of Natural Language Processing tasks which received most attention.

Out-of-Distribution Generalization Unsupervised Domain Adaptation

262

Paper
Code

Team DiSaster at SemEval-2020 Task 11: Combining BERT and Hand-crafted Features for Identifying Propaganda Techniques in News

no code implementations • SEMEVAL 2020 • Anders Kaas, Viktor Torp Thomsen, Barbara Plank

We present an ablation study which shows that even though BERT representations are very powerful also for this task, BERT still benefits from being combined with carefully designed task-specific features.

Paper
Add Code

Buhscitu at SemEval-2020 Task 7: Assessing Humour in Edited News Headlines Using Hand-Crafted Features and Online Knowledge Bases

no code implementations • SEMEVAL 2020 • Kristian N{\o}rgaard Jensen, Nicolaj Filrup Rasmussen, Thai Wang, Marco Placenti, Barbara Plank

This paper describes a system that aims at assessing humour intensity in edited news headlines as part of the 7th task of SemEval-2020 on {``}Humor, Emphasis and Sentiment{''}.

Language Modelling regression

Paper
Add Code

Longitudinal Citation Prediction using Temporal Graph Neural Networks

no code implementations • 10 Dec 2020 • Andreas Nugaard Holm, Barbara Plank, Dustin Wright, Isabelle Augenstein

Citation count prediction is the task of predicting the number of citations a paper has gained after a period of time.

Citation Prediction

Paper
Add Code

On the Effectiveness of Dataset Embeddings in Mono-lingual,Multi-lingual and Zero-shot Conditions

no code implementations • EACL (AdaptNLP) 2021 • Rob van der Goot, Ahmet Üstün, Barbara Plank

However, it remains unclear in which situations these dataset embeddings are most effective, because they are used in a large variety of settings, languages and tasks.

Dependency Parsing Lemmatization +1

Paper
Add Code

From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding

2 code implementations • NAACL 2021 • Rob van der Goot, Ibrahim Sharaf, Aizhan Imankulova, Ahmet Üstün, Marija Stepanović, Alan Ramponi, Siti Oryza Khairunnisa, Mamoru Komachi, Barbara Plank

To tackle the challenge, we propose a joint learning approach, with English SLU training data and non-English auxiliary tasks from raw text, syntax and translation for transfer.

intent-classification Intent Classification +7

315

Paper
Code

DaN+: Danish Nested Named Entities and Lexical Normalization

1 code implementation • COLING 2020 • Barbara Plank, Kristian Nørgaard Jensen, Rob van der Goot

We examine language-specific versus multilingual BERT, and study the effect of lexical normalization on NER.

Cross-Lingual Transfer Lexical Normalization +4

Paper
Code

De-identification of Privacy-related Entities in Job Postings

1 code implementation • NoDaLiDa 2021 • Kristian Nørgaard Jensen, Mike Zhang, Barbara Plank

We present JobStack, a new corpus for de-identification of personal data in job vacancies on Stackoverflow.

De-identification Multi-Task Learning +1

Paper
Code

Beyond Black \& White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning

no code implementations • NAACL 2021 • Tommaso Fornaciari, Alexandra Uma, Silviu Paun, Barbara Plank, Dirk Hovy, Massimo Poesio

Supervised learning assumes that a ground truth label exists.

Multi-Task Learning

Paper
Add Code

Proceedings of the First Workshop on Weakly Supervised Learning (WeaSuL)

no code implementations • 8 Jul 2021 • Michael A. Hedderich, Benjamin Roth, Katharina Kann, Barbara Plank, Alex Ratner, Dietrich Klakow

Welcome to WeaSuL 2021, the First Workshop on Weakly Supervised Learning, co-located with ICLR 2021.

Weakly-supervised Learning

Paper
Add Code

SemEval-2021 Task 12: Learning with Disagreements

no code implementations • SEMEVAL 2021 • Alexandra Uma, Tommaso Fornaciari, Anca Dumitrache, Tristan Miller, Jon Chamberlain, Barbara Plank, Edwin Simpson, Massimo Poesio

Disagreement between coders is ubiquitous in virtually all datasets annotated with human judgements in both natural language processing and computer vision.

Paper
Add Code

Cartography Active Learning

2 code implementations • Findings (EMNLP) 2021 • Mike Zhang, Barbara Plank

We propose Cartography Active Learning (CAL), a novel Active Learning (AL) algorithm that exploits the behavior of the model on individual instances during training as a proxy to find the most informative instances for labeling.

Active Learning text-classification +1

315

Paper
Code

Genre as Weak Supervision for Cross-lingual Dependency Parsing

1 code implementation • EMNLP 2021 • Max Müller-Eberstein, Rob van der Goot, Barbara Plank

Recent work has shown that monolingual masked language models learn to represent data-driven notions of language variation which can be used for domain-targeted training data selection.

Dependency Parsing Sentence

Paper
Code

How Universal is Genre in Universal Dependencies?

1 code implementation • ACL (TLT, SyntaxFest) 2021 • Max Müller-Eberstein, Rob van der Goot, Barbara Plank

This work provides the first in-depth analysis of genre in Universal Dependencies (UD).

Specificity

Paper
Code

Probing for Labeled Dependency Trees

1 code implementation • ACL 2022 • Max Müller-Eberstein, Rob van der Goot, Barbara Plank

Probing has become an important tool for analyzing representations in Natural Language Processing (NLP).

Dependency Parsing Informativeness

Paper
Code

Experimental Standards for Deep Learning in Natural Language Processing Research

1 code implementation • 13 Apr 2022 • Dennis Ulmer, Elisa Bassignana, Max Müller-Eberstein, Daniel Varab, Mike Zhang, Rob van der Goot, Christian Hardmeier, Barbara Plank

The field of Deep Learning (DL) has undergone explosive growth during the last decade, with a substantial impact on Natural Language Processing (NLP) as well.

Paper
Code

SkillSpan: Hard and Soft Skill Extraction from English Job Postings

2 code implementations • NAACL 2022 • Mike Zhang, Kristian Nørgaard Jensen, Sif Dam Sonniks, Barbara Plank

We introduce a BERT baseline (Devlin et al., 2019).

Multi-Task Learning

315

Paper
Code

What do You Mean by Relation Extraction? A Survey on Datasets and Study on Scientific Relation Classification

1 code implementation • ACL 2022 • Elisa Bassignana, Barbara Plank

Over the last five years, research on Relation Extraction (RE) witnessed extensive progress with many new dataset releases.

Classification Relation +1

315

Paper
Code

Kompetencer: Fine-grained Skill Classification in Danish Job Postings via Distant Supervision and Transfer Learning

2 code implementations • LREC 2022 • Mike Zhang, Kristian Nørgaard Jensen, Barbara Plank

Skill Classification (SC) is the task of classifying job competences from job postings.

Transfer Learning

315

Paper
Code

Sort by Structure: Language Model Ranking as Dependency Probing

no code implementations • NAACL 2022 • Max Müller-Eberstein, Rob van der Goot, Barbara Plank

Making an informed choice of pre-trained language model (LM) is critical for performance, yet environmentally costly, and as such widely underexplored.

Language Modelling Structured Prediction

Paper
Add Code

Skill Extraction from Job Postings using Weak Supervision

1 code implementation • 16 Sep 2022 • Mike Zhang, Kristian Nørgaard Jensen, Rob van der Goot, Barbara Plank

Aggregated data obtained from job postings provide powerful insights into labor market demands, and emerging skills, and aid job matching.

Paper
Code

An Interdisciplinary Perspective on Evaluation and Experimental Design for Visual Text Analytics: Position Paper

no code implementations • 23 Sep 2022 • Kostiantyn Kucher, Nicole Sultanum, Angel Daza, Vasiliki Simaki, Maria Skeppstedt, Barbara Plank, Jean-Daniel Fekete, Narges Mahyar

We identify four key groups of challenges for evaluating visual text analytics approaches (data ambiguity, experimental design, user trust, and "big picture" concerns) and provide suggestions for research opportunities from an interdisciplinary perspective.

Experimental Design Position

Paper
Add Code

CrossRE: A Cross-Domain Dataset for Relation Extraction

1 code implementation • 17 Oct 2022 • Elisa Bassignana, Barbara Plank

Relation Extraction (RE) has attracted increasing attention, but current RE evaluation is limited to in-domain evaluation setups.

Relation Relation Classification

Paper
Code

Evidence > Intuition: Transferability Estimation for Encoder Selection

1 code implementation • 20 Oct 2022 • Elisa Bassignana, Max Müller-Eberstein, Mike Zhang, Barbara Plank

With the increase in availability of large pre-trained language models (LMs) in Natural Language Processing (NLP), it becomes critical to assess their fit for a specific target task a priori - as fine-tuning the entire space of available LMs is computationally prohibitive and unsustainable.

Structured Prediction

Paper
Code

Spectral Probing

1 code implementation • 21 Oct 2022 • Max Müller-Eberstein, Rob van der Goot, Barbara Plank

Linguistic information is encoded at varying timescales (subwords, phrases, etc.)

Informativeness

Paper
Code

Stop Measuring Calibration When Humans Disagree

1 code implementation • 28 Oct 2022 • Joris Baan, Wilker Aziz, Barbara Plank, Raquel Fernández

Calibration is a popular framework to evaluate whether a classifier knows when it does not know - i. e., its predictive probabilities are a good indication of how likely a prediction is to be correct.

Paper
Code

The 'Problem' of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation

1 code implementation • 4 Nov 2022 • Barbara Plank

Human variation in labeling is often considered noise.

Paper
Code

A Survey of Corpora for Germanic Low-Resource Languages and Dialects

2 code implementations • 19 Apr 2023 • Verena Blaschke, Hinrich Schütze, Barbara Plank

In this work, we instead focus on low-resource languages and in particular non-standardized low-resource languages.

Paper
Code

Low-resource Bilingual Dialect Lexicon Induction with Large Language Models

1 code implementation • 19 Apr 2023 • Ekaterina Artemova, Barbara Plank

Bilingual word lexicons are crucial tools for multilingual natural language understanding and machine translation tasks, as they facilitate the mapping of words in one language to their synonyms in another language.

Bilingual Lexicon Induction Natural Language Understanding +4

Paper
Code

Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages

5 code implementations • 20 Apr 2023 • Verena Blaschke, Hinrich Schütze, Barbara Plank

This can for instance be observed when finetuning PLMs on one language and evaluating them on data in a closely related language variety with no standardized orthography.

Cross-Lingual Transfer Part-Of-Speech Tagging +2

Paper
Code

SemEval-2023 Task 11: Learning With Disagreements (LeWiDi)

no code implementations • 28 Apr 2023 • Elisa Leonardelli, Alexandra Uma, Gavin Abercrombie, Dina Almanea, Valerio Basile, Tommaso Fornaciari, Barbara Plank, Verena Rieser, Massimo Poesio

We report on the second LeWiDi shared task, which differs from the first edition in three crucial respects: (i) it focuses entirely on NLP, instead of both NLP and computer vision tasks in its first edition; (ii) it focuses on subjective tasks, instead of covering different types of disagreements-as training with aggregated labels for subjective NLP tasks is a particularly obvious misrepresentation of the data; and (iii) for the evaluation, we concentrate on soft approaches to evaluation.

Sentiment Analysis

Paper
Add Code

Boosting Zero-shot Cross-lingual Retrieval by Training on Artificially Code-Switched Data

1 code implementation • 9 May 2023 • Robert Litschko, Ekaterina Artemova, Barbara Plank

Transferring information retrieval (IR) models from a high-resource language (typically English) to other languages in a zero-shot fashion has become a widely adopted approach.

Cross-Lingual Word Embeddings Information Retrieval +2

Paper
Code

Silver Syntax Pre-training for Cross-Domain Relation Extraction

1 code implementation • 18 May 2023 • Elisa Bassignana, Filip Ginter, Sampo Pyysalo, Rob van der Goot, Barbara Plank

One of the main reasons for this is the limited training size of current RE datasets: obtaining high-quality (manually annotated) data is extremely expensive and cannot realistically be repeated for each new domain.

Relation Relation Extraction

Paper
Code

Multi-CrossRE A Multi-Lingual Multi-Domain Dataset for Relation Extraction

1 code implementation • 18 May 2023 • Elisa Bassignana, Filip Ginter, Sampo Pyysalo, Rob van der Goot, Barbara Plank

Most research in Relation Extraction (RE) involves the English language, mainly due to the lack of multi-lingual resources.

Relation Relation Extraction +1

Paper
Code

What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability

1 code implementation • 19 May 2023 • Mario Giulianelli, Joris Baan, Wilker Aziz, Raquel Fernández, Barbara Plank

In Natural Language Generation (NLG) tasks, for any input, multiple communicative goals are plausible, and any goal can be put into words, or produced, in multiple ways.

Text Generation

Paper
Code

ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain

1 code implementation • 20 May 2023 • Mike Zhang, Rob van der Goot, Barbara Plank

The increasing number of benchmarks for Natural Language Processing (NLP) tasks in the computational job market domain highlights the demand for methods that can handle job-related tasks such as skill extraction, skill classification, job title classification, and de-identification.

De-identification Masked Language Modeling +1

Paper
Code

How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives

1 code implementation • 24 May 2023 • Xinpeng Wang, Leonie Weissweiler, Hinrich Schütze, Barbara Plank

To the best of our knowledge, this is the first work comprehensively evaluating distillation objectives in both settings.

Knowledge Distillation QNLI

Paper
Code

Findings of the VarDial Evaluation Campaign 2023

no code implementations • 31 May 2023 • Noëmi Aepli, Çağrı Çöltekin, Rob van der Goot, Tommi Jauhiainen, Mourhaf Kazzaz, Nikola Ljubešić, Kai North, Barbara Plank, Yves Scherrer, Marcos Zampieri

This report presents the results of the shared tasks organized as part of the VarDial Evaluation Campaign 2023.

Intent Detection

Paper
Add Code

ActiveAED: A Human in the Loop Improves Annotation Error Detection

1 code implementation • 31 May 2023 • Leon Weber, Barbara Plank

This problem has been addressed with Annotation Error Detection (AED) models, which can flag such errors for human re-annotation.

Paper
Code

Uncertainty in Natural Language Generation: From Theory to Applications

no code implementations • 28 Jul 2023 • Joris Baan, Nico Daheim, Evgenia Ilia, Dennis Ulmer, Haau-Sing Li, Raquel Fernández, Barbara Plank, Rico Sennrich, Chrysoula Zerva, Wilker Aziz

Recent advances of powerful Language Models have allowed Natural Language Generation (NLG) to emerge as an important technology that can not only perform traditional tasks like summarisation or translation, but also serve as a natural language interface to a variety of applications.

Active Learning Text Generation

Paper
Add Code

Donkii: Can Annotation Error Detection Methods Find Errors in Instruction-Tuning Datasets?

1 code implementation • 4 Sep 2023 • Leon Weber-Genzel, Robert Litschko, Ekaterina Artemova, Barbara Plank

Our results show that the choice of the right AED method and model size is indeed crucial and derive practical recommendations for how to use AED methods to clean instruction-tuning data.

Text Generation

Paper
Code

Establishing Trustworthiness: Rethinking Tasks and Model Evaluation

no code implementations • 9 Oct 2023 • Robert Litschko, Max Müller-Eberstein, Rob van der Goot, Leon Weber, Barbara Plank

Language understanding is a multi-faceted cognitive capability, which the Natural Language Processing (NLP) community has striven to model computationally for decades.

Paper
Add Code

From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification

no code implementations • 18 Oct 2023 • Shanshan Xu, T. Y. S. S Santosh, Oana Ichim, Isabella Risini, Barbara Plank, Matthias Grabmair

Overall, our case study reveals hitherto underappreciated complexities in creating benchmark datasets in legal NLP that revolve around identifying aspects of a case's facts supposedly relevant to its outcome.

Paper
Add Code

LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation

no code implementations • 18 Oct 2023 • Shengqiang Zhang, Philipp Wicke, Lütfi Kerem Şenel, Luis Figueredo, Abdeldjallil Naceri, Sami Haddadin, Barbara Plank, Hinrich Schütze

The convergence of embodied agents and large language models (LLMs) has brought significant advancements to embodied instruction following.

Caption Generation Instruction Following

Paper
Add Code

ACTOR: Active Learning with Annotator-specific Classification Heads to Embrace Human Label Variation

no code implementations • 23 Oct 2023 • Xinpeng Wang, Barbara Plank

We show that in the active learning setting, a multi-head model performs significantly better than a single-head model in terms of uncertainty estimation.

Active Learning

Paper
Add Code

Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training

no code implementations • 25 Oct 2023 • Max Müller-Eberstein, Rob van der Goot, Barbara Plank, Ivan Titov

We identify critical learning phases across tasks and time, during which subspaces emerge, share information, and later disentangle to specialize.

Language Modelling Multi-Task Learning

Paper
Add Code

Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark

1 code implementation • arXiv 2023 • Stephen Mayhew, Terra Blevins, Shuheng Liu, Marek Šuppa, Hila Gonen, Joseph Marvin Imperial, Börje F. Karlsson, Peiqin Lin, Nikola Ljubešić, LJ Miranda, Barbara Plank, Arij Riabi, Yuval Pinter

We introduce Universal NER (UNER), an open, community-driven project to develop gold-standard NER benchmarks in many languages.

Ranked #1 on Named Entity Recognition (NER) on UNER v1 (Danish)

Cross-Lingual NER Multilingual Named Entity Recognition +3

Paper
Code

NNOSE: Nearest Neighbor Occupational Skill Extraction

1 code implementation • 30 Jan 2024 • Mike Zhang, Rob van der Goot, Min-Yen Kan, Barbara Plank

The labor market is changing rapidly, prompting increased interest in the automatic extraction of occupational skills from text.

Retrieval

Paper
Code

Entity Linking in the Job Market Domain

1 code implementation • 31 Jan 2024 • Mike Zhang, Rob van der Goot, Barbara Plank

In this work, we are the first to explore EL in this domain, specifically targeting the linkage of occupational skills to the ESCO taxonomy (le Vrang et al., 2014).

Entity Linking

Paper
Code

Different Tastes of Entities: Investigating Human Label Variation in Named Entity Annotations

1 code implementation • 2 Feb 2024 • Siyao Peng, Zihang Sun, Sebastian Loftus, Barbara Plank

Named Entity Recognition (NER) is a key information extraction task with a long-standing tradition.

Key Information Extraction named-entity-recognition +2

Paper
Code

Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties

1 code implementation • 3 Feb 2024 • Ekaterina Artemova, Verena Blaschke, Barbara Plank

Inspired by prior work on English varieties, we craft and manually evaluate perturbation rules that transform German sentences into colloquial forms and use them to synthesize test sets in four ToD datasets.

Intent Recognition slot-filling +3

Paper
Code

EEVEE: An Easy Annotation Tool for Natural Language Processing

no code implementations • 5 Feb 2024 • Axel Sorensen, Siyao Peng, Barbara Plank, Rob van der Goot

Annotation tools are the starting point for creating Natural Language Processing (NLP) datasets.

text-classification Text Classification

Paper
Add Code

Deep Learning-based Computational Job Market Analysis: A Survey on Skill Extraction and Classification from Job Postings

no code implementations • 8 Feb 2024 • Elena Senger, Mike Zhang, Rob van der Goot, Barbara Plank

Recent years have brought significant advances to Natural Language Processing (NLP), which enabled fast progress in the field of computational job market analysis.

Classification

Paper
Add Code

Through the Lens of Split Vote: Exploring Disagreement, Difficulty and Calibration in Legal Case Outcome Classification

no code implementations • 11 Feb 2024 • Shanshan Xu, T. Y. S. S Santosh, Oana Ichim, Barbara Plank, Matthias Grabmair

We observe limited alignment with the judge vote distribution.

Navigate

Paper
Add Code

What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects

no code implementations • 19 Feb 2024 • Verena Blaschke, Christoph Purschke, Hinrich Schütze, Barbara Plank

Natural language processing (NLP) has largely focused on modelling standardized languages.

Machine Translation

Paper
Add Code

Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning

no code implementations • 20 Feb 2024 • Philipp Mondorf, Barbara Plank

Deductive reasoning plays a pivotal role in the formulation of sound and cohesive arguments.

Paper
Add Code

"My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models

1 code implementation • 22 Feb 2024 • Xinpeng Wang, Bolei Ma, Chengzhi Hu, Leon Weber-Genzel, Paul Röttger, Frauke Kreuter, Dirk Hovy, Barbara Plank

The open-ended nature of language generation makes the evaluation of autoregressive large language models (LLMs) challenging.

Multiple-choice Text Generation

Paper
Code

Interpreting Predictive Probabilities: Model Confidence or Human Label Variation?

no code implementations • 25 Feb 2024 • Joris Baan, Raquel Fernández, Barbara Plank, Wilker Aziz

With the rise of increasingly powerful and user-facing NLP systems, there is growing interest in assessing whether they have a good representation of uncertainty by evaluating the quality of their predictive distribution over outcomes.

Position

Paper
Add Code

VariErr NLI: Separating Annotation Error from Human Label Variation

no code implementations • 4 Mar 2024 • Leon Weber-Genzel, Siyao Peng, Marie-Catherine de Marneffe, Barbara Plank

To fill this gap, we introduce a systematic methodology and a new dataset, VariErr (variation versus error), focusing on the NLI task in English.

valid

Paper
Add Code

MaiBaam Annotation Guidelines

no code implementations • 9 Mar 2024 • Verena Blaschke, Barbara Kovačić, Siyao Peng, Barbara Plank

This document provides the annotation guidelines for MaiBaam, a Bavarian corpus annotated with part-of-speech (POS) tags and syntactic dependencies.

POS

Paper
Add Code

MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank

no code implementations • 15 Mar 2024 • Verena Blaschke, Barbara Kovačić, Siyao Peng, Hinrich Schütze, Barbara Plank

Despite the success of the Universal Dependencies (UD) project exemplified by its impressive language breadth, there is still a lack in `within-language breadth': most treebanks focus on standard languages.

POS POS Tagging

Paper
Add Code

Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data

1 code implementation • 19 Mar 2024 • Siyao Peng, Zihang Sun, Huangyan Shan, Marie Kolm, Verena Blaschke, Ekaterina Artemova, Barbara Plank

Named Entity Recognition (NER) is a fundamental task to extract key information from texts, but annotated resources are scarce for dialects.

Dialect Identification Multi-Task Learning +3

Paper
Code

Beyond Accuracy: Evaluating the Reasoning Behavior of Large Language Models -- A Survey

no code implementations • 2 Apr 2024 • Philipp Mondorf, Barbara Plank

Large language models (LLMs) have recently shown impressive performance on tasks involving reasoning, leading to a lively debate on whether these models possess reasoning capabilities similar to humans.

Paper
Add Code

MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness

no code implementations • 3 Apr 2024 • Shijia Zhou, Huangyan Shan, Barbara Plank, Robert Litschko

This paper presents our system developed for the SemEval-2024 Task 1: Semantic Textual Relatedness (STR), on Track C: Cross-lingual.

Data Augmentation Machine Translation +2

Paper
Add Code

Look at the Text: Instruction-Tuned Language Models are More Robust Multiple Choice Selectors than You Think

no code implementations • 12 Apr 2024 • Xinpeng Wang, Chengzhi Hu, Bolei Ma, Paul Röttger, Barbara Plank

We show that the text answers are more robust to question perturbations than the first token probabilities, when the first token answers mismatch the text answers.

Multiple-choice

Paper
Add Code

How to Encode Domain Information in Relation Classification

no code implementations • 21 Apr 2024 • Elisa Bassignana, Viggo Unmack Gascou, Frida Nøhr Laustsen, Gustav Kristensen, Marie Haahr Petersen, Rob van der Goot, Barbara Plank

Current language models require a lot of training data to obtain high performance.

Classification Relation +1

Paper
Add Code

“I’ll be there for you”: The One with Understanding Indirect Answers

no code implementations • CODI 2021 • Cathrine Damgaard, Paulina Toborek, Trine Eriksen, Barbara Plank

In this paper, we introduce a new English corpus to study the problem of understanding indirect answers.

Paper
Add Code

Sliced at SemEval-2022 Task 11: Bigger, Better? Massively Multilingual LMs for Multilingual Complex NER on an Academic GPU Budget

no code implementations • SemEval (NAACL) 2022 • Barbara Plank

Our submission of a single model for 11 languages on the SemEval Task 11 MultiCoNER shows that a vanilla transformer-CRF with XLM-R_{large} outperforms the more recent RemBERT, ranking 9th from 26 submissions in the multilingual track.

NER XLM-R

Paper
Add Code

Cross-Lingual Cross-Domain Nested Named Entity Evaluation on English Web Texts

no code implementations • Findings (ACL) 2021 • Barbara Plank

Paper
Add Code

MultiLexNorm: A Shared Task on Multilingual Lexical Normalization

1 code implementation • EMNLP (WNUT) 2021 • Rob van der Goot, Alan Ramponi, Arkaitz Zubiaga, Barbara Plank, Benjamin Muller, Iñaki San Vicente Roncal, Nikola Ljubešić, Özlem Çetinoğlu, Rahmad Mahendra, Talha Çolakoğlu, Timothy Baldwin, Tommaso Caselli, Wladimir Sidorenko

This task is beneficial for downstream analysis, as it provides a way to harmonize (often spontaneous) linguistic variation.

Dependency Parsing Lexical Normalization +2

Paper
Code

Biomedical Event Extraction as Sequence Labeling

no code implementations • EMNLP 2020 • Alan Ramponi, Rob van der Goot, Rosario Lombardo, Barbara Plank

We introduce Biomedical Event Extraction as Sequence Labeling (BeeSL), a joint end-to-end neural information extraction model.

Event Extraction Multi-Task Learning

Paper
Add Code

From back to the roots into the gated woods: Deep learning for NLP

no code implementations • NAACL (TeachingNLP) 2021 • Barbara Plank

Deep neural networks have revolutionized many fields, including Natural Language Processing.

Paper
Add Code

Lexical Resources for Low-Resource PoS Tagging in Neural Times

no code implementations • WS (NoDaLiDa) 2019 • Barbara Plank, Sigrid Klerke

More and more evidence is appearing that integrating symbolic lexical knowledge into neural models aids learning.

Cross-Lingual POS Tagging POS +1

Paper
Add Code

The Lacunae of Danish Natural Language Processing

no code implementations • WS (NoDaLiDa) 2019 • Andreas Kirkedal, Barbara Plank, Leon Derczynski, Natalie Schluter

Danish is a North Germanic language spoken principally in Denmark, a country with a long tradition of technological and scientific innovation.

Paper
Add Code

Fine-tuning vs From Scratch: Do Vision & Language Models Have Similar Capabilities on Out-of-Distribution Visual Question Answering?

no code implementations • LREC 2022 • Kristian Nørgaard Jensen, Barbara Plank

Fine-tuning general-purpose pre-trained models has become a de-facto standard, also for Vision and Language tasks such as Visual Question Answering (VQA).

Question Answering Visual Question Answering

Paper
Add Code

Frustratingly Easy Performance Improvements for Low-resource Setups: A Tale on BERT and Segment Embeddings

no code implementations • LREC 2022 • Rob van der Goot, Max Müller-Eberstein, Barbara Plank

For low-resource syntactic tasks, we observe impacts of segment embedding and multilingual BERT choice.

Dependency Parsing Position +1

Paper
Add Code

NLP North at WNUT-2020 Task 2: Pre-training versus Ensembling for Detection of Informative COVID-19 English Tweets

no code implementations • EMNLP (WNUT) 2020 • Anders Giovanni Møller, Rob van der Goot, Barbara Plank

With the COVID-19 pandemic raging world-wide since the beginning of the 2020 decade, the need for monitoring systems to track relevant information on social media is vitally important.

Task 2

Paper
Add Code

We Need to Consider Disagreement in Evaluation

no code implementations • ACL (BPPF) 2021 • Valerio Basile, Michael Fell, Tommaso Fornaciari, Dirk Hovy, Silviu Paun, Barbara Plank, Massimo Poesio, Alexandra Uma

Instead, we suggest that we need to better capture the sources of disagreement to improve today’s evaluation practice.

Paper
Add Code

Resources and Evaluations for Danish Entity Resolution

no code implementations • CRAC (ACL) 2021 • Maria Barrett, Hieu Lam, Martin Wu, Ophélie Lacroix, Barbara Plank, Anders Søgaard

Automatic coreference resolution is understudied in Danish even though most of the Danish Dependency Treebank (Buch-Kromann, 2003) is annotated with coreference relations.

coreference-resolution Entity Disambiguation +2

Paper
Add Code

Finding the needle in a haystack: Extraction of Informative COVID-19 Danish Tweets

no code implementations • WNUT (ACL) 2021 • Benjamin Olsen, Barbara Plank

In this work, we introduce a new dataset of 5, 000 tweets for finding informative COVID-19 tweets for Danish.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.