Search Results for author: Benjamin Minixhofer

Found 5 papers, 5 papers with code

CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models

1 code implementation · 23 May 2023 · Benjamin Minixhofer, Jonas Pfeiffer, Ivan Vulić

We first address the data gap by introducing a dataset of 255k compound and non-compound words across 56 diverse languages obtained from Wiktionary.

HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis Response

1 code implementation · 10 Oct 2022 · Selim Fekih, Nicolò Tamagnone, Benjamin Minixhofer, Ranjan Shrestha, Ximena Contla, Ewan Oglethorpe, Navid Rekabsaz

Timely and effective response to humanitarian crises requires quick and accurate analysis of large amounts of text data - a process that can highly benefit from expert-assisted NLP systems trained on validated and annotated data in the humanitarian response domain.

Tasks: Humanitarian, Multilabel Text Classification, +2 more

Enhancing Transformers with Gradient Boosted Decision Trees for NLI Fine-Tuning

1 code implementation · Findings (ACL) 2021 · Benjamin Minixhofer, Milan Gritta, Ignacio Iacobacci

For small Natural Language Inference (NLI) datasets, language modelling is typically followed by pretraining on a large (labelled) NLI dataset before fine-tuning with each NLI subtask.

Tasks: Language Modelling, Natural Language Inference, +1 more
