no code implementations • EMNLP (sdp) 2020 • Dennis Aumiller, Satya Almasian, Philip Hausner, Michael Gertz
This work presents the entry by the team from Heidelberg University in the CL-SciSumm 2020 shared task at the Scholarly Document Processing workshop at EMNLP 2020.
no code implementations • 4 Oct 2024 • Jonathan Cook, Tim Rocktäschel, Jakob Foerster, Dennis Aumiller, Alex Wang
We then show that STICK (Self-TICK) can be used to improve generation quality across multiple benchmarks via self-refinement and Best-of-N selection.
no code implementations • 10 Jul 2024 • Alexandre Matton, Tom Sherborne, Dennis Aumiller, Elena Tommasone, Milad Alizadeh, Jingyi He, Raymond Ma, Maxime Voisin, Ellen Gilsenan-McMahon, Matthias Gallé
In this paper, we consider contamination by code generation test sets, in particular in their use in modern large language models.
no code implementations • 3 Jul 2024 • Kelly Marchisio, Saurabh Dash, Hongyu Chen, Dennis Aumiller, Ahmet Üstün, Sara Hooker, Sebastian Ruder
Quantization techniques are widely used to improve inference speed and deployment of large language models.
1 code implementation • 24 Oct 2023 • Tannon Kew, Alison Chi, Laura Vásquez-Rodríguez, Sweta Agrawal, Dennis Aumiller, Fernando Alva-Manchego, Matthew Shardlow
Our performance benchmark will be available as a resource for the development of future TS methods and evaluation metrics.
1 code implementation • 22 May 2023 • Jing Fan, Dennis Aumiller, Michael Gertz
We introduce SRLScore, a reference-free evaluation metric designed with text summarization in mind.
1 code implementation • 17 Jan 2023 • Dennis Aumiller, Jing Fan, Michael Gertz
We attribute poor evaluation quality to a variety of different factors, which are investigated in more detail in this work: A lack of qualitative (and diverse) gold data considered for training, understudied (and untreated) positional biases in some of the existing datasets, and the lack of easily accessible and streamlined pre-processing strategies or analysis tools.
1 code implementation • 4 Jan 2023 • Dennis Aumiller, Michael Gertz
Previous state-of-the-art models for lexical simplification consist of complex pipelines with several components, each of which requires deep technical knowledge and fine-tuned interaction to achieve its full potential.
1 code implementation • 24 Oct 2022 • Dennis Aumiller, Ashish Chouhan, Michael Gertz
Existing summarization datasets come with two main drawbacks: (1) They tend to focus on overly exposed domains, such as news articles or wiki-like texts, and (2) are primarily monolingual, with few multilingual datasets.
2 code implementations • LREC 2022 • Dennis Aumiller, Michael Gertz
Traditionally, Text Simplification is treated as a monolingual translation task where sentences between source texts and their simplified counterparts are aligned for training.
Ranked #1 on Text Summarization on Klexikon
1 code implementation • 30 Sep 2021 • Satya Almasian, Dennis Aumiller, Michael Gertz
By supplementing training resources with weakly labeled data from rule-based systems, our model surpasses previous works in temporal tagging and type classification, especially on rare classes.
Ranked #1 on Temporal Tagging on TempEval-3
1 code implementation • 7 Dec 2020 • Dennis Aumiller, Satya Almasian, Sebastian Lackner, Michael Gertz
The growing complexity of legal cases has lead to an increasing interest in legal information retrieval systems that can effectively satisfy user-specific information needs.