Search Results for author: Lukas Lange

Found 25 papers, 14 papers with code

AnnoCTR: A Dataset for Detecting and Linking Entities, Tactics, and Techniques in Cyber Threat Reports

2 code implementations • 11 Apr 2024 • Lukas Lange, Marc Müller, Ghazaleh Haratinezhad Torbati, Dragan Milchevski, Patrick Grau, Subhash Pujari, Annemarie Friedrich

In our few-shot scenario, we find that for identifying the MITRE ATT&CK concepts that are mentioned explicitly or implicitly in a text, concept descriptions from MITRE ATT&CK are an effective source for training data augmentation.

Data Augmentation
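
A minimal sketch of the augmentation idea from the abstract above: treat each MITRE ATT&CK concept description as an additional labeled training example for few-shot concept classification. The concept IDs, description texts, and data structures below are illustrative placeholders, not the paper's actual data or pipeline.

```python
# Hypothetical sketch: use MITRE ATT&CK concept descriptions as extra
# labeled examples for few-shot concept classification. IDs and texts
# below are illustrative placeholders, not the paper's data.

few_shot_examples = [
    ("The actor used spearphishing emails with malicious attachments.", "T1566"),
]

concept_descriptions = {
    "T1566": "Phishing: adversaries send messages to elicit information or deliver payloads.",
    "T1059": "Command and Scripting Interpreter: adversaries abuse interpreters to run code.",
}

def augment(examples, descriptions):
    """Append (description, concept_id) pairs to the few-shot training set."""
    return examples + [(text, cid) for cid, text in descriptions.items()]

train_data = augment(few_shot_examples, concept_descriptions)
print(len(train_data))  # 3 examples: 1 annotated + 2 synthesized from descriptions
```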

Discourse-Aware In-Context Learning for Temporal Expression Normalization

no code implementations • 11 Apr 2024 • Akash Kumar Gautam, Lukas Lange, Jannik Strötgen

In this work, we explore the feasibility of proprietary and open-source large language models (LLMs) for TE normalization using in-context learning to inject task, document, and example information into the model.

In-Context Learning
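
The abstract describes injecting task, document, and example information into an in-context-learning prompt. A hedged sketch of what such a prompt builder might look like; the template, the example normalizations, and the TIMEX-style values are assumptions for illustration, not the paper's format.

```python
# Hypothetical prompt builder for temporal expression (TE) normalization
# via in-context learning: the prompt combines a task description,
# document context, and demonstration examples.

def build_prompt(task_desc, document, examples, target_expression):
    demo = "\n".join(
        f'Expression: "{e}" -> Normalized: {n}' for e, n in examples
    )
    return (
        f"Task: {task_desc}\n\n"
        f"Document context:\n{document}\n\n"
        f"Examples:\n{demo}\n\n"
        f'Expression: "{target_expression}" -> Normalized:'
    )

prompt = build_prompt(
    task_desc="Normalize temporal expressions to TIMEX3 values.",
    document="The meeting took place on Monday. (Document created 2024-04-11.)",
    examples=[("yesterday", "2024-04-10"), ("next week", "2024-W16")],
    target_expression="Monday",
)
print(prompt)
```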

GradSim: Gradient-Based Language Grouping for Effective Multilingual Training

no code implementations • 23 Oct 2023 • Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze

However, not all languages influence each other positively, and it is an open research question how to select the most suitable set of languages for multilingual training while avoiding negative interference among languages whose characteristics or data distributions are not compatible.

Sentiment Analysis
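
A minimal sketch of the gradient-grouping idea under simplifying assumptions: one averaged gradient vector per language (random placeholders here instead of real backpropagated gradients), with candidate source languages ranked by cosine similarity to a target language's gradient.

```python
import numpy as np

# Sketch of gradient-based language grouping: in practice each vector
# would be an averaged model gradient from that language's training data;
# here they are random placeholders.

rng = np.random.default_rng(0)
languages = ["en", "de", "sw", "yo"]
grads = {lang: rng.normal(size=1024) for lang in languages}  # placeholders

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

target = "sw"
ranked = sorted(
    (lang for lang in languages if lang != target),
    key=lambda lang: cosine(grads[target], grads[lang]),
    reverse=True,
)
print(ranked)  # languages ordered by gradient similarity to the target
```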

TADA: Efficient Task-Agnostic Domain Adaptation for Transformers

1 code implementation • 22 May 2023 • Chia-Chien Hung, Lukas Lange, Jannik Strötgen

Our broad evaluation on 4 downstream tasks across 14 domains, covering single- and multi-domain setups and high- and low-resource scenarios, reveals that TADA is an effective and efficient alternative to full domain-adaptive pre-training and to adapters for domain adaptation, while introducing no additional parameters or complex training steps.

Domain Adaptation

NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis

no code implementations • 28 Apr 2023 • Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schütze

In this work, we propose to leverage language-adaptive and task-adaptive pretraining on African texts and study transfer learning with source language selection on top of an African language-centric pretrained language model.

Language Modelling • Sentiment Analysis • +1

SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains

1 code implementation • 14 Feb 2023 • Koustava Goswami, Lukas Lange, Jun Araki, Heike Adel

Prompting pre-trained language models leads to promising results across natural language processing tasks but is less effective when applied in low-resource domains, due to the domain gap between the pre-training data and the downstream task.

Language Modelling • text-classification • +1
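
A rough sketch of a gated soft prompt, assuming a learned per-token sigmoid gate that interpolates between general and domain-specific prompt embeddings before they are prepended to the input; the actual SwitchPrompt gating mechanism and dimensions may differ.

```python
import torch
import torch.nn as nn

# Sketch of a gated soft prompt: a sigmoid gate mixes general and
# domain-specific learned prompt vectors per prompt position.

class GatedSoftPrompt(nn.Module):
    def __init__(self, prompt_len=10, dim=768):
        super().__init__()
        self.general = nn.Parameter(torch.randn(prompt_len, dim))
        self.domain = nn.Parameter(torch.randn(prompt_len, dim))
        self.gate = nn.Parameter(torch.zeros(prompt_len, 1))  # per-token gate

    def forward(self, input_embeds):
        g = torch.sigmoid(self.gate)                     # (prompt_len, 1)
        prompt = g * self.domain + (1 - g) * self.general
        batch = input_embeds.size(0)
        prompt = prompt.unsqueeze(0).expand(batch, -1, -1)
        return torch.cat([prompt, input_embeds], dim=1)  # prepend prompt

prompts = GatedSoftPrompt()
x = torch.randn(2, 32, 768)   # (batch, seq_len, hidden)
print(prompts(x).shape)       # torch.Size([2, 42, 768])
```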

Multilingual Normalization of Temporal Expressions with Masked Language Models

1 code implementation • 20 May 2022 • Lukas Lange, Jannik Strötgen, Heike Adel, Dietrich Klakow

The detection and normalization of temporal expressions is an important task and preprocessing step for many applications.

Language Modelling • Masked Language Modeling
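
One way to read "normalization with masked language models" is as mask filling over value templates. The toy template and the choice of xlm-roberta-base below are assumptions for illustration, not the paper's setup.

```python
from transformers import pipeline

# Sketch: cast temporal expression normalization as masked-token
# prediction with a multilingual masked language model.

fill = pipeline("fill-mask", model="xlm-roberta-base")
template = (
    'Document date: 2022-05-20. The phrase "yesterday" refers to the date '
    "2022-05-<mask>."
)
for candidate in fill(template, top_k=3):
    print(candidate["token_str"], candidate["score"])
```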

CLIN-X: pre-trained language models and a study on cross-task transfer for concept extraction in the clinical domain

1 code implementation • 16 Dec 2021 • Lukas Lange, Heike Adel, Jannik Strötgen, Dietrich Klakow

The field of natural language processing (NLP) has recently seen a major shift towards using pre-trained language models for solving almost any task.

Clinical Concept Extraction • Sentence • +1

To Share or not to Share: Predicting Sets of Sources for Model Transfer Learning

1 code implementation • EMNLP 2021 • Lukas Lange, Jannik Strötgen, Heike Adel, Dietrich Klakow

For this, we study the effects of model transfer on sequence labeling across various domains and tasks and show that our methods based on model similarity and support vector machines are able to predict promising sources, resulting in performance increases of up to 24 F1 points.

text similarity • Transfer Learning
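
A minimal sketch of the SVM component under assumed features: given similarity scores between a source and a target (placeholder random features and labels here), train a classifier to predict whether transfer from that source will help.

```python
import numpy as np
from sklearn.svm import SVC

# Sketch: predict promising transfer sources from similarity features
# (e.g., text and model similarity scores per source-target pair).
# Features and labels below are random placeholders.

rng = np.random.default_rng(42)
X = rng.random((40, 3))                     # similarity features per pair
y = (X[:, 0] + X[:, 1] > 1.0).astype(int)   # placeholder "transfer helped" labels

clf = SVC(kernel="rbf").fit(X[:30], y[:30])
print(clf.predict(X[30:]))  # predicted promising sources for held-out pairs
```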

ANEA: Distant Supervision for Low-Resource Named Entity Recognition

1 code implementation • 25 Feb 2021 • Michael A. Hedderich, Lukas Lange, Dietrich Klakow

Distant supervision allows obtaining labeled training corpora for low-resource settings where only limited hand-annotated data exists.

Low Resource Named Entity Recognition • named-entity-recognition • +2
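
A minimal sketch of gazetteer-based distant supervision for NER: tokens matched against entity lists receive labels automatically. ANEA derives such lists automatically; the tiny hand-made gazetteers below are placeholders.

```python
# Sketch of distant supervision: label tokens by gazetteer lookup
# instead of hand annotation. Gazetteers here are toy placeholders.

gazetteers = {
    "LOC": {"nairobi", "mombasa"},
    "PER": {"amina", "juma"},
}

def distant_labels(tokens):
    labels = []
    for tok in tokens:
        tag = next((t for t, vocab in gazetteers.items()
                    if tok.lower() in vocab), "O")
        labels.append(tag)
    return labels

print(distant_labels("Amina visited Nairobi yesterday".split()))
# ['PER', 'O', 'LOC', 'O']
```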

FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations

1 code implementation • EMNLP 2021 • Lukas Lange, Heike Adel, Jannik Strötgen, Dietrich Klakow

Combining several embeddings typically improves performance in downstream tasks as different embeddings encode different information.

NER • POS • +4
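
A sketch of attention-based meta-embeddings in the spirit of the abstract above: project each embedding type to a shared size, then combine them with attention weights conditioned on extra input features. This omits FAME's adversarial training component, and all dimensions are assumptions.

```python
import torch
import torch.nn as nn

# Sketch: feature-conditioned attention over multiple embedding types,
# combined into one meta-embedding per token.

class MetaEmbedding(nn.Module):
    def __init__(self, input_dims, shared_dim=256, feature_dim=8):
        super().__init__()
        self.projs = nn.ModuleList(nn.Linear(d, shared_dim) for d in input_dims)
        self.attn = nn.Linear(shared_dim + feature_dim, 1)

    def forward(self, embeddings, features):
        # embeddings: list of (batch, seq, dim_i); features: (batch, seq, feature_dim)
        projected = torch.stack(
            [p(e) for p, e in zip(self.projs, embeddings)], dim=2
        )  # (batch, seq, n_embeddings, shared_dim)
        feats = features.unsqueeze(2).expand(-1, -1, projected.size(2), -1)
        scores = self.attn(torch.cat([projected, feats], dim=-1))
        weights = torch.softmax(scores, dim=2)  # attention over embedding types
        return (weights * projected).sum(dim=2)

me = MetaEmbedding([300, 768])
out = me([torch.randn(2, 5, 300), torch.randn(2, 5, 768)], torch.randn(2, 5, 8))
print(out.shape)  # torch.Size([2, 5, 256])
```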

NLNDE at CANTEMIST: Neural Sequence Labeling and Parsing Approaches for Clinical Concept Extraction

no code implementations • 23 Oct 2020 • Lukas Lange, Xiang Dai, Heike Adel, Jannik Strötgen

The recognition and normalization of clinical information, such as tumor morphology mentions, is an important, but complex process consisting of multiple subtasks.

Clinical Concept Extraction

NLNDE: The Neither-Language-Nor-Domain-Experts' Way of Spanish Medical Document De-Identification

no code implementations • 2 Jul 2020 • Lukas Lange, Heike Adel, Jannik Strötgen

Natural language processing has huge potential in the medical domain, which has recently led to a lot of research in this field.

De-identification

Closing the Gap: Joint De-Identification and Concept Extraction in the Clinical Domain

1 code implementation • ACL 2020 • Lukas Lange, Heike Adel, Jannik Strötgen

Exploiting natural language processing in the clinical domain requires de-identification, i.e., the anonymization of personal information in texts.

De-identification

On the Choice of Auxiliary Languages for Improved Sequence Tagging

no code implementations • WS 2020 • Lukas Lange, Heike Adel, Jannik Strötgen

Recent work showed that embeddings from related languages can improve the performance of sequence tagging, even for monolingual models.

Part-Of-Speech Tagging

Feature-Dependent Confusion Matrices for Low-Resource NER Labeling with Noisy Labels

1 code implementation • IJCNLP 2019 • Lukas Lange, Michael A. Hedderich, Dietrich Klakow

In low-resource settings, the performance of supervised labeling models can be improved with automatically annotated or distantly supervised data, which is cheap to create but often noisy.

Low Resource Named Entity Recognition • named-entity-recognition • +4
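
A minimal sketch of the noisy-channel idea behind this line of work: a clean-label classifier followed by a confusion matrix that maps clean label probabilities to noisy label probabilities, where making the matrix a function of input features gives the feature-dependent variant. All sizes are illustrative.

```python
import torch
import torch.nn as nn

# Sketch: model noisy labels as clean predictions passed through a
# feature-dependent confusion matrix (rows normalized to sum to 1).

num_labels, feat_dim = 5, 16

base = nn.Linear(feat_dim, num_labels)                 # clean-label classifier
noise = nn.Linear(feat_dim, num_labels * num_labels)   # feature-dependent matrix

x = torch.randn(4, feat_dim)
p_clean = torch.softmax(base(x), dim=-1)                       # (4, 5)
conf = torch.softmax(
    noise(x).view(-1, num_labels, num_labels), dim=-1          # rows sum to 1
)
p_noisy = torch.bmm(p_clean.unsqueeze(1), conf).squeeze(1)     # (4, 5)
print(p_noisy.sum(dim=-1))  # each row sums to ~1
```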
