Search Results for author: Rafet Sifa

Found 26 papers, 7 papers with code

Generating Prototypes for Contradiction Detection Using Large Language Models and Linguistic Rules

1 code implementation23 Oct 2023 Maren Pielka, Svetlana Schmidt, Rafet Sifa

We introduce a novel data generation method for contradiction detection, which leverages the generative power of large language models as well as linguistic rules.

Language Modelling Negation

Controlled Randomness Improves the Performance of Transformer Models

no code implementations20 Oct 2023 Tobias Deußer, Cong Zhao, Wolfgang Krämer, David Leonhard, Christian Bauckhage, Rafet Sifa

During the pre-training step of natural language models, the main objective is to learn a general representation of the pre-training dataset, usually requiring large amounts of textual data to capture the complexity and diversity of natural language.

named-entity-recognition Named Entity Recognition +2

Tokenizer Choice For LLM Training: Negligible or Crucial?

no code implementations12 Oct 2023 Mehdi Ali, Michael Fromm, Klaudia Thellmann, Richard Rutmann, Max Lübbering, Johannes Leveling, Katrin Klug, Jan Ebert, Niclas Doll, Jasper Schulze Buschhoff, Charvi Jain, Alexander Arno Weber, Lena Jurkschat, Hammam Abdelwahab, Chelsea John, Pedro Ortiz Suarez, Malte Ostendorff, Samuel Weinbach, Rafet Sifa, Stefan Kesselheim, Nicolas Flores-Herr

The recent success of Large Language Models (LLMs) has been predominantly driven by curating the training dataset composition, scaling of model architectures and dataset sizes and advancements in pretraining objectives, leaving tokenizer influence as a blind spot.

Improving Natural Language Inference in Arabic using Transformer Models and Linguistically Informed Pre-Training

1 code implementation27 Jul 2023 Mohammad Majd Saad Al Deen, Maren Pielka, Jörn Hees, Bouthaina Soulef Abdou, Rafet Sifa

This paper addresses the classification of Arabic text data in the field of Natural Language Processing (NLP), with a particular focus on Natural Language Inference (NLI) and Contradiction Detection (CD).

named-entity-recognition Named Entity Recognition +2

Word Sense Disambiguation as a Game of Neurosymbolic Darts

no code implementations25 Jul 2023 Tiansi Dong, Rafet Sifa

The core of our methodology is a neurosymbolic sense embedding, in terms of a configuration of nested balls in n-dimensional space.

Knowledge Graphs Natural Language Understanding +1

sustain.AI: a Recommender System to analyze Sustainability Reports

1 code implementation15 May 2023 Lars Hillebrand, Maren Pielka, David Leonhard, Tobias Deußer, Tim Dilmaghani, Bernd Kliem, Rüdiger Loitz, Milad Morad, Christian Temath, Thiago Bell, Robin Stenzel, Rafet Sifa

We present sustainAI, an intelligent, context-aware recommender system that assists auditors and financial investors as well as the general public to efficiently analyze companies' sustainability reports.

Multi-Label Classification Recommendation Systems

Towards automating Numerical Consistency Checks in Financial Reports

no code implementations11 Nov 2022 Lars Hillebrand, Tobias Deußer, Tim Dilmaghani, Bernd Kliem, Rüdiger Loitz, Christian Bauckhage, Rafet Sifa

It combines a financial named entity and relation extraction module with a BERT-based filtering and text pair classification component to extract KPIs from unstructured sentences before linking them to synonymous occurrences in the balance sheet and profit & loss statement.

Relation Extraction Text Pair Classification

Improving Chest X-Ray Classification by RNN-based Patient Monitoring

no code implementations28 Oct 2022 David Biesner, Helen Schneider, Benjamin Wulff, Ulrike Attenberger, Rafet Sifa

Chest X-Ray imaging is one of the most common radiological tools for detection of various pathologies related to the chest area and lung function.

Decision Making Image Classification

Zero-Shot Text Matching for Automated Auditing using Sentence Transformers

no code implementations28 Oct 2022 David Biesner, Maren Pielka, Rajkumar Ramamurthy, Tim Dilmaghani, Bernd Kliem, Rüdiger Loitz, Rafet Sifa

Natural language processing methods have several applications in automated auditing, including document or passage classification, information retrieval, and question answering.

Information Retrieval Question Answering +5

KPI-EDGAR: A Novel Dataset and Accompanying Metric for Relation Extraction from Financial Documents

1 code implementation17 Oct 2022 Tobias Deußer, Syed Musharraf Ali, Lars Hillebrand, Desiana Nurchalifah, Basil Jacob, Christian Bauckhage, Rafet Sifa

We introduce KPI-EDGAR, a novel dataset for Joint Named Entity Recognition and Relation Extraction building on financial reports uploaded to the Electronic Data Gathering, Analysis, and Retrieval (EDGAR) system, where the main objective is to extract Key Performance Indicators (KPIs) from financial documents and link them to their numerical values and other attributes.

Benchmarking Joint Entity and Relation Extraction +5

Gradient Flows for L2 Support Vector Machine Training

no code implementations8 Aug 2022 Christian Bauckhage, Helen Schneider, Benjamin Wulff, Rafet Sifa

We explore the merits of training of support vector machines for binary classification by means of solving systems of ordinary differential equations.

Binary Classification

KPI-BERT: A Joint Named Entity Recognition and Relation Extraction Model for Financial Reports

no code implementations3 Aug 2022 Lars Hillebrand, Tobias Deußer, Tim Dilmaghani, Bernd Kliem, Rüdiger Loitz, Christian Bauckhage, Rafet Sifa

We present KPI-BERT, a system which employs novel methods of named entity recognition (NER) and relation extraction (RE) to extract and link key performance indicators (KPIs), e. g. "revenue" or "interest expenses", of companies from real-world German financial documents.

named-entity-recognition Named Entity Recognition +4

DIBERT: Dependency Injected Bidirectional Encoder Representations from Transformers

1 code implementation IEEE SSCI 2021 Abdul Wahab, Rafet Sifa

Prior research in the area of Natural Language Processing (NLP) has shown that including the syntactic structure of a sentence using a dependency parse tree while training a representation learning model improves the performance on downstream tasks.

Language Modelling Masked Language Modeling +6

Generative Deep Learning Techniques for Password Generation

no code implementations10 Dec 2020 David Biesner, Kostadin Cvejoski, Bogdan Georgiev, Rafet Sifa, Erik Krupicka

Password guessing approaches via deep learning have recently been investigated with significant breakthroughs in their ability to generate novel, realistic password candidates.

Adiabatic Quantum Computing for Binary Clustering

no code implementations17 Jun 2017 Christian Bauckhage, Eduardo Brito, Kostadin Cvejoski, Cesar Ojeda, Rafet Sifa, Stefan Wrobel

Quantum computing for machine learning attracts increasing attention and recent technological developments suggest that especially adiabatic quantum computing may soon be of practical interest.

BIG-bench Machine Learning Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.