2 code implementations • 23 Nov 2024 • Gollam Rabby, Farhana Keya, Parvez Zamil, Sören Auer
Mathematical reasoning has proven to be a critical yet challenging task for large language models (LLMs), as they often struggle with complex multi-step problems.
no code implementations • 29 Oct 2024 • Parvez Zamil, Gollam Rabby, Md. Sadekur Rahman, Sören Auer
The growing volume of biomedical scholarly document abstracts presents an increasing challenge in efficiently retrieving accurate and relevant information.
2 code implementations • 27 Sep 2024 • Hamed Babaei Giglou, Jennifer D'Souza, Sören Auer
In response to the growing complexity and volume of scientific literature, this paper introduces the LLMs4Synthesis framework, designed to enhance the capabilities of Large Language Models (LLMs) in generating high-quality scientific syntheses.
no code implementations • 16 Sep 2024 • Hamed Babaei Giglou, Jennifer D'Souza, Sören Auer
This paper outlines the LLMs4OL 2024, the first edition of the Large Language Models for Ontology Learning Challenge.
no code implementations • 10 Sep 2024 • Gollam Rabby, Sören Auer, Jennifer D'Souza, Allard Oelen
Our method involves fine-tuning LLMs with CKG knowledge and additionally injecting knowledge from a CKG with a novel prompting technique significantly increasing the accuracy of scholarly knowledge extraction.
no code implementations • 3 Jul 2024 • Julia Evans, Jennifer D'Souza, Sören Auer
Our study explores how well the state-of-the-art Large Language Models (LLMs), like GPT-4 and Mistral, can assess the quality of scientific summaries or, more fittingly, scientific syntheses, comparing their evaluations to those of human annotators.
1 code implementation • 11 Jun 2024 • Hamed Babaei Giglou, Tilahun Abedissa Taffa, Rana Abdullah, Aida Usmanova, Ricardo Usbeck, Jennifer D'Souza, Sören Auer
This paper introduces a scholarly Question Answering (QA) system on top of the NFDI4DataScience Gateway, employing a Retrieval Augmented Generation-based (RAG) approach.
no code implementations • 6 Jun 2024 • Salomon Kabongo, Jennifer D'Souza, Sören Auer
The rapid advancements in Large Language Models (LLMs) have opened new avenues for automating complex tasks in AI research.
1 code implementation • 16 Apr 2024 • Hamed Babaei Giglou, Jennifer D'Souza, Felix Engel, Sören Auer
Ontology Matching (OM), is a critical task in knowledge integration, where aligning heterogeneous ontologies facilitates data interoperability and knowledge sharing.
no code implementations • 12 Apr 2024 • Raia Abu Ahmad, Jennifer D'Souza, Matthäus Zloch, Wolfgang Otto, Georg Rehm, Allard Oelen, Stefan Dietze, Sören Auer
We design a specific application of the ORKG-Dataset semantic model based on 40 diverse research datasets on scientific information extraction.
no code implementations • 18 Jan 2024 • Mahsa Shamsabadi, Jennifer D'Souza, Sören Auer
In this paper, we champion the use of structured and semantic content representation of discourse-based scholarly communication, inspired by tools like Wikipedia infoboxes or structured Amazon product descriptions.
1 code implementation • 31 Jul 2023 • Hamed Babaei Giglou, Jennifer D'Souza, Sören Auer
LLMs have shown significant advancements in natural language processing, demonstrating their ability to capture complex language patterns in different knowledge domains.
1 code implementation • 22 May 2023 • Jennifer D'Souza, Moussab Hrou, Sören Auer
There have been many recent investigations into prompt-based training of transformer language models for new text genres in low-resource settings.
1 code implementation • 10 May 2023 • Salomon Kabongo, Jennifer D'Souza, Sören Auer
Furthermore, the system is integrated with the Open Research Knowledge Graph (ORKG) platform, which fosters the machine-actionable publishing of scholarly findings.
no code implementations • 3 May 2023 • Ming Jiang, Jennifer D'Souza, Sören Auer, J. Stephen Downie
To address these limitations, we started by creating OCR-noisy texts based on three clean corpora.
no code implementations • 29 Mar 2023 • Salomon Kabongo, Jennifer D'Souza, Sören Auer
We present a large-scale empirical investigation of the zero-shot learning phenomena in a specific recognizing textual entailment (RTE) task category, i. e. the automated mining of leaderboards for Empirical AI Research.
1 code implementation • 27 Mar 2023 • Fajar J. Ekaputra, Majlinda Llugiqi, Marta Sabou, Andreas Ekelhart, Heiko Paulheim, Anna Breit, Artem Revenko, Laura Waltersdorfer, Kheir Eddine Farfar, Sören Auer
In line with the general trend in artificial intelligence research to create intelligent systems that combine learning and symbolic components, a new sub-area has emerged that focuses on combining machine learning (ML) components with techniques developed by the Semantic Web (SW) community - Semantic Web Machine Learning (SWeML for short).
no code implementations • 11 Dec 2022 • Mohamad Yaser Jaradeh, Markus Stocker, Sören Auer
Information extraction from scholarly articles is a challenging task due to the sizable document length and implicit information hidden in text, figures, and citations.
no code implementations • 5 Oct 2022 • Omar Arab Oghli, Jennifer D'Souza, Sören Auer
When semantically describing knowledge graphs (KGs), users have to make a critical choice of a vocabulary (i. e. predicates and resources).
1 code implementation • 3 Jun 2022 • Mohamad Yaser Jaradeh, Kuldeep Singh, Markus Stocker, Sören Auer
Information Extraction (IE) tasks are commonly studied topics in various domains of research.
1 code implementation • 28 Mar 2022 • Jennifer D'Souza, Sören Auer
Domain-specific named entity recognition (NER) on Computer Science (CS) scholarly articles is an information extraction task that is arguably more challenging for the various annotation aims that can beset the task and has been less studied than NER in the general domain.
no code implementations • 28 Mar 2022 • Jennifer D'Souza, Anita Monteverdi, Muhammad Haris, Marco Anteghini, Kheir Eddine Farfar, Markus Stocker, Vitor A. P. Martins dos Santos, Sören Auer
For this in turn, there is a strong need for AI tools designed for scientists that permit easy and accurate semantification of their scholarly contributions.
no code implementations • 30 Nov 2021 • Marco Anteghini, Jennifer D'Souza, Vitor A. P. Martins dos Santos, Sören Auer
Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries.
no code implementations • 23 Nov 2021 • Mohamad Yaser Jaradeh, Kuldeep Singh, Markus Stocker, Sören Auer
Scholarly Knowledge Graphs (KGs) provide a rich source of structured information representing knowledge encoded in scientific publications.
1 code implementation • 31 Aug 2021 • Salomon Kabongo, Jennifer D'Souza, Sören Auer
In this regard, the Leaderboards facet of information organization provides an overview on the state-of-the-art by aggregating empirical results from various studies addressing the same research challenge.
1 code implementation • 10 Jun 2021 • Jennifer D'Souza, Sören Auer, Ted Pedersen
Being the first-of-its-kind in the SemEval series, the task released structured data from NLP scholarly articles at three levels of information granularity, i. e. at sentence-level, phrase-level, and phrases organized as triples toward Knowledge Graph (KG) building.
no code implementations • 22 Feb 2021 • Mohamad Yaser Jaradeh, Kuldeep Singh, Markus Stocker, Andreas Both, Sören Auer
In the last decade, a large number of Knowledge Graph (KG) information extraction approaches were proposed.
no code implementations • 11 Feb 2021 • Arthur Brack, Anett Hoppe, Markus Stocker, Sören Auer, Ralph Ewerth
Current science communication has a number of drawbacks and bottlenecks which have been subject of discussion lately: Among others, the rising number of published articles makes it nearly impossible to get a full overview of the state of the art in a certain field, or reproducibility is hampered by fixed-length, document-based publications which normally cannot cover all details of a research work.
no code implementations • 19 Jan 2021 • Mohammadreza Tavakoli, Mirette Elias, Gábor Kismihók, Sören Auer
Open Educational Resources (OERs) are openly licensed educational materials that are widely used for learning.
Computers and Society
no code implementations • 9 Oct 2020 • Jennifer D'Souza, Sören Auer
To this end, specifically, care was taken in the adjudication annotation stage to reduce annotation noise while formulating the guidelines for our proposed novel NLP contributions structuring and graphing scheme.
1 code implementation • 16 Sep 2020 • Marco Anteghini, Jennifer D'Souza, Vitor A. P. Martins dos Santos, Sören Auer
As a novel contribution to the problem of semantifying biological assays, in this paper, we propose a neural-network-based approach to automatically semantify, thereby structure, unstructured bioassay text descriptions.
1 code implementation • 23 Jun 2020 • Jennifer D'Souza, Sören Auer
We describe an annotation initiative to capture the scholarly contributions in natural language processing (NLP) articles, particularly, for the articles that discuss machine learning (ML) approaches for various information extraction tasks.
no code implementations • 2 Jun 2020 • Mohamad Yaser Jaradeh, Markus Stocker, Sören Auer
Our system can retrieve direct answers to a variety of different questions asked on tabular data in articles.
no code implementations • 20 May 2020 • Arthur Brack, Anett Hoppe, Markus Stocker, Sören Auer, Ralph Ewerth
Current science communication has a number of drawbacks and bottlenecks which have been subject of discussion lately: Among others, the rising number of published articles makes it nearly impossible to get an overview of the state of the art in a certain field, or reproducibility is hampered by fixed-length, document-based publications which normally cannot cover all details of a research work.
no code implementations • 13 Apr 2020 • Ming Jiang, Jennifer D'Souza, Sören Auer, J. Stephen Downie
With the rapid growth of research publications, there is a vast amount of scholarly knowledge that needs to be organized in digital libraries.
no code implementations • LREC 2020 • Jennifer D'Souza, Anett Hoppe, Arthur Brack, Mohamad Yaser Jaradeh, Sören Auer, Ralph Ewerth
We introduce the STEM (Science, Technology, Engineering, and Medicine) Dataset for Scientific Entity Extraction, Classification, and Resolution, version 1. 0 (STEM-ECR v1. 0).
1 code implementation • Accepted for publishing in 42nd European Conference on IR Research, ECIR 2020 2020 • Arthur Brack, Jennifer D'Souza, Anett Hoppe, Sören Auer, Ralph Ewerth
We examine the novel task of domain-independent scientific concept extraction from abstracts of scholarly articles and present two contributions.
Ranked #1 on Scientific Concept Extraction on STM-corpus
no code implementations • 7 Oct 2019 • Mohamed Nadjib Mami, Damien Graux, Harsh Thakkar, Simon Scerri, Sören Auer, Jens Lehmann
In particular, we study which query language is a most suitable candidate for that 'universal' query language.
no code implementations • 30 Jan 2019 • Mohamad Yaser Jaradeh, Allard Oelen, Kheir Eddine Farfar, Manuel Prinz, Jennifer D'Souza, Gábor Kismihók, Markus Stocker, Sören Auer
In this paper, we present the first steps towards a knowledge graph based infrastructure that acquires scholarly knowledge in machine actionable form thus enabling new possibilities for scholarly knowledge curation, publication and processing.
no code implementations • 23 May 2017 • Ashwini Jaya Kumar, Sören Auer, Christoph Schmidt, Joachim köhler
Applications which use human speech as an input require a speech interface with high recognition accuracy.
no code implementations • 22 May 2017 • Ashwini Jaya Kumar, Camilo Morales, Maria-Esther Vidal, Christoph Schmidt, Sören Auer
In this paper, we have tried to see the semantic relatedness between the words in a sentence to rescore the N-best list.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
no code implementations • 11 Jan 2016 • Lavdim Halilaj, Irlán Grangel-González, Gökhan Coskun, Sören Auer
Collaborative vocabulary development in the context of data integration is the process of finding consensus between the experts of the different systems and domains.