no code implementations • 10 Sep 2024 • Gollam Rabby, Sören Auer, Jennifer D'Souza, Allard Oelen
Our method involves fine-tuning LLMs with CKG knowledge and additionally injecting knowledge from a CKG with a novel prompting technique significantly increasing the accuracy of scholarly knowledge extraction.
no code implementations • 19 Aug 2024 • Salomon Kabongo, Jennifer D'Souza
This study demonstrates the application of instruction finetuning of pretrained Large Language Models (LLMs) to automate the generation of AI research leaderboards, extracting (Task, Dataset, Metric, Score) quadruples from articles.
no code implementations • 3 Jul 2024 • Julia Evans, Jennifer D'Souza, Sören Auer
Our study explores how well the state-of-the-art Large Language Models (LLMs), like GPT-4 and Mistral, can assess the quality of scientific summaries or, more fittingly, scientific syntheses, comparing their evaluations to those of human annotators.
1 code implementation • 11 Jun 2024 • Hamed Babaei Giglou, Tilahun Abedissa Taffa, Rana Abdullah, Aida Usmanova, Ricardo Usbeck, Jennifer D'Souza, Sören Auer
This paper introduces a scholarly Question Answering (QA) system on top of the NFDI4DataScience Gateway, employing a Retrieval Augmented Generation-based (RAG) approach.
no code implementations • 6 Jun 2024 • Salomon Kabongo, Jennifer D'Souza, Sören Auer
The rapid advancements in Large Language Models (LLMs) have opened new avenues for automating complex tasks in AI research.
no code implementations • 23 May 2024 • Mahsa Shamsabadi, Jennifer D'Souza
This demo will present the Research Assistant (RA) tool developed to assist with six main types of research tasks defined as standardized instruction templates, instantiated with user input, applied finally as prompts to well-known--for their sophisticated natural language processing abilities--AI tools, such as ChatGPT (https://chat. openai. com/) and Gemini (https://gemini. google. com/app).
no code implementations • 4 May 2024 • Julia Evans, Sameer Sadruddin, Jennifer D'Souza
In this study, we address one of the challenges of developing NER models for scholarly domains, namely the scarcity of suitable labeled data.
no code implementations • 3 May 2024 • Vladyslav Nechakhin, Jennifer D'Souza, Steffen Eger
Current methods, such as those used by the Open Research Knowledge Graph (ORKG), involve manually curating properties to describe research papers' contributions in a structured manner, but this is labor-intensive and inconsistent between the domain expert human curators.
1 code implementation • 16 Apr 2024 • Hamed Babaei Giglou, Jennifer D'Souza, Felix Engel, Sören Auer
Ontology Matching (OM), is a critical task in knowledge integration, where aligning heterogeneous ontologies facilitates data interoperability and knowledge sharing.
no code implementations • 12 Apr 2024 • Raia Abu Ahmad, Jennifer D'Souza, Matthäus Zloch, Wolfgang Otto, Georg Rehm, Allard Oelen, Stefan Dietze, Sören Auer
We design a specific application of the ORKG-Dataset semantic model based on 40 diverse research datasets on scientific information extraction.
no code implementations • 22 Feb 2024 • Mahsa Shamsabadi, Jennifer D'Souza
This short paper highlights the growing importance of information retrieval (IR) engines in the scientific community, addressing the inefficiency of traditional keyword-based search engines due to the rising volume of publications.
no code implementations • 18 Jan 2024 • Mahsa Shamsabadi, Jennifer D'Souza, Sören Auer
In this paper, we champion the use of structured and semantic content representation of discourse-based scholarly communication, inspired by tools like Wikipedia infoboxes or structured Amazon product descriptions.
no code implementations • 10 Oct 2023 • Swathi Anil, Jennifer D'Souza
Noninvasive brain stimulation (NIBS) encompasses transcranial stimulation techniques that can influence brain excitability.
1 code implementation • 5 Oct 2023 • Anisa Rula, Jennifer D'Souza
Recent advancements in the field of Natural Language Processing, particularly the development of large-scale language models that are pretrained on vast amounts of knowledge, are creating novel opportunities within the realm of Knowledge Engineering.
1 code implementation • 31 Jul 2023 • Hamed Babaei Giglou, Jennifer D'Souza, Sören Auer
LLMs have shown significant advancements in natural language processing, demonstrating their ability to capture complex language patterns in different knowledge domains.
1 code implementation • 22 May 2023 • Jennifer D'Souza, Moussab Hrou, Sören Auer
There have been many recent investigations into prompt-based training of transformer language models for new text genres in low-resource settings.
1 code implementation • 10 May 2023 • Salomon Kabongo, Jennifer D'Souza, Sören Auer
Furthermore, the system is integrated with the Open Research Knowledge Graph (ORKG) platform, which fosters the machine-actionable publishing of scholarly findings.
no code implementations • 3 May 2023 • Ming Jiang, Jennifer D'Souza, Sören Auer, J. Stephen Downie
To address these limitations, we started by creating OCR-noisy texts based on three clean corpora.
no code implementations • 29 Mar 2023 • Salomon Kabongo, Jennifer D'Souza, Sören Auer
We present a large-scale empirical investigation of the zero-shot learning phenomena in a specific recognizing textual entailment (RTE) task category, i. e. the automated mining of leaderboards for Empirical AI Research.
no code implementations • 5 Oct 2022 • Omar Arab Oghli, Jennifer D'Souza, Sören Auer
When semantically describing knowledge graphs (KGs), users have to make a critical choice of a vocabulary (i. e. predicates and resources).
no code implementations • 24 May 2022 • Jennifer D'Souza
The STEM-NER-60k corpus, created in this work, comprises over 1M extracted entities from 60k STEM articles obtained from a major publishing platform and is publicly released https://github. com/jd-coderepos/stem-ner-60k.
1 code implementation • 28 Mar 2022 • Jennifer D'Souza, Sören Auer
Domain-specific named entity recognition (NER) on Computer Science (CS) scholarly articles is an information extraction task that is arguably more challenging for the various annotation aims that can beset the task and has been less studied than NER in the general domain.
no code implementations • 28 Mar 2022 • Jennifer D'Souza, Anita Monteverdi, Muhammad Haris, Marco Anteghini, Kheir Eddine Farfar, Markus Stocker, Vitor A. P. Martins dos Santos, Sören Auer
For this in turn, there is a strong need for AI tools designed for scientists that permit easy and accurate semantification of their scholarly contributions.
no code implementations • 30 Nov 2021 • Marco Anteghini, Jennifer D'Souza, Vitor A. P. Martins dos Santos, Sören Auer
Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries.
no code implementations • 18 Oct 2021 • Jennifer D'Souza, Isaiah Onando Mulang', Soeren Auer
In multiple-choice exams, students select one answer from among typically four choices and can explain why they made that particular choice.
1 code implementation • 1 Sep 2021 • Jennifer D'Souza, Soeren Auer
We describe a rule-based approach for the automatic acquisition of salient scientific entities from Computational Linguistics (CL) scholarly article titles.
1 code implementation • 31 Aug 2021 • Salomon Kabongo, Jennifer D'Souza, Sören Auer
In this regard, the Leaderboards facet of information organization provides an overview on the state-of-the-art by aggregating empirical results from various studies addressing the same research challenge.
1 code implementation • 10 Jun 2021 • Jennifer D'Souza, Sören Auer, Ted Pedersen
Being the first-of-its-kind in the SemEval series, the task released structured data from NLP scholarly articles at three levels of information granularity, i. e. at sentence-level, phrase-level, and phrases organized as triples toward Knowledge Graph (KG) building.
no code implementations • 9 Oct 2020 • Jennifer D'Souza, Sören Auer
To this end, specifically, care was taken in the adjudication annotation stage to reduce annotation noise while formulating the guidelines for our proposed novel NLP contributions structuring and graphing scheme.
1 code implementation • 16 Sep 2020 • Marco Anteghini, Jennifer D'Souza, Vitor A. P. Martins dos Santos, Sören Auer
As a novel contribution to the problem of semantifying biological assays, in this paper, we propose a neural-network-based approach to automatically semantify, thereby structure, unstructured bioassay text descriptions.
1 code implementation • 23 Jun 2020 • Jennifer D'Souza, Sören Auer
We describe an annotation initiative to capture the scholarly contributions in natural language processing (NLP) articles, particularly, for the articles that discuss machine learning (ML) approaches for various information extraction tasks.
no code implementations • 13 Apr 2020 • Ming Jiang, Jennifer D'Souza, Sören Auer, J. Stephen Downie
With the rapid growth of research publications, there is a vast amount of scholarly knowledge that needs to be organized in digital libraries.
no code implementations • LREC 2020 • Jennifer D'Souza, Anett Hoppe, Arthur Brack, Mohamad Yaser Jaradeh, Sören Auer, Ralph Ewerth
We introduce the STEM (Science, Technology, Engineering, and Medicine) Dataset for Scientific Entity Extraction, Classification, and Resolution, version 1. 0 (STEM-ECR v1. 0).
1 code implementation • Accepted for publishing in 42nd European Conference on IR Research, ECIR 2020 2020 • Arthur Brack, Jennifer D'Souza, Anett Hoppe, Sören Auer, Ralph Ewerth
We examine the novel task of domain-independent scientific concept extraction from abstracts of scholarly articles and present two contributions.
Ranked #1 on Scientific Concept Extraction on STM-corpus
no code implementations • 30 Jan 2019 • Mohamad Yaser Jaradeh, Allard Oelen, Kheir Eddine Farfar, Manuel Prinz, Jennifer D'Souza, Gábor Kismihók, Markus Stocker, Sören Auer
In this paper, we present the first steps towards a knowledge graph based infrastructure that acquires scholarly knowledge in machine actionable form thus enabling new possibilities for scholarly knowledge curation, publication and processing.