no code implementations • ACL 2022 • Samuel Broscheit, Quynh Do, Judith Gaspers
Experiments show that a state-of-the-art BERT-based model suffers performance loss under this drift.
no code implementations • EMNLP 2021 • Harsh Gupta, Luciano del Corro, Samuel Broscheit, Johannes Hoffart, Eliot Brenner
We investigate post-OCR correction in a setting where we have access to different OCR views of the same document.
1 code implementation • 8 Jul 2022 • Fabio Petroni, Samuel Broscheit, Aleksandra Piktus, Patrick Lewis, Gautier Izacard, Lucas Hosseini, Jane Dwivedi-Yu, Maria Lomeli, Timo Schick, Pierre-Emmanuel Mazaré, Armand Joulin, Edouard Grave, Sebastian Riedel
Hence, maintaining and improving the quality of Wikipedia references is an important challenge and there is a pressing need for better tools to assist humans in this effort.
2 code implementations • 18 Dec 2021 • Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Dmytro Okhonko, Samuel Broscheit, Gautier Izacard, Patrick Lewis, Barlas Oğuz, Edouard Grave, Wen-tau Yih, Sebastian Riedel
In order to address increasing demands of real-world applications, the research for knowledge-intensive NLP (KI-NLP) should advance by capturing the challenges of a truly open-domain environment: web-scale knowledge, lack of structure, inconsistent quality and noise.
1 code implementation • EMNLP 2020 • Samuel Broscheit, Daniel Ruffinelli, Adrian Kochsiek, Patrick Betz, Rainer Gemulla
LibKGE ( https://github. com/uma-pi1/kge ) is an open-source PyTorch-based library for training, hyperparameter optimization, and evaluation of knowledge graph embedding models for link prediction.
1 code implementation • ACL 2020 • Samuel Broscheit, Kiril Gashteovski, Yanjie Wang, Rainer Gemulla
An evaluation in such a setup raises the question if a correct prediction is actually a new fact that was induced by reasoning over the open knowledge graph or if it can be trivially explained.
2 code implementations • ICLR 2020 • Daniel Ruffinelli, Samuel Broscheit, Rainer Gemulla
A vast number of KGE techniques for multi-relational link prediction have been proposed in the recent literature, often with state-of-the-art performance.
1 code implementation • CONLL 2019 • Samuel Broscheit
We show on an entity linking benchmark that (i) this model improves the entity representations over plain BERT, (ii) that it outperforms entity linking architectures that optimize the tasks separately and (iii) that it only comes second to the current state-of-the-art that does mention detection and entity disambiguation jointly.
Ranked #9 on
Entity Linking
on AIDA-CoNLL
(using extra training data)
3 code implementations • AKBC 2019 • Kiril Gashteovski, Sebastian Wanner, Sven Hertling, Samuel Broscheit, Rainer Gemulla
In this paper, we release, describe, and analyze an OIE corpus called OPIEC, which was extracted from the text of English Wikipedia.
no code implementations • 3 Feb 2019 • Yanjie Wang, Samuel Broscheit, Rainer Gemulla
We propose the Relational Tucker3 (RT) decomposition for multi-relational link prediction in knowledge graphs.
no code implementations • WS 2019 • Yanjie Wang, Daniel Ruffinelli, Rainer Gemulla, Samuel Broscheit, Christian Meilicke
In this paper, we explore whether recent models work well for knowledge base completion and argue that the current evaluation protocols are more suited for question answering rather than knowledge base completion.
no code implementations • WS 2018 • Samuel Broscheit
In summary, we show that it is possible to achieve a \textit{text representation} only from pixels.
no code implementations • WS 2018 • Jonas Pfeiffer, Samuel Broscheit, Rainer Gemulla, Mathias G{\"o}schl
In this study, we investigate learning-to-rank and query refinement approaches for information retrieval in the pharmacogenomic domain.