no code implementations • LREC 2022 • Benjamin Hättasch, Carsten Binnig
In this paper, we present a new corpus of clickbait articles annotated by university students along with a corresponding shared task: clickbait articles use a headline or teaser that hides information from the reader to make them curious to open the article.
no code implementations • 28 Aug 2024 • Yannis Chronis, Yawen Wang, Yu Gan, Sami Abu-El-Haija, Chelsea Lin, Carsten Binnig, Fatma Özcan
In contrast to other initial benchmarks, our benchmark is much more diverse and can be used for training and testing learned models systematically.
1 code implementation • 13 Mar 2024 • Roman Heinrich, Carsten Binnig, Harald Kornmayer, Manisha Luthra
In this work, we present COSTREAM, a novel learned cost model for Distributed Stream Processing Systems that provides accurate predictions of the execution costs of a streaming query in an edge-cloud environment.
no code implementations • 20 Oct 2023 • Benjamin Hilprecht, Kristian Kersting, Carsten Binnig
While there has been extensive work on deep neural networks for images and text, deep learning for relational databases (RDBs) is still a rather unexplored field.
no code implementations • 24 May 2023 • Liane Vogel, Benjamin Hilprecht, Carsten Binnig
However, existing approaches only learn a representation from a single table, and thus ignore the potential to learn from the full structure of relational databases, including neighboring tables that can contain important information for a contextualized representation.
no code implementations • 26 Apr 2023 • Matthias Urban, Carsten Binnig
In this paper, we propose Multi-Modal Databases (MMDBs), which is a new class of database systems that can seamlessly query text and tables using SQL.
no code implementations • 4 Jul 2022 • Benjamin Hilprecht, Christian Hammacher, Eduardo Reis, Mohamed Abdelaal, Carsten Binnig
In this paper, we present our vision of differentiable ML pipelines called DiffML to automate the construction of ML pipelines in an end-to-end fashion.
no code implementations • 26 Mar 2022 • Marius Gassen, Benjamin Hättasch, Benjamin Hilprecht, Nadja Geisler, Alexander Fraser, Carsten Binnig
However, developing a conversational agent (i. e., a chatbot-like interface) to allow end-users to interact with an application using natural language requires both immense amounts of training data and NLP expertise.
no code implementations • 9 Mar 2022 • Benjamin Hättasch, Jan-Micha Bodensohn, Carsten Binnig
In this paper, we propose a new system called ASET that allows users to perform structured explorations of text collections in an ad-hoc manner.
no code implementations • 8 Mar 2022 • Benjamin Hättasch, Michael Truong-Ngoc, Andreas Schmidt, Carsten Binnig
Since data is often stored in different sources, it needs to be integrated to gather a global view that is required in order to create value and derive knowledge from it.
1 code implementation • 3 Jan 2022 • Benjamin Hilprecht, Carsten Binnig
In this paper, we introduce zero-shot cost models which enable learned cost estimation that generalizes to unseen databases.
no code implementations • 3 May 2021 • Benjamin Hilprecht, Carsten Binnig
In this paper, we present our vision of so called zero-shot learning for databases which is a new learning approach for database components.
no code implementations • 4 Sep 2020 • Tiemo Bang, Norman May, Ilia Petrov, Carsten Binnig
In this paper, we propose a radical new approach for scale-out distributed DBMSs.
no code implementations • LREC 2020 • Benjamin H{\"a}ttasch, Nadja Geisler, Christian M. Meyer, Carsten Binnig
Large state-of-the-art corpora for training neural networks to create abstractive summaries are mostly limited to the news genre, as it is expensive to acquire human-written summaries for other types of text at a large scale.
1 code implementation • 2 Sep 2019 • Benjamin Hilprecht, Andreas Schmidt, Moritz Kulessa, Alejandro Molina, Kristian Kersting, Carsten Binnig
The typical approach for learned DBMS components is to capture the behavior by running a representative set of queries and use the observations to train a machine learning model.
Databases
no code implementations • 19 Dec 2018 • Cagatay Turkay, Nicola Pezzotti, Carsten Binnig, Hendrik Strobelt, Barbara Hammer, Daniel A. Keim, Jean-Daniel Fekete, Themis Palpanas, Yunhai Wang, Florin Rusu
We discuss these challenges and outline first steps towards progressiveness, which, we argue, will ultimately help to significantly speed-up the overall data science process.
no code implementations • 15 Nov 2018 • Moritz Kulessa, Alejandro Molina, Carsten Binnig, Benjamin Hilprecht, Kristian Kersting
However, classical AQP approaches suffer from various problems that limit the applicability to support the ad-hoc exploration of a new data set: (1) Classical AQP approaches that perform online sampling can support ad-hoc exploration queries but yield low quality if executed over rare subpopulations.
1 code implementation • 7 Apr 2018 • Philipp Eichmann, Carsten Binnig, Tim Kraska, Emanuel Zgraggen
Existing benchmarks for analytical database systems such as TPC-DS and TPC-H are designed for static reporting scenarios.
Databases
2 code implementations • 2 Apr 2018 • Prasetya Utama, Nathaniel Weir, Fuat Basik, Carsten Binnig, Ugur Cetintemel, Benjamin Hättasch, Amir Ilkhechi, Shekar Ramaswamy, Arif Usta
The ability to extract insights from new data sets is critical for decision making.
no code implementations • 30 Jan 2018 • Alex Galakatos, Michael Markovitch, Carsten Binnig, Rodrigo Fonseca, Tim Kraska
At the core of our index is a tunable error parameter that allows a DBA to balance lookup performance and space consumption.
Databases