Search Results for author: Carsten Binnig

Found 19 papers, 4 papers with code

Know Better – A Clickbait Resolving Challenge

no code implementations LREC 2022 Benjamin Hättasch, Carsten Binnig

In this paper, we present a new corpus of clickbait articles annotated by university students along with a corresponding shared task: clickbait articles use a headline or teaser that hides information from the reader to make them curious to open the article.

Clickbait Detection Question Answering

COSTREAM: Learned Cost Models for Operator Placement in Edge-Cloud Environments

1 code implementation13 Mar 2024 Roman Heinrich, Carsten Binnig, Harald Kornmayer, Manisha Luthra

In this work, we present COSTREAM, a novel learned cost model for Distributed Stream Processing Systems that provides accurate predictions of the execution costs of a streaming query in an edge-cloud environment.

SPARE: A Single-Pass Neural Model for Relational Databases

no code implementations20 Oct 2023 Benjamin Hilprecht, Kristian Kersting, Carsten Binnig

While there has been extensive work on deep neural networks for images and text, deep learning for relational databases (RDBs) is still a rather unexplored field.

Towards Foundation Models for Relational Databases [Vision Paper]

no code implementations24 May 2023 Liane Vogel, Benjamin Hilprecht, Carsten Binnig

However, existing approaches only learn a representation from a single table, and thus ignore the potential to learn from the full structure of relational databases, including neighboring tables that can contain important information for a contextualized representation.

Representation Learning

Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables

no code implementations26 Apr 2023 Matthias Urban, Carsten Binnig

In this paper, we propose Multi-Modal Databases (MMDBs), which is a new class of database systems that can seamlessly query text and tables using SQL.

DiffML: End-to-end Differentiable ML Pipelines

no code implementations4 Jul 2022 Benjamin Hilprecht, Christian Hammacher, Eduardo Reis, Mohamed Abdelaal, Carsten Binnig

In this paper, we present our vision of differentiable ML pipelines called DiffML to automate the construction of ML pipelines in an end-to-end fashion.

feature selection

Demonstrating CAT: Synthesizing Data-Aware Conversational Agents for Transactional Databases

no code implementations26 Mar 2022 Marius Gassen, Benjamin Hättasch, Benjamin Hilprecht, Nadja Geisler, Alexander Fraser, Carsten Binnig

However, developing a conversational agent (i. e., a chatbot-like interface) to allow end-users to interact with an application using natural language requires both immense amounts of training data and NLP expertise.

Chatbot

ASET: Ad-hoc Structured Exploration of Text Collections [Extended Abstract]

no code implementations9 Mar 2022 Benjamin Hättasch, Jan-Micha Bodensohn, Carsten Binnig

In this paper, we propose a new system called ASET that allows users to perform structured explorations of text collections in an ad-hoc manner.

It's AI Match: A Two-Step Approach for Schema Matching Using Embeddings

no code implementations8 Mar 2022 Benjamin Hättasch, Michael Truong-Ngoc, Andreas Schmidt, Carsten Binnig

Since data is often stored in different sources, it needs to be integrated to gather a global view that is required in order to create value and derive knowledge from it.

Attribute Data Integration

Zero-Shot Cost Models for Out-of-the-box Learned Cost Prediction

no code implementations3 Jan 2022 Benjamin Hilprecht, Carsten Binnig

In this paper, we introduce zero-shot cost models which enable learned cost estimation that generalizes to unseen databases.

One Model to Rule them All: Towards Zero-Shot Learning for Databases

no code implementations3 May 2021 Benjamin Hilprecht, Carsten Binnig

In this paper, we present our vision of so called zero-shot learning for databases which is a new learning approach for database components.

Few-Shot Learning Transfer Learning +1

AnyDB: An Architecture-less DBMS for Any Workload

no code implementations4 Sep 2020 Tiemo Bang, Norman May, Ilia Petrov, Carsten Binnig

In this paper, we propose a radical new approach for scale-out distributed DBMSs.

Summarization Beyond News: The Automatically Acquired Fandom Corpora

no code implementations LREC 2020 Benjamin H{\"a}ttasch, Nadja Geisler, Christian M. Meyer, Carsten Binnig

Large state-of-the-art corpora for training neural networks to create abstractive summaries are mostly limited to the news genre, as it is expensive to acquire human-written summaries for other types of text at a large scale.

Abstractive Text Summarization Descriptive

DeepDB: Learn from Data, not from Queries!

1 code implementation2 Sep 2019 Benjamin Hilprecht, Andreas Schmidt, Moritz Kulessa, Alejandro Molina, Kristian Kersting, Carsten Binnig

The typical approach for learned DBMS components is to capture the behavior by running a representative set of queries and use the observations to train a machine learning model.

Databases

Progressive Data Science: Potential and Challenges

no code implementations19 Dec 2018 Cagatay Turkay, Nicola Pezzotti, Carsten Binnig, Hendrik Strobelt, Barbara Hammer, Daniel A. Keim, Jean-Daniel Fekete, Themis Palpanas, Yunhai Wang, Florin Rusu

We discuss these challenges and outline first steps towards progressiveness, which, we argue, will ultimately help to significantly speed-up the overall data science process.

Model-based Approximate Query Processing

no code implementations15 Nov 2018 Moritz Kulessa, Alejandro Molina, Carsten Binnig, Benjamin Hilprecht, Kristian Kersting

However, classical AQP approaches suffer from various problems that limit the applicability to support the ad-hoc exploration of a new data set: (1) Classical AQP approaches that perform online sampling can support ad-hoc exploration queries but yield low quality if executed over rare subpopulations.

IDEBench: A Benchmark for Interactive Data Exploration

1 code implementation7 Apr 2018 Philipp Eichmann, Carsten Binnig, Tim Kraska, Emanuel Zgraggen

Existing benchmarks for analytical database systems such as TPC-DS and TPC-H are designed for static reporting scenarios.

Databases

A-Tree: A Bounded Approximate Index Structure

no code implementations30 Jan 2018 Alex Galakatos, Michael Markovitch, Carsten Binnig, Rodrigo Fonseca, Tim Kraska

At the core of our index is a tunable error parameter that allows a DBA to balance lookup performance and space consumption.

Databases

Cannot find the paper you are looking for? You can Submit a new open access paper.