Search Results for author: Ryan Marcus

Found 15 papers, 7 papers with code

Kepler: Robust Learning for Faster Parametric Query Optimization

no code implementations11 Jun 2023 Lyric Doshi, Vincent Zhuang, Gaurav Jain, Ryan Marcus, Haoyu Huang, Deniz Altınbüken, Eugene Brevdo, Campbell Fraser

We propose Kepler, an end-to-end learning-based approach to PQO that demonstrates significant speedups in query latency over a traditional query optimizer.

LSI: A Learned Secondary Index Structure

1 code implementation11 May 2022 Andreas Kipf, Dominik Horn, Pascal Pfeil, Ryan Marcus, Tim Kraska

LSI works by building a learned index over a permutation vector, which allows binary search to performed on the unsorted base data using random access.

Towards Practical Learned Indexing

1 code implementation11 Aug 2021 Mihail Stoian, Andreas Kipf, Ryan Marcus, Tim Kraska

Latest research proposes to replace existing index structures with learned models.

A Skew-Sensitive Evaluation Framework for Imbalanced Data Classification

1 code implementation12 Oct 2020 Min Du, Nesime Tatbul, Brian Rivers, Akhilesh Kumar Gupta, Lucas Hu, Wei Wang, Ryan Marcus, Shengtian Zhou, Insup Lee, Justin Gottschlich

Class distribution skews in imbalanced datasets may lead to models with prediction bias towards majority classes, making fair assessment of classifiers a challenging task.

Classification General Classification

Buffer Pool Aware Query Scheduling via Deep Reinforcement Learning

no code implementations21 Jul 2020 Chi Zhang, Ryan Marcus, Anat Kleiman, Olga Papaemmanouil

In this extended abstract, we propose a new technique for query scheduling with the explicit goal of reducing disk reads and thus implicitly increasing query performance.

reinforcement-learning Reinforcement Learning (RL) +1

RadixSpline: A Single-Pass Learned Index

no code implementations30 Apr 2020 Andreas Kipf, Ryan Marcus, Alexander van Renen, Mihail Stoian, Alfons Kemper, Tim Kraska, Thomas Neumann

Recent research has shown that learned models can outperform state-of-the-art index structures in size and lookup performance.

Context-Aware Parse Trees

no code implementations24 Mar 2020 Fangke Ye, Shengtian Zhou, Anand Venkat, Ryan Marcus, Paul Petersen, Jesmin Jahan Tithi, Tim Mattson, Tim Kraska, Pradeep Dubey, Vivek Sarkar, Justin Gottschlich

The simplified parse tree (SPT) presented in Aroma, a state-of-the-art code recommendation system, is a tree-structured representation used to infer code semantics by capturing program \emph{structure} rather than program \emph{syntax}.

ARDA: Automatic Relational Data Augmentation for Machine Learning

1 code implementation21 Mar 2020 Nadiia Chepurko, Ryan Marcus, Emanuel Zgraggen, Raul Castro Fernandez, Tim Kraska, David Karger

Our system has two distinct components: (1) a framework to search and join data with the input data, based on various attributes of the input, and (2) an efficient feature selection algorithm that prunes out noisy or irrelevant features from the resulting join.

BIG-bench Machine Learning Data Augmentation +2

SOSD: A Benchmark for Learned Indexes

1 code implementation29 Nov 2019 Andreas Kipf, Ryan Marcus, Alexander van Renen, Mihail Stoian, Alfons Kemper, Tim Kraska, Thomas Neumann

A groundswell of recent work has focused on improving data management systems with learned components.

Benchmarking Management

Flexible Operator Embeddings via Deep Learning

no code implementations25 Jan 2019 Ryan Marcus, Olga Papaemmanouil

Integrating machine learning into the internals of database management systems requires significant feature engineering, a human effort-intensive process to determine the best way to represent the pieces of information that are relevant to a task.

Feature Engineering Management

Deep Reinforcement Learning for Join Order Enumeration

no code implementations28 Feb 2018 Ryan Marcus, Olga Papaemmanouil

However, modern query optimizers typically employ static join enumeration algorithms that do not receive any feedback about the quality of the resulting plan.

Decision Making reinforcement-learning +1

WiSeDB: A Learning-based Workload Management Advisor for Cloud Databases

1 code implementation29 Jan 2016 Ryan Marcus, Olga Papaemmanouil

Workload management for cloud databases must deal with the tasks of resource provisioning, query placement and query scheduling in a manner that meets the application's performance goals while minimizing the cost of using cloud resources.

Databases

Cannot find the paper you are looking for? You can Submit a new open access paper.