Search Results

Reproducible and Portable Big Data Analytics in the Cloud

1 code implementation17 Dec 2021

To tackle these problems, we leverage serverless computing and containerization techniques for automated scalable execution and reproducibility, and utilize the adapter design pattern to enable application portability and reproducibility across different clouds.

Cloud Computing Descriptive

Serverless Query Processing with Flexible Performance SLAs and Prices

1 code implementation2 Sep 2024

In this paper, we first define the problem of flexible performance SLAs and prices in serverless query processing and discuss its significance.

Databases

PixelsDB: Serverless and NL-Aided Data Analytics with Flexible Service Levels and Prices

1 code implementation30 May 2024

The queries are then executed by a serverless query engine that offers varying prices for different performance service levels (SLAs).

Scheduling

Transparent Serverless execution of Python multiprocessing applications

1 code implementation18 May 2022

The results show that Python's multiprocessing library design is an enabler towards transparency: legacy applications using efficient disaggregated abstractions can transparently scale beyond VM limited resources for increased parallelism without changing the underlying code or architecture.

Distributed, Parallel, and Cluster Computing

Manu: A Cloud Native Vector Database Management System

1 code implementation28 Jun 2022

In the past three years, through interaction with our 1200+ industry users, we have sketched a vision for the features that next-generation vector databases should have, which include long-term evolvability, tunable consistency, good elasticity, and high performance.

Databases

Deep Lake: a Lakehouse for Deep Learning

3 code implementations22 Sep 2022

Traditional data lakes provide critical data infrastructure for analytical workloads by enabling time travel, running SQL queries, ingesting data with ACID transactions, and visualizing petabyte-scale datasets on cloud storage.

Decision Making Deep Learning

PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds

2 code implementations30 Jul 2022

The first is scene-level swapping which exchanges point cloud sectors of two LiDAR scans that are cut along the azimuth axis.

3D Object Detection 3D Unsupervised Domain Adaptation +3

Flock: A Low-Cost Streaming Query Engine on FaaS Platforms

1 code implementation27 Dec 2023

Existing serverless data analytics systems rely on external storage services like S3 for data shuffling and communication between cloud functions.

Databases Distributed, Parallel, and Cluster Computing

Satellite Image Time Series Analysis for Big Earth Observation Data

1 code implementation24 Apr 2022

Solutions that are efficient for specific hardware architectures can not be used in other environments.

BIG-bench Machine Learning Cloud Computing +3

Collective Mind, Part II: Towards Performance- and Cost-Aware Software Engineering as a Natural Science

22 code implementations20 Jun 2015

Nowadays, engineers have to develop software often without even knowing which hardware it will eventually run on in numerous mobile phones, tablets, desktops, laptops, data centers, supercomputers and cloud services.