Search Results for author: Gustavo Alonso

Found 11 papers, 6 papers with code

ACCL+: an FPGA-Based Collective Engine for Distributed Applications

no code implementations18 Dec 2023 Zhenhao He, Dario Korolija, Yu Zhu, Benjamin Ramhorst, Tristan Laan, Lucian Petrica, Michaela Blott, Gustavo Alonso

To facilitate the development of distributed applications with FPGAs, in this paper we propose ACCL+, an open-source versatile FPGA-based collective communication library.

Chameleon: a heterogeneous and disaggregated accelerator system for retrieval-augmented language models

no code implementations15 Oct 2023 Wenqi Jiang, Marco Zeller, Roger Waleffe, Torsten Hoefler, Gustavo Alonso

The heterogeneity ensures efficient acceleration of both LM inference and retrieval, while the accelerator disaggregation enables the system to independently scale both types of accelerators to fulfill diverse RALM requirements.

Language Modelling Retrieval +1

Co-design Hardware and Algorithm for Vector Search

1 code implementation19 Jun 2023 Wenqi Jiang, Shigang Li, Yu Zhu, Johannes De Fine Licht, Zhenhao He, Runbin Shi, Cedric Renggli, Shuai Zhang, Theodoros Rekatsinas, Torsten Hoefler, Gustavo Alonso

Vector search has emerged as the foundation for large-scale information retrieval and machine learning systems, with search engines like Google and Bing processing tens of thousands of queries per second on petabyte-scale document datasets by evaluating vector similarities between encoded query texts and web documents.

Information Retrieval Retrieval

Towards Demystifying Serverless Machine Learning Training

1 code implementation17 May 2021 Jiawei Jiang, Shaoduo Gan, Yue Liu, Fanlin Wang, Gustavo Alonso, Ana Klimovic, Ankit Singla, Wentao Wu, Ce Zhang

The appeal of serverless (FaaS) has triggered a growing interest on how to use it in data-intensive applications such as ETL, query processing, or machine learning (ML).

BIG-bench Machine Learning

MicroRec: Efficient Recommendation Inference by Hardware and Data Structure Solutions

no code implementations12 Oct 2020 Wenqi Jiang, Zhenhao He, Shuai Zhang, Thomas B. Preußer, Kai Zeng, Liang Feng, Jiansong Zhang, Tongxuan Liu, Yong Li, Jingren Zhou, Ce Zhang, Gustavo Alonso

MicroRec accelerates recommendation inference by (1) redesigning the data structures involved in the embeddings to reduce the number of lookups needed and (2) taking advantage of the availability of High-Bandwidth Memory (HBM) in FPGA accelerators to tackle the latency by enabling parallel lookups.

Recommendation Systems

Benchmarking High Bandwidth Memory on FPGAs

2 code implementations9 May 2020 Zeke Wang, Hongjing Huang, Jie Zhang, Gustavo Alonso

FPGAs are starting to be enhanced with High Bandwidth Memory (HBM) as a way to reduce the memory bandwidth bottleneck encountered in some applications and to give the FPGA more capacity to deal with application state.

Hardware Architecture

Modularis: Modular Relational Analytics over Heterogeneous Distributed Platforms

no code implementations7 Apr 2020 Dimitrios Koutsoukos, Ingo Müller, Renato Marroquín, Gustavo Alonso

The enormous quantity of data produced every day together with advances in data analytics has led to a proliferation of data management and analysis systems.

Databases

Rumble: Data Independence for Large Messy Data Sets

1 code implementation25 Oct 2019 Stefan Irimescu, Can Berker Cikis, Ingo Müller, Ghislain Fourny, Gustavo Alonso

This paper introduces Rumble, an engine that executes JSONiq queries on large, heterogeneous and nested collections of JSON objects, leveraging the parallel capabilities of Spark so as to provide a high degree of data independence.

JSONiq Query Execution Databases 68N99 H.2.3; C.2.4

Cannot find the paper you are looking for? You can Submit a new open access paper.