Search Results for author: Maximilian Lam

Found 11 papers, 7 papers with code

GPU-based Private Information Retrieval for On-Device Machine Learning Inference

1 code implementation • 26 Jan 2023 • Maximilian Lam, Jeff Johnson, Wenjie Xiong, Kiwan Maeng, Udit Gupta, Yang Li, Liangzhen Lai, Ilias Leontiadis, Minsoo Rhu, Hsien-Hsin S. Lee, Vijay Janapa Reddi, Gu-Yeon Wei, David Brooks, G. Edward Suh

Together, for various on-device ML applications such as recommendation and language modeling, our system on a single V100 GPU can serve up to $100, 000$ queries per second -- a $>100 \times$ throughput improvement over a CPU-based baseline -- while maintaining model accuracy.

Information Retrieval Language Modelling +1

Paper
Code

Tabula: Efficiently Computing Nonlinear Activation Functions for Secure Neural Network Inference

no code implementations • 5 Mar 2022 • Maximilian Lam, Michael Mitzenmacher, Vijay Janapa Reddi, Gu-Yeon Wei, David Brooks

Multiparty computation approaches to secure neural network inference traditionally rely on garbled circuits for securely executing nonlinear activation functions.

Paper
Add Code

The People's Speech: A Large-Scale Diverse English Speech Recognition Dataset for Commercial Usage

no code implementations • 17 Nov 2021 • Daniel Galvez, Greg Diamos, Juan Ciro, Juan Felipe Cerón, Keith Achorn, Anjali Gopi, David Kanter, Maximilian Lam, Mark Mazumder, Vijay Janapa Reddi

The People's Speech is a free-to-download 30, 000-hour and growing supervised conversational English speech recognition dataset licensed for academic and commercial usage under CC-BY-SA (with a CC-BY subset).

speech-recognition Speech Recognition

Paper
Add Code

Gradient Disaggregation: Breaking Privacy in Federated Learning by Reconstructing the User Participant Matrix

1 code implementation • 10 Jun 2021 • Maximilian Lam, Gu-Yeon Wei, David Brooks, Vijay Janapa Reddi, Michael Mitzenmacher

We show that aggregated model updates in federated learning may be insecure.

Federated Learning

Paper
Code

Widening Access to Applied Machine Learning with TinyML

1 code implementation • 7 Jun 2021 • Vijay Janapa Reddi, Brian Plancher, Susan Kennedy, Laurence Moroney, Pete Warden, Anant Agarwal, Colby Banbury, Massimo Banzi, Matthew Bennett, Benjamin Brown, Sharad Chitlangia, Radhika Ghosal, Sarah Grafman, Rupert Jaeger, Srivatsan Krishnan, Maximilian Lam, Daniel Leiker, Cara Mann, Mark Mazumder, Dominic Pajak, Dhilan Ramaprasad, J. Evan Smith, Matthew Stewart, Dustin Tingley

Broadening access to both computational and educational resources is critical to diffusing machine-learning (ML) innovation.

BIG-bench Machine Learning

133

Paper
Code

Quantized Neural Network Inference with Precision Batching

no code implementations • 26 Feb 2020 • Maximilian Lam, Zachary Yedidia, Colby Banbury, Vijay Janapa Reddi

We present PrecisionBatching, a quantized inference algorithm for speeding up neural network execution on traditional hardware platforms at low bitwidths without the need for retraining or recalibration.

Language Modelling Natural Language Inference +1

Paper
Add Code

QuaRL: Quantization for Fast and Environmentally Sustainable Reinforcement Learning

1 code implementation • 2 Oct 2019 • Srivatsan Krishnan, Maximilian Lam, Sharad Chitlangia, Zishen Wan, Gabriel Barth-Maron, Aleksandra Faust, Vijay Janapa Reddi

We believe that this is the first of many future works on enabling computationally energy-efficient and sustainable reinforcement learning.

Decision Making Quantization +2

Paper
Code

Word2Bits - Quantized Word Vectors

1 code implementation • 15 Mar 2018 • Maximilian Lam

Word vectors require significant amounts of memory and storage, posing issues to resource limited devices like mobile phones and GPUs.

Quantization Question Answering +1

755

Paper
Code

Cataloging the Visible Universe through Bayesian Inference at Petascale

1 code implementation • 31 Jan 2018 • Jeffrey Regier, Kiran Pamnany, Keno Fischer, Andreas Noack, Maximilian Lam, Jarrett Revels, Steve Howard, Ryan Giordano, David Schlegel, Jon McAuliffe, Rollin Thomas, Prabhat

We construct an astronomical catalog from 55 TB of imaging data using Celeste, a Bayesian variational inference code written entirely in the high-productivity programming language Julia.

Distributed, Parallel, and Cluster Computing Instrumentation and Methods for Astrophysics 85A35, 68W10, 62P35 J.2; D.1.3; G.3; I.2; D.2

180

Paper
Code

CYCLADES: Conflict-free Asynchronous Machine Learning

1 code implementation • NeurIPS 2016 • Xinghao Pan, Maximilian Lam, Stephen Tu, Dimitris Papailiopoulos, Ce Zhang, Michael. I. Jordan, Kannan Ramchandran, Chris Re, Benjamin Recht

We present CYCLADES, a general framework for parallelizing stochastic optimization algorithms in a shared memory setting.

BIG-bench Machine Learning Stochastic Optimization

Paper
Code

Speeding Up Distributed Machine Learning Using Codes

no code implementations • 8 Dec 2015 • Kangwook Lee, Maximilian Lam, Ramtin Pedarsani, Dimitris Papailiopoulos, Kannan Ramchandran

We focus on two of the most basic building blocks of distributed learning algorithms: matrix multiplication and data shuffling.

BIG-bench Machine Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.