Search Results for author: G. Edward Suh

Found 12 papers, 3 papers with code

Approximating ReLU on a Reduced Ring for Efficient MPC-based Private Inference

no code implementations • 9 Sep 2023 • Kiwan Maeng, G. Edward Suh

Secure multi-party computation (MPC) allows users to offload machine learning inference on untrusted servers without having to share their privacy-sensitive data.
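Under secret sharing, MPC protocols evaluate additions and multiplications cheaply, while exact comparisons (and hence exact ReLU) are expensive, which is why approximating ReLU pays off. As a rough illustration of the general motivation only (a plain least-squares polynomial fit, not the paper's reduced-ring construction):

```python
import numpy as np

# Fit a degree-2 polynomial to ReLU on [-4, 4]. Low-degree polynomials
# need only multiplications and additions, which are the cheap
# operations in MPC; the comparison inside an exact ReLU is not.
xs = np.linspace(-4.0, 4.0, 1001)
relu = np.maximum(xs, 0.0)
coeffs = np.polyfit(xs, relu, deg=2)  # [a, b, c] for a*x^2 + b*x + c

def approx_relu(x):
    return np.polyval(coeffs, x)

max_err = np.max(np.abs(approx_relu(xs) - relu))
```

The quality/cost trade-off is the interesting part: a wider fitting interval or lower degree shrinks the MPC cost but grows `max_err`.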

GPU-based Private Information Retrieval for On-Device Machine Learning Inference

1 code implementation • 26 Jan 2023 • Maximilian Lam, Jeff Johnson, Wenjie Xiong, Kiwan Maeng, Udit Gupta, Yang Li, Liangzhen Lai, Ilias Leontiadis, Minsoo Rhu, Hsien-Hsin S. Lee, Vijay Janapa Reddi, Gu-Yeon Wei, David Brooks, G. Edward Suh

Together, for various on-device ML applications such as recommendation and language modeling, our system on a single V100 GPU can serve up to $100,000$ queries per second, a $>100\times$ throughput improvement over a CPU-based baseline, while maintaining model accuracy.

Information Retrieval • Language Modelling • +1

Data Leakage via Access Patterns of Sparse Features in Deep Learning-based Recommendation Systems

no code implementations • 12 Dec 2022 • Hanieh Hashemi, Wenjie Xiong, Liu Ke, Kiwan Maeng, Murali Annavaram, G. Edward Suh, Hsien-Hsin S. Lee

This paper explores the private information that may be learned by tracking a recommendation model's sparse feature access patterns.

Recommendation Systems

Structured Pruning is All You Need for Pruning CNNs at Initialization

no code implementations • 4 Mar 2022 • Yaohui Cai, Weizhe Hua, Hongzheng Chen, G. Edward Suh, Christopher De Sa, Zhiru Zhang

In addition, since PreCropping compresses CNNs at initialization, the computational and memory costs of CNNs are reduced for both training and inference on commodity hardware.

Model Compression
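Pruning at initialization means the channel structure is fixed before any training, so both the forward and backward passes run on the smaller network. A minimal sketch of structured (channel-level) pruning at initialization, using a generic weight-magnitude saliency for illustration (PreCropping's actual criterion differs):

```python
import numpy as np

rng = np.random.default_rng(0)
# Conv weight tensor at initialization: (out_channels, in_channels, kH, kW).
w = rng.standard_normal((16, 8, 3, 3))

# Score each output channel (here: by L1 norm, a placeholder saliency)
# and keep the top half. Removing whole channels, rather than single
# weights, keeps the layer dense and fast on commodity hardware.
scores = np.abs(w).sum(axis=(1, 2, 3))
keep = np.sort(np.argsort(scores)[-8:])  # indices of the 8 strongest channels
pruned = w[keep]                         # structurally smaller layer
```

Because `pruned` is an ordinary dense tensor half the size, the savings apply to training from step one, not just to inference.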

GuardNN: Secure Accelerator Architecture for Privacy-Preserving Deep Learning

no code implementations • 26 Aug 2020 • Weizhe Hua, Muhammad Umar, Zhiru Zhang, G. Edward Suh

This paper proposes GuardNN, a secure DNN accelerator that provides hardware-based protection for user data and model parameters even in an untrusted environment.

Privacy Preserving • Privacy Preserving Deep Learning

MGX: Near-Zero Overhead Memory Protection for Data-Intensive Accelerators

no code implementations • 20 Apr 2020 • Weizhe Hua, Muhammad Umar, Zhiru Zhang, G. Edward Suh

This paper introduces MGX, a near-zero overhead memory protection scheme for hardware accelerators.

Precision Gating: Improving Neural Network Efficiency with Dynamic Dual-Precision Activations

1 code implementation • ICLR 2020 • Yichi Zhang, Ritchie Zhao, Weizhe Hua, Nayun Xu, G. Edward Suh, Zhiru Zhang

The proposed approach is applicable to a variety of DNN architectures and significantly reduces the computational cost of DNN execution with almost no accuracy loss.

Quantization
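The dual-precision idea is that most activations can be computed cheaply at low precision, with only the "important" ones (as judged by the low-precision result itself) redone at high precision. A NumPy sketch of that two-phase pattern, with an arbitrary threshold and a simple uniform quantizer standing in for the paper's learned gating:

```python
import numpy as np

def quantize(x, bits):
    # Uniform symmetric quantization to the given bit-width (illustrative).
    scale = np.max(np.abs(x)) / (2 ** (bits - 1) - 1)
    return np.round(x / scale) * scale

rng = np.random.default_rng(0)
x = rng.standard_normal(1000)
w = rng.standard_normal(1000)

# Phase 1: cheap low-precision products everywhere.
low = quantize(x, 4) * quantize(w, 4)
# Phase 2: recompute only the large-magnitude products at high precision
# (the 90th-percentile threshold here is a placeholder, not learned).
thresh = np.quantile(np.abs(low), 0.9)
hot = np.abs(low) >= thresh
out = np.where(hot, x * w, low)

err_low = np.abs(low - x * w).mean()
err_gated = np.abs(out - x * w).mean()
```

Only the `hot` fraction pays the high-precision cost, yet the mean error drops relative to running everything at low precision.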

Channel Gating Neural Networks

1 code implementation • NeurIPS 2019 • Weizhe Hua, Yuan Zhou, Christopher De Sa, Zhiru Zhang, G. Edward Suh

Combining our method with knowledge distillation reduces the compute cost of ResNet-18 by 2.6$\times$ without an accuracy drop on ImageNet.

Knowledge Distillation • Network Pruning
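Channel gating computes a cheap partial sum over a subset of input channels and uses it to decide, per position, whether the remaining channels are worth computing at all. A minimal NumPy sketch of that two-phase idea (the gate here is a fixed sign threshold, a placeholder for the learned gating policy):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal((64, 32))   # 64 spatial positions, 32 input channels
w = rng.standard_normal(32)         # weights of one output channel

# Phase 1: partial sum over the first quarter of the input channels.
p = x[:, :8] @ w[:8]
# Phase 2: only positions whose partial sum clears the (illustrative)
# threshold spend compute on the remaining three quarters.
gate = p > 0.0
full = p + (x[:, 8:] @ w[8:])
out = np.where(gate, full, p)

saved = 1.0 - (8 + gate.mean() * 24) / 32  # fraction of MACs skipped
```

The savings come from `~gate` positions never touching the last 24 channels; in the paper the decision function is learned end-to-end rather than a hard sign test.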
