no code implementations • 31 Jan 2024 • Dingyi Dai, Yichi Zhang, Jiahao Zhang, Zhanqiu Hu, Yaohui Cai, Qi Sun, Zhiru Zhang
Quantization is a crucial technique for deploying deep learning models on resource-constrained devices, such as embedded FPGAs.
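For context, here is a minimal sketch of symmetric uniform quantization; this is a generic illustration, not the specific scheme proposed in the paper:

```python
import numpy as np

def quantize_uniform(w, num_bits=8):
    """Symmetric uniform quantization of a weight tensor to signed integers."""
    qmax = 2 ** (num_bits - 1) - 1            # e.g. 127 for 8 bits
    scale = np.abs(w).max() / qmax            # one scale factor for the whole tensor
    q = np.clip(np.round(w / scale), -qmax, qmax).astype(np.int8)
    return q, scale                           # dequantize as q * scale

w = np.random.randn(64, 64).astype(np.float32)
q, scale = quantize_uniform(w)
print(np.abs(w - q * scale).max())            # worst-case rounding error is about scale / 2
```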
no code implementations • 23 Dec 2023 • Hongzheng Chen, Jiahao Zhang, Yixiao Du, Shaojie Xiang, Zichao Yue, Niansong Zhang, Yaohui Cai, Zhiru Zhang
Experimental results demonstrate that our approach can achieve up to 13.4x speedup when compared to previous FPGA-based accelerators for the BERT model.
1 code implementation • NeurIPS 2023 • Jerry Chee, Yaohui Cai, Volodymyr Kuleshov, Christopher De Sa
This work studies post-training parameter quantization in large language models (LLMs).
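For background, a minimal round-to-nearest post-training quantization sketch for a single linear layer; this is a common PTQ baseline, not the method introduced in the paper, and the bit-width and per-row scaling are illustrative choices:

```python
import torch

@torch.no_grad()
def rtn_quantize_linear(linear, num_bits=4):
    """Round-to-nearest weight quantization of a Linear layer, one scale per
    output row -- a simple post-training baseline, applied without retraining."""
    w = linear.weight.data                              # (out_features, in_features)
    qmax = 2 ** (num_bits - 1) - 1
    scale = w.abs().amax(dim=1, keepdim=True) / qmax    # per-row scale
    q = torch.clamp(torch.round(w / scale), -qmax, qmax)
    linear.weight.data = q * scale                      # store dequantized weights
    return linear

layer = rtn_quantize_linear(torch.nn.Linear(4096, 4096), num_bits=4)
```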
no code implementations • 4 Mar 2022 • Yaohui Cai, Weizhe Hua, Hongzheng Chen, G. Edward Suh, Christopher De Sa, Zhiru Zhang
In addition, since PreCropping compresses CNNs at initialization, the computational and memory costs of CNNs are reduced for both training and inference on commodity hardware.
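A hypothetical sketch of cropping channels at initialization; the uniform channel selection here is only a placeholder, not the paper's criterion:

```python
import torch.nn as nn

def crop_conv_at_init(conv, keep_ratio=0.5):
    """Replace a Conv2d with a thinner copy before any training happens, so the
    saved compute and memory apply to both training and inference."""
    kept = max(1, int(conv.out_channels * keep_ratio))
    return nn.Conv2d(conv.in_channels, kept,
                     kernel_size=conv.kernel_size, stride=conv.stride,
                     padding=conv.padding, bias=conv.bias is not None)

conv = nn.Conv2d(64, 128, kernel_size=3, padding=1)
small = crop_conv_at_init(conv, keep_ratio=0.5)   # 64 output channels instead of 128
```

In a real network the following layer's input channels would have to shrink accordingly.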
2 code implementations • 7 Feb 2021 • Wuxinlin Cheng, Chenhui Deng, Zhiqiang Zhao, Yaohui Cai, Zhiru Zhang, Zhuo Feng
A black-box spectral method is introduced for evaluating the adversarial robustness of a given machine learning (ML) model.
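As a loose point of reference only, a naive black-box sensitivity probe via random finite differences; this is not the spectral method proposed in the paper:

```python
import torch

@torch.no_grad()
def sensitivity_probe(model, x, eps=1e-3, num_dirs=32):
    """Crude black-box sensitivity estimate: probe random unit directions and
    record the largest finite-difference change in the model output."""
    y0 = model(x)
    worst = 0.0
    for _ in range(num_dirs):
        v = torch.randn_like(x)
        v = v / v.norm()
        worst = max(worst, ((model(x + eps * v) - y0).norm() / eps).item())
    return worst
```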
3 code implementations • 12 Jun 2020 • Zhen Dong, Dequan Wang, Qijing Huang, Yizhao Gao, Yaohui Cai, Tian Li, Bichen Wu, Kurt Keutzer, John Wawrzynek
Deploying deep learning models on embedded systems has been challenging due to limited computing resources.
2 code implementations • 19 Feb 2020 • Qijing Huang, Dequan Wang, Yizhao Gao, Yaohui Cai, Zhen Dong, Bichen Wu, Kurt Keutzer, John Wawrzynek
In this work, we first investigate the overhead of the deformable convolution on embedded FPGA SoCs, and then show the accuracy-latency tradeoffs for a set of algorithm modifications including full versus depthwise, fixed-shape, and limited-range.
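A rough sketch of what a "limited-range" modification could look like, assuming torchvision's deform_conv2d and an illustrative clamp range:

```python
import torch
from torchvision.ops import deform_conv2d

x = torch.randn(1, 16, 32, 32)
weight = torch.randn(16, 16, 3, 3)
offset = torch.randn(1, 2 * 3 * 3, 32, 32)   # (dy, dx) for each of the 3x3 kernel taps
offset = offset.clamp(-2.0, 2.0)             # limited-range: cap how far taps may move
y = deform_conv2d(x, offset, weight, padding=1)
print(y.shape)                                # torch.Size([1, 16, 32, 32])
```

Restricting the offsets bounds the sampling window, which simplifies line buffering in hardware.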
3 code implementations • CVPR 2020 • Yaohui Cai, Zhewei Yao, Zhen Dong, Amir Gholami, Michael W. Mahoney, Kurt Keutzer
Importantly, ZeroQ has a very low computational overhead, and it can finish the entire quantization process in less than 30 seconds (0.5% of one epoch of training time for ResNet-50 on ImageNet).
Ranked #1 on Data-Free Quantization on CIFAR-10 (W8A8 Top-1 Accuracy metric)
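A much-simplified sketch of the distilled-data idea behind data-free quantization, i.e. optimizing synthetic inputs so their activation statistics match the BatchNorm running statistics; the step count and learning rate are placeholders, and a pretrained model would be used in practice (random weights here only keep the sketch self-contained):

```python
import torch
import torchvision

model = torchvision.models.resnet50(weights=None).eval()
x = torch.randn(32, 3, 224, 224, requires_grad=True)   # synthetic "distilled" batch
opt = torch.optim.Adam([x], lr=0.5)
bn_losses = []

def bn_hook(mod, inp, out):
    # Compare the batch statistics seen by each BatchNorm layer with its
    # stored running statistics -- no real training data is required.
    mean = inp[0].mean(dim=(0, 2, 3))
    var = inp[0].var(dim=(0, 2, 3))
    bn_losses.append(((mean - mod.running_mean) ** 2).mean()
                     + ((var - mod.running_var) ** 2).mean())

for m in model.modules():
    if isinstance(m, torch.nn.BatchNorm2d):
        m.register_forward_hook(bn_hook)

for _ in range(50):                       # a few hundred steps in practice
    bn_losses.clear()
    opt.zero_grad()
    model(x)
    torch.stack(bn_losses).sum().backward()
    opt.step()
```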
2 code implementations • NeurIPS 2020 • Zhen Dong, Zhewei Yao, Yaohui Cai, Daiyaan Arfeen, Amir Gholami, Michael W. Mahoney, Kurt Keutzer
However, the search space for a mixed-precision quantization is exponential in the number of layers.
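As a concrete illustration of that exponential blow-up (the bit-width choices and layer count below are made up):

```python
# With B candidate bit-widths per layer and L layers, exhaustive mixed-precision
# search must consider B**L configurations.
num_bit_choices, num_layers = 4, 50        # illustrative: 4 bit-width options, ~ResNet-50 depth
print(num_bit_choices ** num_layers)       # 1267650600228229401496703205376, about 1.3e30
```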