Search Results for author: Animesh Jain

Found 4 papers, 0 papers with code

Automated Backend-Aware Post-Training Quantization

no code implementations • 27 Mar 2021 • Ziheng Jiang, Animesh Jain, Andrew Liu, Josh Fromm, Chengqian Ma, Tianqi Chen, Luis Ceze

Quantization is a key technique to reduce the resource requirement and improve the performance of neural network deployment.

Quantization

UNIT: Unifying Tensorized Instruction Compilation

no code implementations • 21 Jan 2021 • Jian Weng, Animesh Jain, Jie Wang, Leyuan Wang, Yida Wang, Tony Nowatzki

However, it is hard to leverage mixed precision without hardware support because of the overhead of data casting.

Efficient Execution of Quantized Deep Learning Models: A Compiler Approach

no code implementations • 18 Jun 2020 • Animesh Jain, Shoubhik Bhattacharya, Masahiro Masuda, Vin Sharma, Yida Wang

A deep learning compiler such as Apache TVM can enable the efficient execution of models from various frameworks on various targets.

Quantization
