Search Results for author: Soroosh Khoram

Found 5 papers, 0 papers with code

Interleaved Composite Quantization for High-Dimensional Similarity Search

no code implementations • 18 Dec 2019 • Soroosh Khoram, Stephen J. Wright, Jing Li

A method often used to reduce this computational cost is quantization of the vector space and location-based encoding of the dataset vectors.

Quantization Vocal Bursts Intensity Prediction

Paper
Add Code

TOCO: A Framework for Compressing Neural Network Models Based on Tolerance Analysis

no code implementations • 18 Dec 2019 • Soroosh Khoram, Jing Li

Neural network compression methods have enabled deploying large models on emerging edge devices with little cost, by adapting already-trained models to the constraints of these devices.

Active Learning Neural Network Compression

Paper
Add Code

Efficient Large-Scale Approximate Nearest Neighbor Search on OpenCL FPGA

no code implementations • CVPR 2018 • Jialiang Zhang, Soroosh Khoram, Jing Li

The proposed method significantly outperforms state-of-the-art methods on CPU and GPU for high dimensional nearest neighbor queries on billion-scale datasets in terms of query time and accuracy regardless of the batch size.

Quantization

Paper
Add Code

DNN Model Compression Under Accuracy Constraints

no code implementations • ICLR 2018 • Soroosh Khoram, Jing Li

In this paper, we propose a technique that directly minimizes both the model complexity and the changes in the loss function.

Model Compression

Paper
Add Code

Adaptive Quantization of Neural Networks

no code implementations • ICLR 2018 • Soroosh Khoram, Jing Li

The optimization problem at the core of this method iteratively uses the loss function gradient to determine an error margin for each parameter and assigns it a precision accordingly.

Edge-computing Model Compression +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.