Search Results for author: Iman Keivanloo

Found 2 papers, 0 papers with code

Magic Pyramid: Accelerating Inference with Early Exiting and Token Pruning

no code implementations • 30 Oct 2021 • Xuanli He, Iman Keivanloo, Yi Xu, Xiang He, Belinda Zeng, Santosh Rajagopalan, Trishul Chilimbi

To achieve this, we propose a novel idea, Magic Pyramid (MP), to reduce both width-wise and depth-wise computation via token pruning and early exiting for Transformer-based models, particularly BERT.

Text Classification

Low-Precision Quantization for Efficient Nearest Neighbor Search

no code implementations • 17 Oct 2021 • Anthony Ko, Iman Keivanloo, Vihan Lakshman, Eric Schkufza

Fast k-Nearest Neighbor search over real-valued vector spaces (KNN) is an important algorithmic task for information retrieval and recommendation systems.

Information Retrieval, Quantization, +2
