1 code implementation • 11 Oct 2022 • Ling Li, David Thorsley, Joseph Hassoun
Sparse adaptive image Transformer (SaiT) offers varying levels of model acceleration by merely changing the token sparsity on the fly.
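As a rough illustration of the idea (not the SaiT implementation), the sketch below keeps only the top fraction of tokens by an importance score, so the keep ratio can be changed at inference time to trade accuracy for speed; the scoring function here is a hypothetical stand-in.

```python
import torch

def prune_tokens(tokens: torch.Tensor, scores: torch.Tensor, keep_ratio: float) -> torch.Tensor:
    """tokens: (B, N, D); scores: (B, N) per-token importance (stand-in scoring)."""
    num_keep = max(1, int(tokens.size(1) * keep_ratio))
    idx = scores.topk(num_keep, dim=1).indices                 # (B, num_keep) indices of kept tokens
    idx = idx.unsqueeze(-1).expand(-1, -1, tokens.size(-1))    # broadcast to feature dim
    return tokens.gather(1, idx)                               # (B, num_keep, D)

x = torch.randn(2, 196, 768)   # e.g. ViT patch tokens
s = torch.rand(2, 196)         # stand-in importance scores
print(prune_tokens(x, s, keep_ratio=0.5).shape)  # torch.Size([2, 98, 768])
```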
no code implementations • 6 Jul 2022 • Ling Li, Ali Shafiee Ardestani, Joseph Hassoun
Though image transformers have shown results competitive with convolutional neural networks in computer vision tasks, their lack of inductive biases such as locality still hampers model efficiency, especially for embedded applications.
2 code implementations • 29 Mar 2022 • Woosuk Kwon, Sehoon Kim, Michael W. Mahoney, Joseph Hassoun, Kurt Keutzer, Amir Gholami
To address this, we propose a fast post-training pruning framework for Transformers that does not require any retraining.
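A minimal sketch of the underlying idea, retraining-free structured pruning, assuming a hypothetical per-head importance score estimated from a few calibration batches (the paper's actual framework differs in detail):

```python
import torch

def head_importance(grad_samples: list[torch.Tensor]) -> torch.Tensor:
    """grad_samples: per-batch gradient tensors, each shaped (num_heads,).
    Returns a Fisher-style importance score per attention head."""
    return torch.stack([g.pow(2) for g in grad_samples]).mean(dim=0)

def head_mask(importance: torch.Tensor, num_prune: int) -> torch.Tensor:
    """Zero out the least important heads without any retraining."""
    mask = torch.ones_like(importance)
    mask[importance.argsort()[:num_prune]] = 0.0
    return mask

imp = head_importance([torch.randn(12) for _ in range(8)])  # 8 calibration batches, 12 heads
print(head_mask(imp, num_prune=4))                          # 4 heads masked out
```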
no code implementations • 29 Sep 2021 • Jun Fang, Li Yang, Chengyao Shen, Hamzah Abdel-Aziz, David Thorsley, Joseph Hassoun
In this work, we continue the effort to reduce the training cost of OFA methods.
1 code implementation • 2 Jul 2021 • Sehoon Kim, Sheng Shen, David Thorsley, Amir Gholami, Woosuk Kwon, Joseph Hassoun, Kurt Keutzer
We extensively test the performance of LTP on GLUE tasks and show that our method outperforms the prior state-of-the-art token pruning methods by up to ~2.5% higher accuracy with the same amount of FLOPs.
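A minimal sketch in the spirit of threshold-based token pruning (not the authors' LTP code): here a token's importance is taken as the mean attention it receives, and tokens falling below a per-layer threshold are dropped, with the [CLS] token always kept.

```python
import torch

def threshold_prune(tokens: torch.Tensor, attn: torch.Tensor, threshold: float):
    """tokens: (B, N, D); attn: (B, H, N, N) attention probabilities."""
    importance = attn.mean(dim=1).mean(dim=1)   # (B, N): attention received, averaged over heads and queries
    keep = importance >= threshold              # boolean keep mask per token
    keep[:, 0] = True                           # always keep the [CLS] token
    # Kept token counts vary per sample, so return a ragged list of tensors.
    return [tokens[b, keep[b]] for b in range(tokens.size(0))]
```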
3 code implementations • ECCV 2020 • Jun Fang, Ali Shafiee, Hamzah Abdel-Aziz, David Thorsley, Georgios Georgiadis, Joseph Hassoun
Quantization plays an important role in the energy-efficient deployment of deep neural networks on resource-limited devices.
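As a toy illustration of uniform post-training weight quantization (not the piecewise scheme proposed in the paper), the snippet below maps weights to 8-bit integers with a symmetric per-tensor scale and dequantizes them back:

```python
import torch

def quantize_dequantize(w: torch.Tensor, num_bits: int = 8) -> torch.Tensor:
    qmax = 2 ** (num_bits - 1) - 1
    scale = w.abs().max() / qmax                      # symmetric per-tensor scale
    q = torch.clamp((w / scale).round(), -qmax - 1, qmax)
    return q * scale                                  # dequantized approximation

w = torch.randn(64, 64)
print((w - quantize_dequantize(w)).abs().max())       # small quantization error
```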