Search Results for author: Danny Loh

Found 4 papers, 4 papers with code

Highly Optimized Kernels and Fine-Grained Codebooks for LLM Inference on Arm CPUs

1 code implementation23 Dec 2024 Dibakar Gope, David Mansell, Danny Loh, Ian Bratt

Large language models (LLMs) have transformed the way we think about language understanding and generation, enthralling both researchers and developers.

Quantization

Restructurable Activation Networks

1 code implementation17 Aug 2022 Kartikeya Bhardwaj, James Ward, Caleb Tung, Dibakar Gope, Lingchuan Meng, Igor Fedorov, Alex Chalfin, Paul Whatmough, Danny Loh

To address this question, we propose a new paradigm called Restructurable Activation Networks (RANs) that manipulate the amount of non-linearity in models to improve their hardware-awareness and efficiency.

object-detection Object Detection

Collapsible Linear Blocks for Super-Efficient Super Resolution

3 code implementations17 Mar 2021 Kartikeya Bhardwaj, Milos Milosavljevic, Liam O'Neil, Dibakar Gope, Ramon Matas, Alex Chalfin, Naveen Suda, Lingchuan Meng, Danny Loh

Our results highlight the challenges faced by super resolution on AI accelerators and demonstrate that SESR is significantly faster (e. g., 6x-8x higher FPS) than existing models on mobile-NPU.

4k 8k +1

Cannot find the paper you are looking for? You can Submit a new open access paper.