Search Results for author: Pragaash Ponnusamy

Found 6 papers, 2 papers with code

Training-Free Activation Sparsity in Large Language Models

1 code implementation26 Aug 2024 James Liu, Pragaash Ponnusamy, Tianle Cai, Han Guo, Yoon Kim, Ben Athiwaratkun

Activation sparsity can enable practical inference speedups in large language models (LLMs) by reducing the compute and memory-movement required for matrix multiplications during the forward pass.

Quantization

Mechanistic Design and Scaling of Hybrid Architectures

1 code implementation26 Mar 2024 Michael Poli, Armin W Thomas, Eric Nguyen, Pragaash Ponnusamy, Björn Deiseroth, Kristian Kersting, Taiji Suzuki, Brian Hie, Stefano Ermon, Christopher Ré, Ce Zhang, Stefano Massaroli

The development of deep learning architectures is a resource-demanding process, due to a vast design space, long prototyping times, and high compute costs associated with at-scale model training and evaluation.

Mamba

Personalized Query Rewriting in Conversational AI Agents

no code implementations9 Nov 2020 Alireza Roshan-Ghias, Clint Solomon Mathialagan, Pragaash Ponnusamy, Lambert Mathias, Chenlei Guo

Spoken language understanding (SLU) systems in conversational AI agents often experience errors in the form of misrecognitions by automatic speech recognition (ASR) or semantic gaps in natural language understanding (NLU).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +6

Feedback-Based Self-Learning in Large-Scale Conversational AI Agents

no code implementations6 Nov 2019 Pragaash Ponnusamy, Alireza Roshan Ghias, Chenlei Guo, Ruhi Sarikaya

Typically, the accuracy of the ML models in these components are improved by manually transcribing and annotating data.

Collaborative Filtering Self-Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.