Search Results for author: Mudhakar Srivatsa

Found 15 papers, 2 papers with code

Accelerating Production LLMs with Combined Token/Embedding Speculators

1 code implementation29 Apr 2024 Davis Wertheimer, Joshua Rosenkranz, Thomas Parnell, Sahil Suneja, Pavithra Ranganathan, Raghu Ganti, Mudhakar Srivatsa

This technical report describes the design and training of novel speculative decoding draft models, for accelerating the inference speeds of large language models in a production environment.

SudokuSens: Enhancing Deep Learning Robustness for IoT Sensing Applications using a Generative Approach

no code implementations3 Feb 2024 Tianshi Wang, Jinyang Li, Ruijie Wang, Denizhan Kara, Shengzhong Liu, Davis Wertheimer, Antoni Viros-i-Martin, Raghu Ganti, Mudhakar Srivatsa, Tarek Abdelzaher

To incorporate sufficient diversity into the IoT training data, one therefore needs to consider a combinatorial explosion of training cases that are multiplicative in the number of objects considered and the possible environmental conditions in which such objects may be encountered.

Contrastive Learning

TP-Aware Dequantization

no code implementations15 Jan 2024 Adnan Hoque, Mudhakar Srivatsa, Chih-Chieh Yang, Raghu Ganti

In this paper, we present a novel method that reduces model inference latency during distributed deployment of Large Language Models (LLMs).

Quantization

Accelerating a Triton Fused Kernel for W4A16 Quantized Inference with SplitK work decomposition

no code implementations5 Jan 2024 Adnan Hoque, Less Wright, Chih-Chieh Yang, Mudhakar Srivatsa, Raghu Ganti

Our implementation shows improvement for the type of skinny matrix-matrix multiplications found in foundation model inference workloads.

State Action Separable Reinforcement Learning

no code implementations5 Jun 2020 Ziyao Zhang, Liang Ma, Kin K. Leung, Konstantinos Poularakis, Mudhakar Srivatsa

We observe that although actions directly define the agents' behaviors, for many problems the next state after a state transition matters more than the action taken, in determining the return of such a state transition.

Decision Making reinforcement-learning +2

Neural Network Tomography

no code implementations9 Jan 2020 Liang Ma, Ziyao Zhang, Mudhakar Srivatsa

Network tomography, a classic research problem in the realm of network monitoring, refers to the methodology of inferring unmeasured network attributes using selected end-to-end path measurements.

Data Augmentation

neuralRank: Searching and ranking ANN-based model repositories

no code implementations2 Mar 2019 Nirmit Desai, Linsong Chu, Raghu K. Ganti, Sebastian Stein, Mudhakar Srivatsa

The key idea behind this algorithm is to base model suitability on the discriminating power of a model, using a novel metric to measure it.

Transfer Learning

Actor Conditioned Attention Maps for Video Action Detection

2 code implementations30 Dec 2018 Oytun Ulutan, Swati Rallapalli, Mudhakar Srivatsa, Carlos Torres, B. S. Manjunath

While observing complex events with multiple actors, humans do not assess each actor separately, but infer from the context.

Action Detection

Object Localization and Size Estimation from RGB-D Images

no code implementations2 Aug 2018 ShreeRanjani SrirangamSridharan, Oytun Ulutan, Shehzad Noor Taus Priyo, Swati Rallapalli, Mudhakar Srivatsa

However, the addition of a depth image can be further used to segment images that might otherwise have identical color information.

Object Object Localization

Beyond Spatial Auto-Regressive Models: Predicting Housing Prices with Satellite Imagery

no code implementations16 Oct 2016 Archith J. Bency, Swati Rallapalli, Raghu K. Ganti, Mudhakar Srivatsa, B. S. Manjunath

Spatial Auto-Regression (SAR) is a common tool used to model such data, where the spatial contiguity matrix (W) encodes the spatial correlations.

Cannot find the paper you are looking for? You can Submit a new open access paper.