Search Results for author: Yanzi Jin

Found 5 papers, 2 papers with code

KV Prediction for Improved Time to First Token

1 code implementation10 Oct 2024 Maxwell Horton, Qingqing Cao, Chenfan Sun, Yanzi Jin, Sachin Mehta, Mohammad Rastegari, Moin Nabi

In our method, a small auxiliary model is used to process the prompt and produce an approximation of the KV cache used by a base model.

Code Completion HumanEval +2

An Efficient and Streaming Audio Visual Active Speaker Detection System

no code implementations13 Sep 2024 Arnav Kundu, Yanzi Jin, Mohammad Sekhavat, Max Horton, Danny Tormoen, Devang Naik

This paper delves into the challenging task of Active Speaker Detection (ASD), where the system needs to determine in real-time whether a person is speaking or not in a series of video frames.

Active Speaker Detection Audio-Visual Active Speaker Detection

Diffusion Models as Masked Audio-Video Learners

no code implementations5 Oct 2023 Elvis Nunez, Yanzi Jin, Mohammad Rastegari, Sachin Mehta, Maxwell Horton

Over the past several years, the synchronization between audio and visual signals has been leveraged to learn richer audio-visual representations.

Audio Classification Contrastive Learning

Layer-Wise Data-Free CNN Compression

no code implementations18 Nov 2020 Maxwell Horton, Yanzi Jin, Ali Farhadi, Mohammad Rastegari

We also show how to precondition the network to improve the accuracy of our layer-wise compression method.

Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.