1 code implementation • 10 Oct 2023 • Piyush Singh Pasi, Karthikeya Battepati, Preethi Jyothi, Ganesh Ramakrishnan, Tanmay Mahapatra, Manoj Singh
The problem of audio-to-text alignment has seen significant amount of research using complete supervision during training.
no code implementations • 31 Mar 2022 • Piyush Singh Pasi, Shubham Nemani, Preethi Jyothi, Ganesh Ramakrishnan
We focus on the audio-visual video parsing (AVVP) problem that involves detecting audio and visual event labels with temporal boundaries.