no code implementations • 28 Oct 2023 • Deepa Anand, Gurunath Reddy M, Vanika Singhal, Dattesh D. Shanbhag, Shriram KS, Uday Patil, Chitresh Bhushan, Kavitha Manickam, Dawei Gui, Rakesh Mullick, Avinash Gopal, Parminder Bhatia, Taha Kass-Hout
Vision Transformers (ViT) and Stable Diffusion (SD) models, with their ability to capture rich semantic features of an image, have recently been used for image correspondence tasks on natural images.
no code implementations • 23 Jan 2023 • Gurunath Reddy M, Zhe Zhang, Yi Yu, Florian Harscoet, Simon Canales, Suhua Tang
We propose a deep attention-based alignment network that automatically predicts lyrics and melody from incomplete lyrics given as input, in a way similar to how humans create music.
1 code implementation • 9 Nov 2020 • Soumava Paul, Gurunath Reddy M, K Sreenivasa Rao, Partha Pratim Das
Singing Voice Detection (SVD) has been an active area of research in music information retrieval (MIR).
no code implementations • 1 Jun 2020 • Sanket Shah, Basil Abraham, Gurunath Reddy M, Sunayana Sitaram, Vikas Joshi
In this work, we show that fine-tuning ASR models on code-switched speech harms performance on monolingual speech.
1 code implementation • 22 Apr 2019 • Pradeep Rengaswamy, Gurunath Reddy M, Krothapalli Sreenivasa Rao
The proposed hybrid model exploits the advantages of deep learning and signal processing methods to minimize the pitch detection error, and adapts to various modes of acoustic signals.
no code implementations • 25 Nov 2018 • Gurunath Reddy M, Tanumay Mandal, Krothapalli Sreenivasa Rao
In this paper, we propose a classification-based method for detecting glottal closure instants (GCI) in pathological acoustic speech signals, which has many applications in vocal disorder analysis.