Search Results for author: Gurunath Reddy M

Found 6 papers, 2 papers with code

One-shot Localization and Segmentation of Medical Images with Foundation Models

no code implementations • 28 Oct 2023 • Deepa Anand, Gurunath Reddy M, Vanika Singhal, Dattesh D. Shanbhag, Shriram KS, Uday Patil, Chitresh Bhushan, Kavitha Manickam, Dawei Gui, Rakesh Mullick, Avinash Gopal, Parminder Bhatia, Taha Kass-Hout

Recent advances in Vision Transformers (ViT) and Stable Diffusion (SD) models with their ability to capture rich semantic features of the image have been used for image correspondence tasks on natural images.

Segmentation Semantic Segmentation

Paper
Add Code

Deep Attention-Based Alignment Network for Melody Generation from Incomplete Lyrics

no code implementations • 23 Jan 2023 • Gurunath Reddy M, Zhe Zhang, Yi Yu, Florian Harscoet, Simon Canales, Suhua Tang

We propose a deep attention-based alignment network, which aims to automatically predict lyrics and melody with given incomplete lyrics as input in a way similar to the music creation of humans.

Deep Attention

Paper
Add Code

Knowledge Distillation for Singing Voice Detection

1 code implementation • 9 Nov 2020 • Soumava Paul, Gurunath Reddy M, K Sreenivasa Rao, Partha Pratim Das

Singing Voice Detection (SVD) has been an active area of research in music information retrieval (MIR).

Information Retrieval Knowledge Distillation +3

Paper
Code

Learning to Recognize Code-switched Speech Without Forgetting Monolingual Speech Recognition

no code implementations • 1 Jun 2020 • Sanket Shah, Basil Abraham, Gurunath Reddy M, Sunayana Sitaram, Vikas Joshi

In this work, we show that fine-tuning ASR models on code-switched speech harms performance on monolingual speech.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

hf0: A hybrid pitch extraction method for multimodal voice

1 code implementation • 22 Apr 2019 • Pradeep Rengaswamy, Gurunath Reddy M, Krothapalli Sreenivasa Rao

The proposed hybrid model exploits the advantages of deep learning and signal processing methods to minimize the pitch detection error and adopts to various modes of acoustic signal.

Paper
Code

Glottal Closure Instants Detection From Pathological Acoustic Speech Signal Using Deep Learning

no code implementations • 25 Nov 2018 • Gurunath Reddy M, Tanumay Mandal, Krothapalli Sreenivasa Rao

In this paper, we propose a classification based glottal closure instants (GCI) detection from pathological acoustic speech signal, which finds many applications in vocal disorder analysis.

General Classification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.