Search Results for author: Anton Ratnarajah

Found 7 papers, 2 papers with code

AV-RIR: Audio-Visual Room Impulse Response Estimation

no code implementations30 Nov 2023 Anton Ratnarajah, Sreyan Ghosh, Sonal Kumar, Purva Chiniya, Dinesh Manocha

We propose AV-RIR, a novel multi-modal multi-task learning approach to accurately estimate the RIR from a given reverberant speech signal and the visual cues of its corresponding environment.

Multi-Task Learning Room Impulse Response (RIR) +1

AdVerb: Visually Guided Audio Dereverberation

no code implementations ICCV 2023 Sanjoy Chowdhury, Sreyan Ghosh, Subhrajyoti Dasgupta, Anton Ratnarajah, Utkarsh Tyagi, Dinesh Manocha

We present AdVerb, a novel audio-visual dereverberation framework that uses visual cues in addition to the reverberant sound to estimate clean audio.

Speaker Verification Speech Enhancement +2

Listen2Scene: Interactive material-aware binaural sound propagation for reconstructed 3D scenes

no code implementations2 Feb 2023 Anton Ratnarajah, Dinesh Manocha

We propose a novel neural-network-based binaural sound propagation method to generate acoustic effects for indoor 3D models of real environments.

Generative Adversarial Network

Towards Improved Room Impulse Response Estimation for Speech Recognition

no code implementations8 Nov 2022 Anton Ratnarajah, Ishwarya Ananthabhotla, Vamsi Krishna Ithapu, Pablo Hoffmann, Dinesh Manocha, Paul Calamia

We propose a novel approach for blind room impulse response (RIR) estimation systems in the context of a downstream application scenario, far-field automatic speech recognition (ASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes

2 code implementations18 May 2022 Anton Ratnarajah, Zhenyu Tang, Rohith Chandrashekar Aralikatti, Dinesh Manocha

We show that the acoustic metrics of the IRs predicted from our MESH2IR match the ground truth with less than 10% error.

2k Speech Dereverberation +1

FAST-RIR: Fast neural diffuse room impulse response generator

2 code implementations7 Oct 2021 Anton Ratnarajah, Shi-Xiong Zhang, Meng Yu, Zhenyu Tang, Dinesh Manocha, Dong Yu

We present a neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Cannot find the paper you are looking for? You can Submit a new open access paper.