Search Results for author: Paul Calamia

Found 9 papers, 2 papers with code

Blind Localization of Room Reflections with Application to Spatial Audio

no code implementations • 21 Dec 2023 • Yogev Hadadi, Vladimir Tourbabin, Paul Calamia, Boaz Rafaely

Additionally, performance in terms of perception is investigated through a listening test.

Paper
Add Code

Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations

no code implementations • CVPR 2023 • Sagnik Majumder, Hao Jiang, Pierre Moulon, Ethan Henderson, Paul Calamia, Kristen Grauman, Vamsi Krishna Ithapu

Can conversational videos captured from multiple egocentric viewpoints reveal the map of a scene in a cost-efficient way?

Paper
Add Code

Towards Improved Room Impulse Response Estimation for Speech Recognition

no code implementations • 8 Nov 2022 • Anton Ratnarajah, Ishwarya Ananthabhotla, Vamsi Krishna Ithapu, Pablo Hoffmann, Dinesh Manocha, Paul Calamia

We propose a novel approach for blind room impulse response (RIR) estimation systems in the context of a downstream application scenario, far-field automatic speech recognition (ASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Direct and Residual Subspace Decomposition of Spatial Room Impulse Responses

1 code implementation • 20 Jul 2022 • Thomas Deppisch, Sebastià V. Amengual Garí, Paul Calamia, Jens Ahrens

This work proposes a subspace method that decomposes SRIRs into a direct part, which comprises the direct sound and the salient reflections, and a residual, to facilitate enhanced analysis and rendering methods by providing individual access to these components.

Direction of Arrival Estimation

Paper
Code

SAQAM: Spatial Audio Quality Assessment Metric

no code implementations • 24 Jun 2022 • Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel D. Gebru, Vamsi K. Ithapu, Paul Calamia

Audio quality assessment is critical for assessing the perceptual realism of sounds.

Multi-Task Learning Speech Enhancement

Paper
Add Code

SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning

2 code implementations • 16 Jun 2022 • Changan Chen, Carl Schissler, Sanchit Garg, Philip Kobernik, Alexander Clegg, Paul Calamia, Dhruv Batra, Philip W Robinson, Kristen Grauman

We introduce SoundSpaces 2. 0, a platform for on-the-fly geometry-based audio rendering for 3D environments.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

311

Paper
Code

Visual Acoustic Matching

no code implementations • CVPR 2022 • Changan Chen, Ruohan Gao, Paul Calamia, Kristen Grauman

We introduce the visual acoustic matching task, in which an audio clip is transformed to sound like it was recorded in a target environment.

Paper
Add Code

Filtered Noise Shaping for Time Domain Room Impulse Response Estimation From Reverberant Speech

no code implementations • 15 Jul 2021 • Christian J. Steinmetz, Vamsi Krishna Ithapu, Paul Calamia

Deep learning approaches have emerged that aim to transform an audio signal so that it sounds as if it was recorded in the same room as a reference recording, with applications both in audio post-production and augmented reality.

Room Impulse Response (RIR)

Paper
Add Code

DPLM: A Deep Perceptual Spatial-Audio Localization Metric

no code implementations • 29 May 2021 • Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel D. Gebru, Vamsi K. Ithapu, Paul Calamia

Subjective evaluations are critical for assessing the perceptual realism of sounds in audio-synthesis driven technologies like augmented and virtual reality.

Audio Synthesis

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.