Search Results for author: Dejan Markovic

Found 6 papers, 2 papers with code

Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio

1 code implementation • NeurIPS 2023 • Xudong Xu, Dejan Markovic, Jacob Sandakly, Todd Keebler, Steven Krenn, Alexander Richard

While 3D human body modeling has received much attention in computer vision, modeling the acoustic equivalent, i.e., modeling 3D spatial audio produced by body motion and speech, has fallen short in the community.

Position

Reconstructing the Dynamic Directivity of Unconstrained Speech

no code implementations • 9 Sep 2022 • Camille Noufi, Dejan Markovic, Peter Dodds

This virtual array is used to measure and encode the high-resolution directivity pattern of the speech signal as it evolves dynamically with natural speech and movement.

End-to-End Binaural Speech Synthesis

no code implementations • 8 Jul 2022 • Wen Chin Huang, Dejan Markovic, Alexander Richard, Israel Dejene Gebru, Anjali Menon

In this work, we present an end-to-end binaural speech synthesis system that combines a low-bitrate audio codec with a powerful binaural decoder that is capable of accurate speech binauralization while faithfully reconstructing environmental factors like ambient noise or reverb.

Speech Synthesis

Implicit Neural Spatial Filtering for Multichannel Source Separation in the Waveform Domain

no code implementations • 30 Jun 2022 • Dejan Markovic, Alexandre Defossez, Alexander Richard

We present a single-stage causal waveform-to-waveform multichannel model that can separate moving sound sources based on their broad spatial locations in a dynamic acoustic scene.

Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis

1 code implementation • CVPR 2022 • Karren Yang, Dejan Markovic, Steven Krenn, Vasu Agrawal, Alexander Richard

Since facial actions such as lip movements contain significant information about speech content, it is not surprising that audio-visual speech enhancement methods are more accurate than their audio-only counterparts.

Speech Enhancement

Neural Synthesis of Binaural Audio

no code implementations • ICLR 2021 • Alexander Richard, Dejan Markovic, Israel D. Gebru, Steven Krenn, Gladstone Alexander Butler, Fernando Torre, Yaser Sheikh

We present a neural rendering approach for binaural sound synthesis that can produce realistic and spatially accurate binaural sound in real time.

Neural Rendering • Position
