no code implementations • 5 May 2023 • Srđan Kitić, Jérôme Daniel
In the present work, we propose to circumvent this problem by directly modeling the impulse responses constituting the GTVV time series. This not only relaxes the initial assumptions but also extracts the information therein in a more consistent and efficient manner, entering the realm of blind system identification.
no code implementations • 10 Mar 2022 • Jérôme Daniel, Srđan Kitić
Range estimation of a far-field sound source in a reverberant environment is notoriously difficult, hence most localization methods are only capable of estimating the source's Direction-of-Arrival (DoA).
no code implementations • 12 Oct 2021 • Srđan Kitić, Jérôme Daniel
We introduce and analyze the Generalized Time Domain Velocity Vector (GTVV), an extension of the previously presented acoustic multipath footprint extracted from Ambisonic recordings.
no code implementations • 8 Sep 2021 • Pierre-Amaury Grumiaux, Srđan Kitić, Laurent Girin, Alexandre Guérin
This article is a survey on deep learning methods for single and multiple sound source localization.
no code implementations • 2 Jun 2020 • Amélie Bosca, Alexandre Guérin, Lauréline Perotin, Srđan Kitić
We present a CNN architecture for speech enhancement from multichannel first-order Ambisonics mixtures.
no code implementations • 23 Jan 2020 • Srđan Kitić, Gilles Puy, Patrick Pérez, Philippe Gilberton
We consider the problem of identifying people on the basis of their walk (gait) pattern.