no code implementations • 25 Sep 2023 • Krishna Subramani, Jean-Marc Valin, Jan Buethe, Paris Smaragdis, Mike Goodwin
Pitch estimation is an essential step of many speech processing algorithms, including speech coding, synthesis, and enhancement.
no code implementations • 8 Dec 2022 • Ahmed Mustafa, Jean-Marc Valin, Jan Büthe, Paris Smaragdis, Mike Goodwin
GAN vocoders are currently one of the state-of-the-art methods for building high-quality neural waveform generative models.
no code implementations • 18 Jun 2022 • Zhepei Wang, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Jean-Marc Valin, Paris Smaragdis, Mike Goodwin, Arvindh Krishnaswamy
In this work, we propose Exformer, a time-domain architecture for target speaker extraction.