no code implementations • 26 Sep 2024 • Artem Dementyev, Chandan K. A. Reddy, Scott Wisdom, Navin Chatlani, John R. Hershey, Richard F. Lyon
Low latency models are critical for real-time speech enhancement applications, such as hearing aids and hearables.
no code implementations • 5 Apr 2022 • Alessandro Ragano, Emmanouil Benetos, Michael Chinen, Helard B. Martinez, Chandan K. A. Reddy, Jan Skoglund, Andrew Hines
In this paper, we evaluate several MOS predictors based on wav2vec 2. 0 and the NISQA speech quality prediction model to explore the role of the training data, the influence of the system type, and the role of cross-domain features in SSL models.
no code implementations • 8 Oct 2021 • Chandan K. A. Reddy, Vishak Gopa, Harishchandra Dubey, Sergiy Matusevych, Ross Cutler, Robert Aichner
With the recent growth of remote work, online meetings often encounter challenging audio contexts such as background noise, music, and echo.
no code implementations • 22 Jan 2021 • Sebastian Braun, Hannes Gamper, Chandan K. A. Reddy, Ivan Tashev
It is shown that the achievable speech quality is a function of network complexity, and show which models have better tradeoffs.
1 code implementation • 16 May 2020 • Chandan K. A. Reddy, Vishak Gopal, Ross Cutler, Ebrahim Beyrami, Roger Cheng, Harishchandra Dubey, Sergiy Matusevych, Robert Aichner, Ashkan Aazami, Sebastian Braun, Puneet Rana, Sriram Srinivasan, Johannes Gehrke
In this challenge, we open-sourced a large clean speech and noise corpus for training the noise suppression models and a representative test set to real-world scenarios consisting of both synthetic and real recordings.
1 code implementation • 23 Jan 2020 • Chandan K. A. Reddy, Ebrahim Beyrami, Harishchandra Dubey, Vishak Gopal, Roger Cheng, Ross Cutler, Sergiy Matusevych, Robert Aichner, Ashkan Aazami, Sebastian Braun, Puneet Rana, Sriram Srinivasan, Johannes Gehrke
In this challenge, we open-source a large clean speech and noise corpus for training the noise suppression models and a representative test set to real-world scenarios consisting of both synthetic and real recordings.
no code implementations • 17 Sep 2019 • Chandan K. A. Reddy, Ebrahim Beyrami, Jamie Pool, Ross Cutler, Sriram Srinivasan, Johannes Gehrke
Our subjective MOS evaluation is the first large scale evaluation of Speech Enhancement algorithms that we are aware of.
no code implementations • 3 Jul 2019 • Chandan K. A. Reddy, Ross Cutler, Johannes Gehrke
The user feedback after the call can act as the ground truth labels for training a supervised classifier on a large audio dataset.