Search Results for author: Ross Cutler

Found 15 papers, 5 papers with code

Interspeech 2021 Deep Noise Suppression Challenge

1 code implementation • 6 Jan 2021 • Chandan K A Reddy, Harishchandra Dubey, Kazuhito Koishida, Arun Nair, Vishak Gopal, Ross Cutler, Sebastian Braun, Hannes Gamper, Robert Aichner, Sriram Srinivasan

In this version of the challenge organized at INTERSPEECH 2021, we are expanding both our training and test datasets to accommodate full band scenarios.

Denoising • Speech Quality

DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors

no code implementations • 28 Oct 2020 • Chandan K A Reddy, Vishak Gopal, Ross Cutler

The no-reference approaches correlate poorly with human ratings and are not widely adopted in the research community.

Speech Quality

Subjective Evaluation of Noise Suppression Algorithms in Crowdsourcing

1 code implementation • 25 Oct 2020 • Babak Naderi, Ross Cutler

The quality of speech communication systems, which include noise suppression algorithms, is typically evaluated in laboratory experiments according to the ITU-T Rec.

Speech Quality

Crowdsourcing approach for subjective evaluation of echo impairment

1 code implementation • 25 Oct 2020 • Ross Cutler, Babak Naderi, Markus Loide, Sten Sootla, Ando Saabas

The quality of acoustic echo cancellers (AECs) in real-time communication systems is typically evaluated using objective metrics like ERLE and PESQ, and less commonly with lab-based subjective tests like ITU-T Rec.

ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets and Testing Framework

no code implementations • 10 Sep 2020 • Kusha Sridhar, Ross Cutler, Ando Saabas, Tanel Parnamaa, Hannes Gamper, Sebastian Braun, Robert Aichner, Sriram Srinivasan

In this challenge, we open source two large datasets to train AEC models under both single talk and double talk scenarios.

Acoustic echo cancellation • Audio and Speech Processing • Sound

The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Testing Framework, and Challenge Results

no code implementations • 16 May 2020 • Chandan K. A. Reddy, Vishak Gopal, Ross Cutler, Ebrahim Beyrami, Roger Cheng, Harishchandra Dubey, Sergiy Matusevych, Robert Aichner, Ashkan Aazami, Sebastian Braun, Puneet Rana, Sriram Srinivasan, Johannes Gehrke

In this challenge, we open-sourced a large clean speech and noise corpus for training the noise suppression models and a representative test set to real-world scenarios consisting of both synthetic and real recordings.

Speech Enhancement

Multimodal active speaker detection and virtual cinematography for video conferencing

no code implementations • 10 Feb 2020 • Ross Cutler, Ramin Mehran, Sam Johnson, Cha Zhang, Adam Kirk, Oliver Whyte, Adarsh Kowdle

Active speaker detection (ASD) and virtual cinematography (VC) can significantly improve the remote user experience of a video conference by automatically panning, tilting and zooming of a video conferencing camera: users subjectively rate an expert video cinematographer's video significantly higher than unedited video.

The INTERSPEECH 2020 Deep Noise Suppression Challenge: Datasets, Subjective Speech Quality and Testing Framework

no code implementations • 23 Jan 2020 • Chandan K. A. Reddy, Ebrahim Beyrami, Harishchandra Dubey, Vishak Gopal, Roger Cheng, Ross Cutler, Sergiy Matusevych, Robert Aichner, Ashkan Aazami, Sebastian Braun, Puneet Rana, Sriram Srinivasan, Johannes Gehrke

In this challenge, we open-source a large clean speech and noise corpus for training the noise suppression models and a representative test set to real-world scenarios consisting of both synthetic and real recordings.

Speech Enhancement • Speech Quality

Reinforcement learning for bandwidth estimation and congestion control in real-time communications

no code implementations • 4 Dec 2019 • Joyce Fang, Martin Ellis, Bin Li, Siyao Liu, Yasaman Hosseinkashi, Michael Revow, Albert Sadovnikov, Ziyuan Liu, Peng Cheng, Sachin Ashok, David Zhao, Ross Cutler, Yan Lu, Johannes Gehrke

Bandwidth estimation and congestion control for real-time communications (i.e., audio and video conferencing) remains a difficult problem, despite many years of research.

A scalable noisy speech dataset and online subjective test framework

no code implementations • 17 Sep 2019 • Chandan K. A. Reddy, Ebrahim Beyrami, Jamie Pool, Ross Cutler, Sriram Srinivasan, Johannes Gehrke

Our subjective MOS evaluation is the first large scale evaluation of Speech Enhancement algorithms that we are aware of.

Speech Enhancement

Supervised Classifiers for Audio Impairments with Noisy Labels

no code implementations • 3 Jul 2019 • Chandan K. A. Reddy, Ross Cutler, Johannes Gehrke

The user feedback after the call can act as the ground truth labels for training a supervised classifier on a large audio dataset.

On Design of Problem Token Questions in Quality of Experience Surveys

no code implementations • 19 Aug 2018 • Jayant Gupchup, Ebrahim Beyrami, Martin Ellis, Yasaman Hosseinkashi, Sam Johnson, Ross Cutler

Based on 900,000 calls gathered using a randomized controlled experiment from a live system, we find that the order bias can be significantly reduced by randomizing the display order of tokens.
