TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Speech Dereverberation	WHAMR!	Conv-TasNet DAE	SI-SDR	12.03	# 2
Speech Dereverberation	WHAMR!	Conv-TasNet DAE	PESQ	3.46	# 2
Speech Dereverberation	WHAMR!	Conv-TasNet DAE	SI-SDRi	7.63	# 1
Speech Dereverberation	WHAMR!	Conv-TasNet DAE	ESTOI	93	# 2
Speech Dereverberation	WHAMR!	Conv-TasNet DAE	SRMR	8.7	# 2
Speech Dereverberation	WHAMR_ext	Conv-TasNet DAE	SI-SDR	7.07	# 1
Speech Dereverberation	WHAMR_ext	Conv-TasNet DAE	SI-SDRi	10.81	# 1
Speech Dereverberation	WHAMR_ext	Conv-TasNet DAE	PESQ	2.46	# 1
Speech Dereverberation	WHAMR_ext	Conv-TasNet DAE	ESTOI	81	# 1
Speech Dereverberation	WHAMR_ext	Conv-TasNet DAE	SRMR	9.18	# 1
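
For readers unfamiliar with the SI-SDR and SI-SDRi rows above, the following is a minimal sketch of how these two metrics are commonly computed. It is not taken from the paper's code: the function names `si_sdr` and `si_sdr_improvement`, the mean removal step, and the `eps` stabiliser are assumptions made for illustration, following the usual textbook definition (project the estimate onto the reference, then compare target energy to residual energy in dB).

```python
import math


def si_sdr(estimate, reference, eps=1e-8):
    """Scale-invariant signal-to-distortion ratio in dB (illustrative sketch)."""
    if len(estimate) != len(reference):
        raise ValueError("estimate and reference must have the same length")

    # Zero-mean both signals (a common convention, assumed here).
    mean_est = sum(estimate) / len(estimate)
    mean_ref = sum(reference) / len(reference)
    est = [x - mean_est for x in estimate]
    ref = [x - mean_ref for x in reference]

    # Project the estimate onto the reference to get the scaled target.
    dot = sum(e * r for e, r in zip(est, ref))
    ref_energy = sum(r * r for r in ref)
    scale = dot / (ref_energy + eps)
    target = [scale * r for r in ref]

    # Everything in the estimate that is not the scaled target counts as distortion.
    residual = [e - t for e, t in zip(est, target)]
    target_energy = sum(t * t for t in target)
    residual_energy = sum(n * n for n in residual)
    return 10.0 * math.log10((target_energy + eps) / (residual_energy + eps))


def si_sdr_improvement(estimate, mixture, reference):
    """SI-SDRi: gain in SI-SDR of the model output over the unprocessed input."""
    return si_sdr(estimate, reference) - si_sdr(mixture, reference)
```

Under this definition SI-SDRi can exceed the output SI-SDR whenever the unprocessed input has a negative SI-SDR, which is consistent with the WHAMR_ext rows (SI-SDR 7.07, SI-SDRi 10.81).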

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/receptive-field-analysis-of-temporal/speech-dereverberation-on-whamr-ext)](https://paperswithcode.com/sota/speech-dereverberation-on-whamr-ext?p=receptive-field-analysis-of-temporal)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/receptive-field-analysis-of-temporal/speech-dereverberation-on-whamr)](https://paperswithcode.com/sota/speech-dereverberation-on-whamr?p=receptive-field-analysis-of-temporal)`

Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation

13 Apr 2022 · William Ravenscroft, Stefan Goetze, Thomas Hain

Speech dereverberation is often an important requirement in robust speech processing tasks. Supervised deep learning (DL) models give state-of-the-art performance for single-channel speech dereverberation. Temporal convolutional networks (TCNs) are commonly used for sequence modelling in speech enhancement tasks. A feature of TCNs is that their receptive field (RF), which depends on the specific model configuration, determines the number of input frames that can be observed to produce an individual output frame. It has been shown that TCNs are capable of performing dereverberation of simulated speech data; however, a thorough analysis, especially one focused on the RF, is still lacking in the literature. This paper analyses dereverberation performance as a function of the model size and the RF of TCNs. Experiments using the WHAMR corpus, extended to include room impulse responses (RIRs) with larger RT60 values, demonstrate that a larger RF can significantly improve performance when training smaller TCN models. It is also demonstrated that TCNs benefit from a wider RF when dereverberating speech reverberated by RIRs with larger RT60 values.
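
To make the dependence of the RF on the model configuration concrete, here is a minimal sketch of how the RF of a dilated TCN could be computed. It assumes a Conv-TasNet-style layout, in which each stack contains convolutional blocks with dilation doubling from block to block (1, 2, 4, ...) and the stack is repeated several times; the abstract does not state the exact configuration, so the function names and all parameter values below are hypothetical.

```python
def tcn_receptive_field_frames(kernel_size, blocks_per_stack, num_stacks):
    """Receptive field, in encoder frames, of a stacked dilated TCN.

    Assumes dilation 2**i for the i-th block of each stack (an assumption,
    not confirmed by the source).
    """
    rf = 1  # a single output frame always sees at least one input frame
    for _ in range(num_stacks):
        for i in range(blocks_per_stack):
            dilation = 2 ** i
            # Each dilated convolution widens the view by (kernel_size - 1) * dilation.
            rf += (kernel_size - 1) * dilation
    return rf


def receptive_field_seconds(rf_frames, encoder_kernel, encoder_stride, sample_rate):
    """Convert a receptive field in frames to seconds of input audio."""
    # The first frame spans encoder_kernel samples; each further frame adds one stride.
    samples = encoder_kernel + (rf_frames - 1) * encoder_stride
    return samples / sample_rate


# Hypothetical configuration: kernel size 3, 8 blocks per stack, 3 stacks,
# encoder kernel of 16 samples with stride 8, audio sampled at 8 kHz.
frames = tcn_receptive_field_frames(kernel_size=3, blocks_per_stack=8, num_stacks=3)
seconds = receptive_field_seconds(frames, encoder_kernel=16, encoder_stride=8, sample_rate=8000)
print(frames, seconds)  # 1531 1.532
```

Under these assumptions the RF has the closed form 1 + R(P - 1)(2^X - 1) frames for R stacks, X blocks per stack, and kernel size P. Comparing the RF in seconds with the RT60 of a room gives a rough sense of whether the network can observe the whole reverberant tail when producing one output frame.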

Code

jwr1995/whamr_ext (official)

Tasks

Speech Dereverberation

Speech Enhancement

Datasets

Introduced in the Paper:

WHAMR_ext

Used in the Paper:

WHAM!

WHAMR!

Results from the Paper

Ranked #1 on Speech Dereverberation on WHAMR_ext

Methods

No methods listed for this paper.
