Greatest papers with code

Quaternion Neural Networks for Multi-channel Distant Speech Recognition

18 May 2020 · mravanelli/pytorch-kaldi

In this paper, we propose to capture these inter- and intra- structural dependencies with quaternion neural networks, which can jointly process multiple signals as whole quaternion entities.

DISTANT SPEECH RECOGNITION

Interpretable Convolutional Filters with SincNet

23 Nov 2018 · mravanelli/pytorch-kaldi

Deep learning is currently playing a crucial role toward higher levels of artificial intelligence.

DISTANT SPEECH RECOGNITION

The PyTorch-Kaldi Speech Recognition Toolkit

19 Nov 2018 · mravanelli/pytorch-kaldi

Experiments conducted on several datasets and tasks show that PyTorch-Kaldi can effectively be used to develop modern state-of-the-art speech recognizers.

DISTANT SPEECH RECOGNITION, NOISY SPEECH RECOGNITION

Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks

6 Apr 2019 · santi-pdp/pase

Learning good representations without supervision is still an open issue in machine learning, and is particularly challenging for speech signals, which are often characterized by long sequences with a complex hierarchical structure.

DISTANT SPEECH RECOGNITION, HIERARCHICAL STRUCTURE

Contaminated speech training methods for robust DNN-HMM distant speech recognition

10 Oct 2017 · mravanelli/pySpeechRev

Despite the significant progress made in recent years, state-of-the-art speech recognition technologies provide satisfactory performance only in the close-talking condition.

DISTANT SPEECH RECOGNITION, SPEECH ENHANCEMENT

The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments

6 Oct 2017 · mravanelli/pySpeechRev

This paper introduces the contents and the possible usage of the DIRHA-ENGLISH multi-microphone corpus, recently realized under the EC DIRHA project.

DISTANT SPEECH RECOGNITION

Residual LSTM: Design of a Deep Recurrent Architecture for Distant Speech Recognition

10 Jan 2017 · kdgutier/esrnn_torch

The residual LSTM provides an additional spatial shortcut path from lower layers for efficient training of deep networks with multiple LSTM layers.

DISTANT SPEECH RECOGNITION