Search Results for author: Maurizio Omologo

Found 15 papers, 6 papers with code

The DIRHA simulated corpus

no code implementations • LREC 2014 • Luca Cristoforetti, Mirco Ravanelli, Maurizio Omologo, Aless Sosi, ro, Alberto Abad, Martin Hagmueller, Petros Maragos

This paper describes a multi-microphone multi-language acoustic corpus being developed under the EC project Distant-speech Interaction for Robust Home Applications (DIRHA).

Dialogue Management Distant Speech Recognition +2

Paper
Add Code

A network of deep neural networks for distant speech recognition

no code implementations • 23 Mar 2017 • Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

Despite the remarkable progress recently made in distant speech recognition, state-of-the-art technology still suffers from a lack of robustness, especially when adverse acoustic conditions characterized by non-stationary noises and reverberation are met.

Distant Speech Recognition Speech Enhancement +1

Paper
Add Code

Batch-normalized joint training for DNN-based distant speech recognition

no code implementations • 24 Mar 2017 • Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

Improving distant speech recognition is a crucial step towards flexible human-machine interfaces.

Distant Speech Recognition Speech Enhancement +1

Paper
Add Code

Improving speech recognition by revising gated recurrent units

1 code implementation • 29 Sep 2017 • Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

First, we suggest to remove the reset gate in the GRU design, resulting in a more efficient single-gate architecture.

speech-recognition Speech Recognition

Paper
Code

The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments

2 code implementations • 6 Oct 2017 • Mirco Ravanelli, Maurizio Omologo

This paper introduces the contents and the possible usage of the DIRHA-ENGLISH multi-microphone corpus, recently realized under the EC DIRHA project.

Distant Speech Recognition speech-recognition

Paper
Code

Contaminated speech training methods for robust DNN-HMM distant speech recognition

1 code implementation • 10 Oct 2017 • Mirco Ravanelli, Maurizio Omologo

Despite the significant progress made in the last years, state-of-the-art speech recognition technologies provide a satisfactory performance only in the close-talking condition.

Distant Speech Recognition Speech Enhancement +1

Paper
Code

Realistic multi-microphone data simulation for distant speech recognition

1 code implementation • 26 Nov 2017 • Mirco Ravanelli, Piergiorgio Svaizer, Maurizio Omologo

The availability of realistic simulated corpora is of key importance for the future progress of distant speech recognition technology.

Audio and Speech Processing Sound

Paper
Code

Light Gated Recurrent Units for Speech Recognition

1 code implementation • 26 Mar 2018 • Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

A field that has directly benefited from the recent advances in deep learning is Automatic Speech Recognition (ASR).

Ranked #6 on Speech Recognition on TIMIT

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Code

Automatic context window composition for distant speech recognition

no code implementations • 26 May 2018 • Mirco Ravanelli, Maurizio Omologo

Distant speech recognition is being revolutionized by deep learning, that has contributed to significantly outperform previous HMM-GMM systems.

Distant Speech Recognition speech-recognition

Paper
Add Code

DiPCo -- Dinner Party Corpus

no code implementations • 30 Sep 2019 • Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko, Cirenia Huerta, Tinh Nguyen, Xuewen Luo, Björn Hoffmeister, Jan Trmal, Maurizio Omologo, Roland Maas

We present a speech data corpus that simulates a "dinner party" scenario taking place in an everyday home environment.

Benchmarking

Paper
Add Code

Sample Drop Detection for Distant-speech Recognition with Asynchronous Devices Distributed in Space

1 code implementation • 15 Nov 2019 • Tina Raissi, Santiago Pascual, Maurizio Omologo

The candidate time windows are selected from a set of large time intervals, possibly including a sample drop, and by using a preprocessing step.

Sound Audio and Speech Processing I.2.7

Paper
Code

Multi-Channel Transformer Transducer for Speech Recognition

no code implementations • 30 Aug 2021 • Feng-Ju Chang, Martin Radfar, Athanasios Mouchtaris, Maurizio Omologo

In this paper, we present a novel speech recognition model, Multi-Channel Transformer Transducer (MCTT), which features end-to-end multi-channel training, low computation cost, and low latency so that it is suitable for streaming decoding in on-device speech recognition.

speech-recognition Speech Recognition

Paper
Add Code

Context-Aware Transformer Transducer for Speech Recognition

no code implementations • 5 Nov 2021 • Feng-Ju Chang, Jing Liu, Martin Radfar, Athanasios Mouchtaris, Maurizio Omologo, Ariya Rastrow, Siegfried Kunzmann

We also leverage both BLSTM and pretrained BERT based models to encode contextual data and guide the network training.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

A neural prosody encoder for end-ro-end dialogue act classification

no code implementations • 11 May 2022 • Kai Wei, Dillon Knox, Martin Radfar, Thanh Tran, Markus Muller, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris, Maurizio Omologo

Dialogue act classification (DAC) is a critical task for spoken language understanding in dialogue systems.

Dialogue Act Classification Spoken Language Understanding

Paper
Add Code

Leveraging Redundancy in Multiple Audio Signals for Far-Field Speech Recognition

no code implementations • 1 Mar 2023 • Feng-Ju Chang, Anastasios Alexandridis, Rupak Vignesh Swaminathan, Martin Radfar, Harish Mallidi, Maurizio Omologo, Athanasios Mouchtaris, Brian King, Roland Maas

We augment the MC fusion networks to a conformer transducer model and train it in an end-to-end fashion.

Acoustic echo cancellation Automatic Speech Recognition +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.