no code implementations • 5 Sep 2019 • Xavier Bost, Serigne Gueye, Vincent Labatut, Martha Larson, Georges Linarès, Damien Malinas, Raphaël Roth
In this paper, we tackle plot modeling by considering the social network of interactions between the characters involved in the narrative: substantial, durable changes in a major character's social environment suggest a new development relevant for the summary.
no code implementations • 17 Jun 2019 • Titouan Parcollet, Mohamed Morchid, Georges Linarès, Renato de Mori
Deep neural networks (DNNs) and more precisely recurrent neural networks (RNNs) are at the core of modern automatic speech recognition systems, due to their efficiency to process input sequences.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
1 code implementation • 20 May 2019 • Noé Cecillon, Vincent Labatut, Richard Dufour, Georges Linarès
In recent years, online social networks have allowed worldwide users to meet and discuss.
no code implementations • 13 Apr 2019 • Titouan Parcollet, Mohamed Morchid, Xavier Bost, Georges Linarès
TRS transcripts are only used to measure the performances of ASR systems.
Generative Adversarial Network Spoken Language Understanding
no code implementations • 18 Dec 2018 • Xavier Bost, Georges Linarès, Serigne Gueye
Speaker diarization may be difficult to achieve when applied to narrative films, where speakers usually talk in adverse acoustic conditions: background music, sound effects, wide variations in intonation may hide the inter-speaker variability and make audio-based speaker diarization approaches error prone.
no code implementations • 21 Nov 2018 • Titouan Parcollet, Mirco Ravanelli, Mohamed Morchid, Georges Linarès, Renato de Mori
Neural network architectures are at the core of powerful automatic speech recognition systems (ASR).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
2 code implementations • 6 Nov 2018 • Titouan Parcollet, Mohamed Morchid, Georges Linarès, Renato de Mori
Recurrent neural networks (RNN) are at the core of modern automatic speech recognition (ASR) systems.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
1 code implementation • 20 Jun 2018 • Titouan Parcollet, Ying Zhang, Mohamed Morchid, Chiheb Trabelsi, Georges Linarès, Renato De Mori, Yoshua Bengio
Quaternion numbers and quaternion neural networks have shown their efficiency to process multidimensional inputs as entities, to encode internal dependencies, and to solve many tasks with less learning parameters than real-valued models.
Ranked #19 on Speech Recognition on TIMIT
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
3 code implementations • ICLR 2019 • Titouan Parcollet, Mirco Ravanelli, Mohamed Morchid, Georges Linarès, Chiheb Trabelsi, Renato de Mori, Yoshua Bengio
Recurrent neural networks (RNNs) are powerful architectures to model sequential data, due to their capability to learn short and long-term dependencies between the basic elements of a sequence.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 20 Mar 2017 • Mohamed Morchid, Juan-Manuel Torres-Moreno, Richard Dufour, Javier Ramírez-Rodríguez, Georges Linarès
One of the main difficulty in using topic model on huge data collection is related to the material resources (CPU time and memory) required for model estimate.
no code implementations • 11 Feb 2017 • Mohamed Bouaziz, Mohamed Morchid, Richard Dufour, Georges Linarès, Renato de Mori
Nevertheless, these RNNs process a single input stream in one (LSTM) or two (Bidirectional LSTM) directions.
1 code implementation • 25 Feb 2016 • Xavier Bost, Vincent Labatut, Serigne Gueye, Georges Linarès
In order to assess our method, we apply it to a new corpus of 3 popular TV series, and compare it to both standard approaches.
no code implementations • 17 Nov 2015 • Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès
In this paper, we propose two neural network models targeted to retrieve OOV PNs relevant to an audio document: (a) Document level Continuous Bag of Words (D-CBOW), (b) Document level Continuous Bag of Weighted Words (D-CBOW2).