1 code implementation • 6 Apr 2021 • Anton Mitrofanov, Mariya Korenevskaya, Ivan Podluzhny, Yuri Khokhlov, Aleksandr Laptev, Andrei Andrusenko, Aleksei Ilin, Maxim Korenevsky, Ivan Medennikov, Aleksei Romanenko
We propose a novel rescoring approach, which processes the entire lattice in a single call to the model.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • 14 May 2020 • Ivan Medennikov, Maxim Korenevsky, Tatiana Prisyach, Yuri Khokhlov, Mariya Korenevskaya, Ivan Sorokin, Tatiana Timofeeva, Anton Mitrofanov, Andrei Andrusenko, Ivan Podluzhny, Aleksandr Laptev, Aleksei Romanenko
We propose a novel Target-Speaker Voice Activity Detection (TS-VAD) approach, which directly predicts an activity of each speaker on each time frame.
no code implementations • 15 Mar 2020 • Natalia Tomashenko, Yuri Khokhlov, Yannick Esteve
Experimental results on the TED-LIUM corpus show that the proposed adaptation technique can be effectively integrated into DNN and TDNN setups at different levels and provide additional gain in recognition performance: up to 6% of relative word error rate reduction (WERR) over the strong feature-space adaptation techniques based on maximum likelihood linear regression (fMLLR) speaker adapted DNN baseline, and up to 18% of relative WERR in comparison with a speaker independent (SI) DNN baseline model, trained on conventional features.
no code implementations • 19 Jul 2017 • Yuri Khokhlov, Natalia Tomashenko, Ivan Medennikov, Alexei Romanenko
The proposed approach is based on using high-level features from an automatic speech recognition (ASR) system, so called phoneme posterior based (PPB) features, for decoding.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • JEPTALNRECITAL 2016 • Natalia Tomashenko, Yuri Khokhlov, Anthony Larcher, Yannick Est{\`e}ve
L{'}{\'e}tude pr{\'e}sent{\'e}e dans cet article am{\'e}liore une m{\'e}thode r{\'e}cemment propos{\'e}e pour l{'}adaptation de mod{\`e}les acoustiques markoviens coupl{\'e}s {\`a} un r{\'e}seau de neurones profond (DNN-HMM).