no code implementations • IWSLT (EMNLP) 2018 • Matthias Sperber, Ngoc-Quan Pham, Thai-Son Nguyen, Jan Niehues, Markus Müller, Thanh-Le Ha, Sebastian Stüker, Alex Waibel
The baseline system is a cascade of an ASR system, a system to segment the ASR output and a neural machine translation system.
no code implementations • IWSLT 2017 • Thai-Son Nguyen, Markus Müller, Matthias Sperber, Thomas Zenkel, Sebastian Stüker, Alex Waibel
For the English lecture task, our best combination system has a WER of 8. 3% on the tst2015 development set while our other combinations gained 25. 7% WER for German lecture tasks.
no code implementations • EMNLP (IWSLT) 2019 • Felix Schneider, Alex Waibel
In this paper, we describe KIT’s submission for the IWSLT 2019 shared task on text translation.
no code implementations • EAMT 2020 • Ondřej Bojar, Dominik Macháček, Sangeet Sagar, Otakar Smrž, Jonáš Kratochvíl, Ebrahim Ansari, Dario Franceschini, Chiara Canton, Ivan Simonini, Thai-Son Nguyen, Felix Schneider, Sebastian Stücker, Alex Waibel, Barry Haddow, Rico Sennrich, Philip Williams
ELITR (European Live Translator) project aims to create a speech translation system for simultaneous subtitling of conferences and online meetings targetting up to 43 languages.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • IWSLT 2017 • Matthias Sperber, Jan Niehues, Alex Waibel
We note that unlike our baseline model, models trained on noisy data are able to generate outputs of proper length even for noisy inputs, while gradually reducing output length for higher amount of noise, as might also be expected from a human translator.
no code implementations • IWSLT 2017 • Eunah Cho, Jan Niehues, Alex Waibel
Experiments show that generalizing rare and unknown words greatly improves the punctuation insertion performance, reaching up to 8. 8 points of improvement in F-score when applied to the out-of-domain test scenario.
no code implementations • IWSLT 2016 • Thai-Son Nguyen, Markus Müller, Matthias Sperber, Thomas Zenkel, Kevin Kilgour, Sebastian Stüker, Alex Waibel
For the English TED task, our best combination system has a WER of 7. 8% on the development set while our other combinations gained 21. 8% and 28. 7% WERs for the English and German MSLT tasks.
no code implementations • IWSLT 2016 • Eunah Cho, Jan Niehues, Thanh-Le Ha, Matthias Sperber, Mohammed Mediani, Alex Waibel
In addition, we investigated methods to combine NMT systems that encode the input as well as the output differently.
no code implementations • IWSLT 2016 • Eunah Cho, Jan Niehues, Thanh-Le Ha, Alex Waibel
In this paper, we investigate a multilingual approach for speech disfluency removal.
no code implementations • IWSLT 2016 • Markus Müller, Sebastian Stüker, Alex Waibel
For system training, we use additional data from French, German and Turkish.
no code implementations • 7 Nov 2024 • Ibrahim Said Ahmad, Antonios Anastasopoulos, Ondřej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, William Chen, Qianqian Dong, Marcello Federico, Barry Haddow, Dávid Javorský, Mateusz Krubiński, Tsz Kin Lam, Xutai Ma, Prashant Mathur, Evgeny Matusov, Chandresh Maurya, John McCrae, Kenton Murray, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, Atul Kr. Ojha, John Ortega, Sara Papi, Peter Polák, Adam Pospíšil, Pavel Pecina, Elizabeth Salesky, Nivedita Sethiya, Balaram Sarkar, Jiatong Shi, Claytone Sikasote, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Brian Thompson, Marco Turchi, Alex Waibel, Shinji Watanabe, Patrick Wilken, Petr Zemánek, Rodolfo Zevallos
This paper reports on the shared tasks organized by the 21st IWSLT Conference.
no code implementations • 26 Sep 2024 • Leonard Bärmann, Chad DeChant, Joana Plewnia, Fabian Peller-Konrad, Daniel Bauer, Tamim Asfour, Alex Waibel
Verbalization of robot experience, i. e., summarization of and question answering about a robot's past, is a crucial ability for improving human-robot interaction.
no code implementations • 20 Sep 2023 • Peter Polák, Brian Yan, Shinji Watanabe, Alex Waibel, Ondřej Bojar
Further, this method lacks mechanisms for \textit{controlling} the quality vs. latency tradeoff.
no code implementations • 8 Sep 2023 • Leonard Bärmann, Rainer Kartmann, Fabian Peller-Konrad, Jan Niehues, Alex Waibel, Tamim Asfour
In this paper, we propose a system to achieve incremental learning of complex behavior from natural interaction, and demonstrate its implementation on a humanoid robot.
no code implementations • 5 May 2023 • Zhong Zhou, Jan Niehues, Alex Waibel
We examine two approaches: 1. best selection of seed sentences to jump start translations in a new language in view of best generalization to the remainder of a larger targeted text(s), and 2. we adapt large general multilingual translation engines from many other languages to focus on a specific text in a new, unknown language.
no code implementations • 24 May 2022 • Ngoc-Quan Pham, Alex Waibel, Jan Niehues
Multilingual speech recognition with supervised learning has achieved great results as reflected in recent research.
no code implementations • MTSummit 2021 • Zhong Zhou, Alex Waibel
We compare the portion-based approach that optimizes coherence of the text locally with the random sampling approach that increases coverage of the text globally.
no code implementations • 12 Apr 2021 • Zhong Zhou, Alex Waibel
In other words, given a text in 124 source languages, we translate it into a severely low resource language using only ~1, 000 lines of low resource data without any external help.
no code implementations • EACL 2021 • Ond{\v{r}}ej Bojar, Dominik Mach{\'a}{\v{c}}ek, Sangeet Sagar, Otakar Smr{\v{z}}, Jon{\'a}{\v{s}} Kratochv{\'\i}l, Peter Pol{\'a}k, Ebrahim Ansari, Mohammad Mahmoudi, Rishu Kumar, Dario Franceschini, Chiara Canton, Ivan Simonini, Thai-Son Nguyen, Felix Schneider, Sebastian St{\"u}ker, Alex Waibel, Barry Haddow, Rico Sennrich, Philip Williams
This paper presents an automatic speech translation system aimed at live subtitling of conference presentations.
1 code implementation • 7 Oct 2020 • Thai-Son Nguyen, Sebastian Stueker, Alex Waibel
Achieving super-human performance in recognizing human speech has been a goal for several decades, as researchers have worked on increasingly challenging tasks.
no code implementations • WS 2020 • Ebrahim Ansari, Amittai Axelrod, Nguyen Bach, Ond{\v{r}}ej Bojar, Roldano Cattoni, Fahim Dalvi, Nadir Durrani, Marcello Federico, Christian Federmann, Jiatao Gu, Fei Huang, Kevin Knight, Xutai Ma, Ajay Nagesh, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Xing Shi, Sebastian St{\"u}ker, Marco Turchi, Alex Waibel, er, Changhan Wang
The evaluation campaign of the International Conference on Spoken Language Translation (IWSLT 2020) featured this year six challenge tracks: (i) Simultaneous speech translation, (ii) Video speech translation, (iii) Offline speech translation, (iv) Conversational speech translation, (v) Open domain translation, and (vi) Non-native speech translation.
no code implementations • WS 2020 • Felix Schneider, Alex Waibel, er
Simultaneous machine translation systems rely on a policy to schedule read and write operations in order to begin translating a source sentence before it is complete.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • WS 2020 • Ngoc-Quan Pham, Felix Schneider, Tuan-Nam Nguyen, Thanh-Le Ha, Thai Son Nguyen, Maximilian Awiszus, Sebastian St{\"u}ker, Alex Waibel, er
This paper describes KIT{'}s submissions to the IWSLT2020 Speech Translation evaluation campaign.
no code implementations • LREC 2020 • Dario Franceschini, Chiara Canton, Ivan Simonini, Armin Schweinfurth, Adelheid Glott, Sebastian St{\"u}ker, Thai-Son Nguyen, Felix Schneider, Thanh-Le Ha, Alex Waibel, Barry Haddow, Philip Williams, Rico Sennrich, Ond{\v{r}}ej Bojar, Sangeet Sagar, Dominik Mach{\'a}{\v{c}}ek, Otakar Smr{\v{z}}
This paper presents our progress towards deploying a versatile communication platform in the task of highly multilingual live speech translation for conferences and remote meetings live subtitling.
no code implementations • LREC 2020 • Juan Hussain, Oussama Zenkri, Sebastian St{\"u}ker, Alex Waibel
Collecting domain-specific data for under-resourced languages, e. g., dialects of languages, can be very expensive, potentially financially prohibitive and taking long time.
no code implementations • 8 Apr 2020 • Stefan Constantin, Alex Waibel
If yes, it corrects the second last utterance according to the error correction in the last utterance and outputs the extracted pairs of reparandum and repair entity.
no code implementations • 22 Mar 2020 • Thai-Son Nguyen, Ngoc-Quan Pham, Sebastian Stueker, Alex Waibel
However, when it comes to performing run-on recognition on an input stream of audio data while producing recognition results in real-time and with low word-based latency, these models face several challenges.
no code implementations • 22 Mar 2020 • Thai Son Nguyen, Jan Niehues, Eunah Cho, Thanh-Le Ha, Kevin Kilgour, Markus Muller, Matthias Sperber, Sebastian Stueker, Alex Waibel
User studies have shown that reducing the latency of our simultaneous lecture translation system should be the most important goal.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 9 Mar 2020 • Thai-Son Nguyen, Sebastian Stüker, Alex Waibel
We show that for the hybrid models, supplying additional training data from other domains with mismatched acoustic conditions does not increase the performance on specific domains.
no code implementations • 29 Nov 2019 • Verena Heusser, Niklas Freymuth, Stefan Constantin, Alex Waibel
Speech emotion recognition is a challenging task and an important step towards more natural human-machine interaction.
no code implementations • 7 Nov 2019 • Zhong Zhou, Lori Levin, David R. Mortensen, Alex Waibel
Firstly, we pool IGT for 1, 497 languages in ODIN (54, 545 glosses) and 70, 918 glosses in Arapaho and train a gloss-to-target NMT system from IGT to English, with a BLEU score of 25. 94.
no code implementations • 29 Oct 2019 • Thai-Son Nguyen, Sebastian Stueker, Jan Niehues, Alex Waibel
Sequence-to-Sequence (S2S) models recently started to show state-of-the-art performance for automatic speech recognition (ASR).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • WS 2019 • Stefan Constantin, Jan Niehues, Alex Waibel
The state-of-the-art neural network architectures make it possible to create spoken language understanding systems with high quality and fast processing time.
Natural Language Understanding Spoken Language Understanding
no code implementations • WS 2019 • Ngoc-Quan Pham, Jan Niehues, Thanh-Le Ha, Alex Waibel
We investigated the behaviour of such models on the standard IWSLT 2017 multilingual dataset.
no code implementations • ACL 2019 • Matthias Sperber, Graham Neubig, Ngoc-Quan Pham, Alex Waibel
Lattices are an efficient and effective method to encode ambiguity of upstream systems in natural language processing tasks, for example to compactly capture multiple speech recognition hypotheses, or to represent multiple linguistic analyses.
no code implementations • NAACL 2019 • Elizabeth Salesky, Matthias Sperber, Alex Waibel
Spoken language translation applications for speech suffer due to conversational speech phenomena, particularly the presence of disfluencies.
no code implementations • TACL 2019 • Matthias Sperber, Graham Neubig, Jan Niehues, Alex Waibel
Speech translation has traditionally been approached through cascaded models consisting of a speech recognizer trained on a corpus of transcribed speech, and a machine translation system trained on parallel texts.
no code implementations • 31 Mar 2019 • Thai-Son Nguyen, Sebastian Stueker, Alex Waibel
In this work, we learn a shared encoding representation for a multi-task neural network model optimized with connectionist temporal classification (CTC) and conventional framewise cross-entropy training criteria.
no code implementations • 2 Feb 2019 • Thai-Son Nguyen, Sebastian Stueker, Alex Waibel
Acoustic-to-word (A2W) models that allow direct mapping from acoustic signals to word sequences are an appealing approach to end-to-end automatic speech recognition due to their simplicity.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 17 Dec 2018 • Stefan Constantin, Jan Niehues, Alex Waibel
When building a neural network-based Natural Language Understanding component, one main challenge is to collect enough training data.
no code implementations • 7 Nov 2018 • Elizabeth Salesky, Susanne Burger, Jan Niehues, Alex Waibel
We introduce a corpus of cleaned target data for the Fisher Spanish-English dataset for this task.
no code implementations • WS 2018 • Ngoc-Quan Pham, Jan Niehues, Alex Waibel, er
We present our experiments in the scope of the news translation task in WMT 2018, in directions: English→German.
no code implementations • WS 2018 • Ngoc-Quan Pham, Jan Niehues, Alex Waibel
Neural machine translation (NMT) has significantly improved the quality of automatic translation models.
no code implementations • ACL 2019 • Zhong Zhou, Matthias Sperber, Alex Waibel
Our multi-paraphrase NMT that trains only on two languages outperforms the multilingual baselines.
no code implementations • 1 Aug 2018 • Jan Niehues, Ngoc-Quan Pham, Thanh-Le Ha, Matthias Sperber, Alex Waibel
After adaptation, we are able to reduce the number of corrections displayed during incremental output construction by 45%, without a decrease in translation quality.
no code implementations • COLING 2018 • Florian Dessloch, Thanh-Le Ha, Markus M{\"u}ller, Jan Niehues, Thai-Son Nguyen, Ngoc-Quan Pham, Elizabeth Salesky, Matthias Sperber, Sebastian St{\"u}ker, Thomas Zenkel, Alex Waibel, er
{\%} Combining these techniques, we are able to provide an adapted speech translation system for several European languages.
no code implementations • 27 Jul 2018 • Patrick Huber, Jan Niehues, Alex Waibel
Our approach overcomes recent limitations with extended narratives through a multi-layered computational approach to generate an abstract context representation.
1 code implementation • WS 2018 • Jörg Franke, Jan Niehues, Alex Waibel
Deep learning models are often not easily adaptable to new tasks and require task-specific adjustments.
no code implementations • 5 Jul 2018 • Markus Müller, Sebastian Stüker, Alex Waibel
Multilingual Speech Recognition is one of the most costly AI problems, because each language (7, 000+) and even different accents require their own acoustic models to obtain best recognition performance.
no code implementations • WS 2018 • Zhong Zhou, Matthias Sperber, Alex Waibel
The main challenges we identify are the lack of low-resource language data, effective methods for cross-lingual transfer, and the variable-binding problem that is common in neural systems.
1 code implementation • 26 Mar 2018 • Matthias Sperber, Jan Niehues, Graham Neubig, Sebastian Stüker, Alex Waibel
Self-attention is a method of encoding sequences of vectors by relating these vectors to each-other based on pairwise similarities.
1 code implementation • LREC 2018 • Patrick Huber, Jan Niehues, Alex Waibel
We present a new approach to evaluate computational models for the task of text understanding by the means of out-of-context error detection.
no code implementations • 6 Mar 2018 • Stefan Constantin, Jan Niehues, Alex Waibel
Furthermore, by using a feedforward neural network, we are able to generate the output word by word and are no longer restricted to a fixed number of possible response candidates.
no code implementations • 19 Dec 2017 • Thomas Zenkel, Ramon Sanabria, Florian Metze, Alex Waibel
This paper proposes a novel approach to create an unit set for CTC based speech recognition systems.
no code implementations • 13 Nov 2017 • Markus Müller, Sebastian Stüker, Alex Waibel
We evaluated the use of different language combinations as well as the addition of Language Feature Vectors (LFVs).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 13 Nov 2017 • Markus Müller, Sebastian Stüker, Alex Waibel
In this work, we focus on multilingual systems based on recurrent neural networks (RNNs), trained using the Connectionist Temporal Classification (CTC) loss function.
no code implementations • 15 Sep 2017 • Matthias Sperber, Graham Neubig, Jan Niehues, Satoshi Nakamura, Alex Waibel
We investigate the problem of manually correcting errors from an automatic speech transcript in a cost-sensitive fashion.
no code implementations • WS 2017 • Jan-Thorsten Peter, Hermann Ney, Ond{\v{r}}ej Bojar, Ngoc-Quan Pham, Jan Niehues, Alex Waibel, Franck Burlot, Fran{\c{c}}ois Yvon, M{\=a}rcis Pinnis, Valters {\v{S}}ics, Jasmijn Bastings, Miguel Rios, Wilker Aziz, Philip Williams, Fr{\'e}d{\'e}ric Blain, Lucia Specia
no code implementations • 15 Aug 2017 • Thomas Zenkel, Ramon Sanabria, Florian Metze, Jan Niehues, Matthias Sperber, Sebastian Stüker, Alex Waibel
The CTC loss function maps an input sequence of observable feature vectors to an output sequence of symbols.
no code implementations • WS 2017 • Jan Niehues, Eunah Cho, Thanh-Le Ha, Alex Waibel
By separating the search space and the modeling using $n$-best list reranking, we analyze the influence of both parts of an NMT system independently.
1 code implementation • 2 Jun 2017 • Robin Ruede, Markus Müller, Sebastian Stüker, Alex Waibel
BCs can be expressed in different ways, depending on the modality of the interaction, for example as gestures or acoustic cues.
no code implementations • EMNLP 2017 • Matthias Sperber, Graham Neubig, Jan Niehues, Alex Waibel
In this work, we extend the TreeLSTM (Tai et al., 2015) into a LatticeLSTM that is able to consume word lattices, and can be used as encoder in an attentional encoder-decoder model.
no code implementations • COLING 2016 • Matthias Sperber, Graham Neubig, Jan Niehues, Sebastian St{\"u}ker, Alex Waibel
Evaluating the quality of output from language processing systems such as machine translation or speech recognition is an essential step in ensuring that they are sufficient for practical use.
no code implementations • COLING 2016 • Jan Niehues, Eunah Cho, Thanh-Le Ha, Alex Waibel
We analyzed the influence of the quality of the initial system on the final result.
no code implementations • WS 2016 • Jan-Thorsten Peter, Tamer Alkhouli, Hermann Ney, Matthias Huck, Fabienne Braune, Alex Fraser, er, Ale{\v{s}} Tamchyna, Ond{\v{r}}ej Bojar, Barry Haddow, Rico Sennrich, Fr{\'e}d{\'e}ric Blain, Lucia Specia, Jan Niehues, Alex Waibel, Alex Allauzen, re, Lauriane Aufrant, Franck Burlot, Elena Knyazeva, Thomas Lavergne, Fran{\c{c}}ois Yvon, M{\=a}rcis Pinnis, Stella Frank
Ranked #12 on Machine Translation on WMT2016 English-Romanian
no code implementations • NAACL 2016 • Markus M{\"u}ller, Thai Son Nguyen, Jan Niehues, Eunah Cho, Bastian Kr{\"u}ger, Thanh-Le Ha, Kevin Kilgour, Matthias Sperber, Mohammed Mediani, Sebastian St{\"u}ker, Alex Waibel
no code implementations • LREC 2016 • Markus M{\"u}ller, Sarah F{\"u}nfer, Sebastian St{\"u}ker, Alex Waibel
One obstacle to achieving this goal is that lectures at KIT are usually held in German which many foreign students are not sufficiently proficient in, as, e. g., opposed to English.
no code implementations • LREC 2016 • Matthias Sperber, Graham Neubig, Satoshi Nakamura, Alex Waibel
Our goal is to improve the human transcription quality via appropriate user interface design.
no code implementations • 28 Apr 2015 • Thanh-Le Ha, Jan Niehues, Alex Waibel
In this paper we combine the advantages of a model using global source sentence contexts, the Discriminative Word Lexicon, and neural networks.
no code implementations • LREC 2014 • Eunah Cho, Sarah F{\"u}nfer, Sebastian St{\"u}ker, Alex Waibel
With the increasing number of applications handling spontaneous speech, the needs to process spoken languages become stronger.
no code implementations • LREC 2014 • Teresa Herrmann, Jan Niehues, Alex Waibel
However, it is a crucial aspect for humans when deciding on translation quality.
no code implementations • TACL 2014 • Matthias Sperber, Mirjam Simantzik, Graham Neubig, Satoshi Nakamura, Alex Waibel
In this paper, we study the problem of manually correcting automatic annotations of natural language in as efficient a manner as possible.
no code implementations • LREC 2012 • Sebastian St{\"u}ker, Florian Kraft, Christian Mohr, Teresa Herrmann, Eunah Cho, Alex Waibel
Academic lectures offer valuable content, but often do not reach their full potential audience due to the language barrier.