Search Results for author: Markus Müller

Found 13 papers, 1 paper with code

The 2016 KIT IWSLT Speech-to-Text Systems for English and German

no code implementations IWSLT 2016 Thai-Son Nguyen, Markus Müller, Matthias Sperber, Thomas Zenkel, Kevin Kilgour, Sebastian Stüker, Alex Waibel

For the English TED task, our best combination system has a WER of 7.8% on the development set while our other combinations gained 21.8% and 28.7% WERs for the English and German MSLT tasks.

The 2017 KIT IWSLT Speech-to-Text Systems for English and German

no code implementations IWSLT 2017 Thai-Son Nguyen, Markus Müller, Matthias Sperber, Thomas Zenkel, Sebastian Stüker, Alex Waibel

For the English lecture task, our best combination system has a WER of 8.3% on the tst2015 development set while our other combinations gained 25.7% WER for German lecture tasks.

A Novel Self-Supervised Cross-Modal Image Retrieval Method In Remote Sensing

no code implementations 23 Feb 2022 Gencer Sumbul, Markus Müller, Begüm Demir

Due to the availability of multi-modal remote sensing (RS) image archives, one of the most important research topics is the development of cross-modal RS image retrieval (CM-RSIR) methods that search semantically similar images across different modalities.

Image Retrieval, Retrieval

Fluctuation driven transitions in localized insulators: Intermittent metallicity and path chaos precede delocalization

no code implementations 11 Mar 2021 Valentina Ros, Markus Müller

The first occurs between a non-resonating insulator and an intermittent metal.

Disordered Systems and Neural Networks, Statistical Mechanics

Tie Your Embeddings Down: Cross-Modal Latent Spaces for End-to-end Spoken Language Understanding

no code implementations 18 Nov 2020 Bhuvan Agrawal, Markus Müller, Martin Radfar, Samridhi Choudhary, Athanasios Mouchtaris, Siegfried Kunzmann

In this paper, we treat an E2E system as a multi-modal model, with audio and text functioning as its two modalities, and use a cross-modal latent space (CMLS) architecture, where a shared latent space is learned between the 'acoustic' and 'text' embeddings.

Spoken Language Understanding, Triplet

Very Deep Self-Attention Networks for End-to-End Speech Recognition

no code implementations 30 Apr 2019 Ngoc-Quan Pham, Thai-Son Nguyen, Jan Niehues, Markus Müller, Sebastian Stüker, Alexander Waibel

Recently, end-to-end sequence-to-sequence models for speech recognition have gained significant interest in the research community.

Speech Recognition

Neural Language Codes for Multilingual Acoustic Models

no code implementations 5 Jul 2018 Markus Müller, Sebastian Stüker, Alex Waibel

Multilingual Speech Recognition is one of the most costly AI problems, because each language (7,000+) and even different accents require their own acoustic models to obtain best recognition performance.

Speech Recognition

Multilingual Adaptation of RNN Based ASR Systems

no code implementations 13 Nov 2017 Markus Müller, Sebastian Stüker, Alex Waibel

In this work, we focus on multilingual systems based on recurrent neural networks (RNNs), trained using the Connectionist Temporal Classification (CTC) loss function.

Yeah, Right, Uh-Huh: A Deep Learning Backchannel Predictor

1 code implementation 2 Jun 2017 Robin Ruede, Markus Müller, Sebastian Stüker, Alex Waibel

Backchannels (BCs) can be expressed in different ways, depending on the modality of the interaction, for example as gestures or acoustic cues.
