no code implementations • 2 Feb 2024 • Yi Chang, Zhao Ren, Zixing Zhang, Xin Jing, Kun Qian, Xi Shao, Bin Hu, Tanja Schultz, Björn W. Schuller
Speech contains rich information on the emotions of humans, and Speech Emotion Recognition (SER) has been an important topic in the area of human-computer interaction.
1 code implementation • 15 Jan 2024 • Daniel Reich, Tanja Schultz
In this study, we demonstrate that current evaluation schemes for VG-methods are problematic due to the flawed assumption of availability of relevant visual information.
no code implementations • 26 Jul 2023 • Zexu Pan, Marvin Borsdorf, Siqi Cai, Tanja Schultz, Haizhou Li
We propose both an offline and an online NeuroHeed, with the latter designed for real-time inference.
1 code implementation • 24 May 2023 • Daniel Reich, Felix Putze, Tanja Schultz
Metrics for Visual Grounding (VG) in Visual Question Answering (VQA) systems primarily aim to measure a system's reliance on relevant parts of the image when inferring an answer to the given question.
1 code implementation • 15 Nov 2022 • Daniel Reich, Felix Putze, Tanja Schultz
Visual Grounding (VG) in Visual Question Answering (VQA) systems describes how well a system manages to tie a question and its answer to relevant image regions.
no code implementations • 28 Jun 2021 • Daniel Reich, Felix Putze, Tanja Schultz
With the expressed goal of improving system transparency and visual grounding in the reasoning process in VQA, we present a modular system for the task of compositional VQA based on scene graphs.
no code implementations • LREC 2020 • Martha Yifiru Tachbelie, Solomon Teferra Abate, Tanja Schultz
Our results show the possibility of developing ASR system for a language, if we have pronunciation dictionary and language model, using an existing speech corpus of another language irrespective of their language family.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • LREC 2020 • Ayimunishagu Abulimiti, Tanja Schultz
In this work, we show our effort to build word-based as well as morpheme-based language models for Uyghur, a language that combines both challenges, i. e. it is a low-resource and agglutinative language.
no code implementations • LREC 2020 • Martha Yifiru Tachbelie, Solomon Teferra Abate, Tanja Schultz
From GP, Turkish, Uyghur and Croatian are found to have much overlap with the Ethiopian languages.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • LREC 2020 • Ayimunishagu Abulimiti, Tanja Schultz
For the developing of multilingual speech recognition system for Uyghur, we used Turkish as donor language, which we selected from GlobalPhone corpus as the most similar language to Uyghur.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
1 code implementation • SoftwareX 2020 • Marília Barandas, Duarte Folgado, Letícia Fernandes, Sara Santos, Mariana Abreu, Patrícia Bota, Hui Liu, Tanja Schultz, Hugo Gamboa
Time series feature extraction is one of the preliminary steps of conventional machine learning pipelines.
no code implementations • 4 Oct 2017 • Heike Adel, Ngoc Thang Vu, Katrin Kirchhoff, Dominic Telaar, Tanja Schultz
The experimental results reveal that Brown word clusters, part-of-speech tags and open-class words are the most effective at reducing the perplexity of factored language models on the Mandarin-English Code-Switching corpus SEAME.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • LREC 2016 • Jochen Weiner, Claudia Frankenberg, Dominic Telaar, Britta Wendelstein, Johannes Schr{\"o}der, Tanja Schultz
Using a recursive long audio alignment procedure 96 hours of the transcribed data have been made accessible for ASR training.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • LREC 2014 • Tanja Schultz, Tim Schlippe
This paper describes the advances in the multilingual text and speech database GlobalPhone, a multilingual database of high-quality read speech with corresponding transcriptions and pronunciation dictionaries in 20 languages.
no code implementations • NeurIPS 2008 • Yik-Cheung Tam, Tanja Schultz
We propose using correlated bigram LSA for unsupervised LM adaptation for automatic speech recognition.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2