no code implementations • 17 Dec 2024 • Peter Wu, Bohan Yu, Kevin Scheck, Alan W Black, Aditi S. Krishnapriyan, Irene Y. Chen, Tanja Schultz, Shinji Watanabe, Gopala K. Anumanchipalli
The amount of articulatory data available for training deep learning models is much less compared to acoustic speech data.
no code implementations • 17 Oct 2024 • Chao Tan, Sheng Li, Yang Cao, Zhao Ren, Tanja Schultz
To this end, this paper focuses on investigating the security of FL for SER concerning property inference attacks.
no code implementations • 16 Sep 2024 • Catarina Botelho, Alberto Abad, Tanja Schultz, Isabel Trancoso
The proposed framework for reference speech characterization and disease detection is designed to support the medical community by providing clinically meaningful explanations that can serve as a valuable second opinion.
no code implementations • 4 Sep 2024 • Dashanka De Silva, Siqi Cai, Saurav Pahuja, Tanja Schultz, Haizhou Li
In the study of auditory attention, it has been revealed that there exists a robust correlation between attended speech and elicited neural responses, measurable through electroencephalography (EEG).
no code implementations • 26 Jun 2024 • Daniel Reich, Tanja Schultz
Finally, we propose an approach to create OOD tests that properly emphasize a requirement for VG, and show how to improve performance on them.
no code implementations • 21 Jun 2024 • Yi Chang, Zhao Ren, Zhonghao Zhao, Thanh Tam Nguyen, Kun Qian, Tanja Schultz, Björn W. Schuller
Speech emotion recognition (SER) plays a crucial role in human-computer interaction.
no code implementations • 2 Feb 2024 • Yi Chang, Zhao Ren, Zixing Zhang, Xin Jing, Kun Qian, Xi Shao, Bin Hu, Tanja Schultz, Björn W. Schuller
Speech contains rich information on the emotions of humans, and Speech Emotion Recognition (SER) has been an important topic in the area of human-computer interaction.
1 code implementation • 15 Jan 2024 • Daniel Reich, Tanja Schultz
In this study, we demonstrate that current evaluation schemes for VG-methods are problematic due to the flawed assumption of availability of relevant visual information.
no code implementations • 26 Jul 2023 • Zexu Pan, Marvin Borsdorf, Siqi Cai, Tanja Schultz, Haizhou Li
We propose both an offline and an online NeuroHeed, with the latter designed for real-time inference.
1 code implementation • 24 May 2023 • Daniel Reich, Felix Putze, Tanja Schultz
Metrics for Visual Grounding (VG) in Visual Question Answering (VQA) systems primarily aim to measure a system's reliance on relevant parts of the image when inferring an answer to the given question.
1 code implementation • 15 Nov 2022 • Daniel Reich, Felix Putze, Tanja Schultz
Visual Grounding (VG) in Visual Question Answering (VQA) systems describes how well a system manages to tie a question and its answer to relevant image regions.
no code implementations • 28 Jun 2021 • Daniel Reich, Felix Putze, Tanja Schultz
With the expressed goal of improving system transparency and visual grounding in the reasoning process in VQA, we present a modular system for the task of compositional VQA based on scene graphs.
no code implementations • LREC 2020 • Martha Yifiru Tachbelie, Solomon Teferra Abate, Tanja Schultz
From GP, Turkish, Uyghur and Croatian are found to have much overlap with the Ethiopian languages.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • LREC 2020 • Ayimunishagu Abulimiti, Tanja Schultz
For the developing of multilingual speech recognition system for Uyghur, we used Turkish as donor language, which we selected from GlobalPhone corpus as the most similar language to Uyghur.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • LREC 2020 • Martha Yifiru Tachbelie, Solomon Teferra Abate, Tanja Schultz
Our results show the possibility of developing ASR system for a language, if we have pronunciation dictionary and language model, using an existing speech corpus of another language irrespective of their language family.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • LREC 2020 • Ayimunishagu Abulimiti, Tanja Schultz
In this work, we show our effort to build word-based as well as morpheme-based language models for Uyghur, a language that combines both challenges, i. e. it is a low-resource and agglutinative language.
1 code implementation • SoftwareX 2020 • Marília Barandas, Duarte Folgado, Letícia Fernandes, Sara Santos, Mariana Abreu, Patrícia Bota, Hui Liu, Tanja Schultz, Hugo Gamboa
Time series feature extraction is one of the preliminary steps of conventional machine learning pipelines.
no code implementations • 4 Oct 2017 • Heike Adel, Ngoc Thang Vu, Katrin Kirchhoff, Dominic Telaar, Tanja Schultz
The experimental results reveal that Brown word clusters, part-of-speech tags and open-class words are the most effective at reducing the perplexity of factored language models on the Mandarin-English Code-Switching corpus SEAME.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • LREC 2016 • Jochen Weiner, Claudia Frankenberg, Dominic Telaar, Britta Wendelstein, Johannes Schr{\"o}der, Tanja Schultz
Using a recursive long audio alignment procedure 96 hours of the transcribed data have been made accessible for ASR training.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • LREC 2014 • Tanja Schultz, Tim Schlippe
This paper describes the advances in the multilingual text and speech database GlobalPhone, a multilingual database of high-quality read speech with corresponding transcriptions and pronunciation dictionaries in 20 languages.
no code implementations • NeurIPS 2008 • Yik-Cheung Tam, Tanja Schultz
We propose using correlated bigram LSA for unsupervised LM adaptation for automatic speech recognition.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3