no code implementations • 11 Mar 2021 • Tianxi Ji, Emre Yilmaz, Erman Ayday, Pan Li
Database fingerprinting have been widely adopted to prevent unauthorized sharing of data and identify the source of data leakages.
Cryptography and Security Databases
no code implementations • Interspeech 2020 • Emre Yilmaz, Özgür Bora Gevrek, Jibin Wu, Yuxiang Chen, Xuanbo Meng, Haizhou Li
To explore the effectiveness and computational complexity of SNN on KWS and wakeword detection, we compare the performance and computational costs of spiking fully-connected and convolutional neural networks with ANN counterparts under clean and noisy testing conditions.
no code implementations • LREC 2020 • Nick Wilkinson, Astik Biswas, Emre Yilmaz, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler
Automatic segmentation was applied in combination with automaticspeaker diarization.
no code implementations • LREC 2020 • Astik Biswas, Emre Yilmaz, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler
This paper reports on the semi-supervised development of acoustic and language models for under-resourced, code-switched speech in five South African languages.
1 code implementation • 19 Nov 2019 • Jibin Wu, Emre Yilmaz, Malu Zhang, Haizhou Li, Kay Chen Tan
The brain-inspired spiking neural networks (SNN) closely mimic the biological neural networks and can operate on low-power neuromorphic hardware with spike-based computation.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 27 Sep 2019 • Xianghu Yue, Grandee Lee, Emre Yilmaz, Fang Deng, Haizhou Li
In this work, we describe an E2E ASR pipeline for the recognition of CS speech in which a low-resourced language is mixed with a high resourced language.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 23 Sep 2019 • Chitralekha Gupta, Emre Yilmaz, Haizhou Li
Automatic lyrics alignment and transcription in polyphonic music are challenging tasks because the singing vocals are corrupted by the background music.
Audio and Speech Processing Sound
no code implementations • 25 Jun 2019 • Chitralekha Gupta, Emre Yilmaz, Haizhou Li
In this work, we propose (1) using additional speech and music-informed features and (2) adapting the acoustic models trained on a large amount of solo singing vocals towards polyphonic music using a small amount of in-domain data.
no code implementations • 20 Jun 2019 • Astik Biswas, Emre Yilmaz, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler
Furthermore, because English is common to all language pairs in our data, it dominates when training a unified language model, leading to improved English ASR performance at the expense of the other languages.
no code implementations • 19 Jun 2019 • Emre Yilmaz, Adem Derinel, Zhou Kun, Henk van den Heuvel, Niko Brummer, Haizhou Li, David A. van Leeuwen
This paper describes our initial efforts to build a large-scale speaker diarization (SD) and identification system on a recently digitized radio broadcast archive from the Netherlands which has more than 6500 audio tapes with 3000 hours of Frisian-Dutch speech recorded between 1950-2016.
no code implementations • 19 Jun 2019 • Qinyi Wang, Emre Yilmaz, Adem Derinel, Haizhou Li
Code-switching (CS) detection refers to the automatic detection of language switches in code-mixed utterances.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 18 Jun 2019 • Emre Yilmaz, Samuel Cohen, Xianghu Yue, David van Leeuwen, Haizhou Li
This archive contains recordings with monolingual Frisian and Dutch speech segments as well as Frisian-Dutch CS speech, hence the recognition performance on monolingual segments is also vital for accurate transcriptions.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 16 May 2019 • Emre Yilmaz, Vikramjit Mitra, Ganesh Sivaraman, Horacio Franco
The rapid population aging has stimulated the development of assistive devices that provide personalized medical support to the needies suffering from various etiologies.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 3 May 2019 • Emre Yilmaz, Mohammad Al-Rubaie, J. Morris Chang
In order to train a Naive Bayes classifier in an untrusted setting, we propose to use methods satisfying local differential privacy.
no code implementations • 16 Apr 2019 • Kong Aik Lee, Ville Hautamaki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Hector Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Huy Dat Tran, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-Francois Bonastre, Cheng-Lin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas Evans
The I4U consortium was established to facilitate a joint entry to NIST speaker recognition evaluations (SRE).
no code implementations • 23 Oct 2018 • Emre Yilmaz, Mitchell McLaren, Henk van den Heuvel, David A. van Leeuwen
In this paper, we describe several automatic annotation approaches to enable using of a large amount of raw bilingual broadcast data for acoustic model training in a semi-supervised setting.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
no code implementations • 17 Sep 2018 • Longting Xu, Rohan Kumar Das, Emre Yilmaz, Jichen Yang, Haizhou Li
Speaker verification (SV) systems using deep neural network embeddings, so-called the x-vector systems, are becoming popular due to its good performance superior to the i-vector systems.
no code implementations • 28 Jul 2018 • Emre Yilmaz, Astik Biswas, Ewald van der Westhuizen, Febe De Wet, Thomas Niesler
We present our first efforts towards building a single multilingual automatic speech recognition (ASR) system that can process code-switching (CS) speech in five languages spoken within the same population.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 28 Jul 2018 • Emre Yilmaz, Henk van den Heuvel, David A. van Leeuwen
In this paper, we describe several techniques for improving the acoustic and language model of an automatic speech recognition (ASR) system operating on code-switching (CS) speech.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 28 Jul 2018 • Emre Yilmaz, Vikramjit Mitra, Chris Bartels, Horacio Franco
In this work, we investigate the joint use of articulatory and acoustic features for automatic speech recognition (ASR) of pathological speech.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 23 Jul 2018 • Raghav Menon, Herman Kamper, Emre Yilmaz, John Quinn, Thomas Niesler
We consider multilingual bottleneck features (BNFs) for nearly zero-resource keyword spotting.
no code implementations • LREC 2016 • Emre Yilmaz, Mario Ganzeboom, Lilian Beijer, Catia Cucchiarini, Helmer Strik
We present a new Dutch dysarthric speech database containing utterances of neurological patients with Parkinson{'}s disease, traumatic brain injury and cerebrovascular accident.
no code implementations • LREC 2016 • Emre Yilmaz, Maaike Andringa, Sigrid Kingma, Jelske Dijkstra, Frits van der Kuip, Hans Van de Velde, Frederik Kampstra, Jouke Algra, Henk van den Heuvel, David van Leeuwen
Frisian is mostly spoken in the province Fryslan and it is the second official language of the Netherlands.