no code implementations • 16 Jun 2019 • Jian Zheng, Sudha Krishnamurthy, Ruxin Chen, Min-Hung Chen, Zhenhao Ge, Xiaohua LI
However, little work has been done on game image captioning, which has some unique characteristics and requirements.
2 code implementations • 8 Feb 2017 • Zhenhao Ge, Ananth N. Iyer, Srinath Cheluvaraja, Aravind Ganapathiraju
The mechanism proposed here is for real-time speaker change detection in conversations; it first trains a text-independent neural network speaker classifier using in-domain speaker data.
Sound
1 code implementation • 8 Feb 2017 • Zhenhao Ge, Ananth N. Iyer, Srinath Cheluvaraja, Ram Sundaram, Aravind Ganapathiraju
This work presents a novel framework based on a feed-forward neural network for text-independent speaker classification and verification, two related systems of speaker recognition.
Sound
no code implementations • 28 Jun 2016 • Zhenhao Ge, Aravind Ganapathiraju, Ananth N. Iyer, Scott A. Randal, Felix I. Wyss
Speech recognition, especially name recognition, is widely used in phone services such as company directory dialers, stock quote providers or location finders.
no code implementations • 25 Feb 2016 • Zhenhao Ge, Sudhendu R. Sharma, Mark J. T. Smith
This paper presents a method for detecting mispronunciations with the aim of improving Computer Assisted Language Learning (CALL) tools used by foreign language learners.
no code implementations • 25 Feb 2016 • Zhenhao Ge, Sudhendu R. Sharma, Mark J. T. Smith
New adaptive features have been developed, obtained through an adaptive warping of the frequency scale prior to computing the cepstral coefficients.
Automatic Speech Recognition
no code implementations • 25 Feb 2016 • Zhenhao Ge, Sudhendu R. Sharma, Mark J. T. Smith
Various algorithms for text-independent speaker recognition have been developed through the decades, aiming to improve both accuracy and efficiency.
no code implementations • 24 Feb 2016 • Zhenhao Ge, Yingyi Tan, Aravind Ganapathiraju
Previous accent classification research focused mainly on detecting accents using purely acoustic information, without recognizing the accented speech.
1 code implementation • 24 Feb 2016 • Zhenhao Ge, Yufang Sun
In this paper, we present a novel setup of a Neural Network Language Model (NNLM) and apply it to a database of text samples from different authors.
no code implementations • 24 Feb 2016 • Zhenhao Ge
Research has shown that accent classification can be improved by integrating semantic information into a purely acoustic approach.
1 code implementation • 17 Feb 2016 • Zhenhao Ge, Yufang Sun, Mark J. T. Smith
In practice, training language models for individual authors is often expensive because of limited data resources.