no code implementations • 20 Oct 2021 • Huaibo Zhao, Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi
In the present paper, an attempt is made to combine Mask-CTC and the triggered attention mechanism to construct a streaming end-to-end automatic speech recognition (ASR) system that provides high performance with low latency.
1 code implementation • 8 Oct 2021 • Yosuke Higuchi, Keita Karube, Tetsuji Ogawa, Tetsunori Kobayashi
In this work, to promote the word-level representation learning in end-to-end ASR, we propose a hierarchical conditional model that is based on connectionist temporal classification (CTC).
no code implementations • COLING 2020 • Hikari Tanabe, Tetsuji Ogawa, Tetsunori Kobayashi, Yoshihiko Hayashi
Recognition of the mental state of a human character in text is a major challenge in natural language processing.
no code implementations • COLING 2020 • Hiroaki Takatsu, Ryota Ando, Yoichi Matsuyama, Tetsunori Kobayashi
As smart speakers and conversational robots become ubiquitous, the demand for expressive speech synthesis has increased.
no code implementations • 26 Oct 2020 • Yosuke Higuchi, Hirofumi Inaguma, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi
While Mask-CTC achieves remarkably fast inference speed, its recognition performance falls behind that of conventional autoregressive (AR) systems.
no code implementations • 18 May 2020 • Yosuke Higuchi, Shinji Watanabe, Nanxin Chen, Tetsuji Ogawa, Tetsunori Kobayashi
In this work, Mask CTC model is trained using a Transformer encoder-decoder with joint training of mask prediction and CTC.
Audio and Speech Processing Sound
no code implementations • LREC 2020 • Mika Hasegawa, Tetsunori Kobayashi, Yoshihiko Hayashi
Human semantic knowledge about concepts acquired through perceptual inputs and daily experiences can be expressed as a bundle of attributes.
no code implementations • WS 2019 • Mao Nakanishi, Tetsunori Kobayashi, Yoshihiko Hayashi
Conversational question generation is a novel area of NLP research which has a range of potential applications.
no code implementations • COLING 2018 • Mao Nakanishi, Tetsunori Kobayashi, Yoshihiko Hayashi
However, to realize human-like language comprehension ability, a machine should also be able to distinguish not-answerable questions (NAQs) from answerable questions.
no code implementations • WS 2017 • Kentaro Kanada, Tetsunori Kobayashi, Yoshihiko Hayashi
This paper proposes a method for classifying the type of lexical-semantic relation between a given pair of words.