no code implementations • 30 Sep 2023 • Taichi Higasa, Keitaro Tanaka, Qi Feng, Shigeo Morishima
Language learners should regularly engage in reading challenging materials as part of their study routine.
no code implementations • 10 Jun 2023 • Tomoya Yoshinaga, Keitaro Tanaka, Shigeo Morishima
This paper describes an audio-visual speech enhancement (AV-SE) method that estimates from noisy input audio a mixture of the speech of the speaker appearing in an input video (on-screen target speech) and of a selected speaker not appearing in the video (off-screen target speech).
no code implementations • 23 May 2023 • Sara Kashiwagi, Keitaro Tanaka, Qi Feng, Shigeo Morishima
This paper presents a novel metric learning approach to address the performance gap between normal and silent speech in visual speech recognition (VSR).
no code implementations • 14 Apr 2023 • Shinei Arakawa, Hideki Tsunashima, Daichi Horita, Keitaro Tanaka, Shigeo Morishima
Second, we propose Global Content Conditioning (GCC) to ensure patches have coherent content when concatenated together.
no code implementations • 4 Jun 2021 • Keitaro Tanaka, Ryosuke Sawata, Shusuke Takahashi
This paper presents a new deep clustering (DC) method called manifold-aware DC (M-DC) that can enhance hyperspace utilization more effectively than the original DC.