no code implementations • 30 Sep 2023 • Taichi Higasa, Keitaro Tanaka, Qi Feng, Shigeo Morishima
Language learners should regularly engage in reading challenging materials as part of their study routine.
no code implementations • 19 Sep 2023 • Ryosuke Oshima, Seitaro Shinagawa, Hideki Tsunashima, Qi Feng, Shigeo Morishima
Effective communication between humans and intelligent agents has promising applications for solving complex problems.
no code implementations • 24 Aug 2023 • Qi Feng, Hubert P. H. Shum, Shigeo Morishima
To explore this influence, we first propose a pilot study that captures real environments with multiple eye heights and asks participants to judge the egocentric distances and immersion.
no code implementations • 10 Jun 2023 • Tomoya Yoshinaga, Keitaro Tanaka, Shigeo Morishima
This paper describes an audio-visual speech enhancement (AV-SE) method that estimates from noisy input audio a mixture of the speech of the speaker appearing in an input video (on-screen target speech) and of a selected speaker not appearing in the video (off-screen target speech).
no code implementations • 23 May 2023 • Sara Kashiwagi, Keitaro Tanaka, Qi Feng, Shigeo Morishima
This paper presents a novel metric learning approach to address the performance gap between normal and silent speech in visual speech recognition (VSR).
no code implementations • 14 Apr 2023 • Shinei Arakawa, Hideki Tsunashima, Daichi Horita, Keitaro Tanaka, Shigeo Morishima
Second, we propose Global Content Conditioning (GCC) to ensure patches have coherent content when concatenated together.
no code implementations • 6 Mar 2023 • Gido Kato, Yoshihiro Fukuhara, Mariko Isogawa, Hideki Tsunashima, Hirokatsu Kataoka, Shigeo Morishima
To protect privacy and prevent malicious use of deepfake, current studies propose methods that interfere with the generation process, such as detection and destruction approaches.
1 code implementation • 5 Mar 2023 • Yuta Tsuji, Tatsuya Yatagawa, Hiroyuki Kubo, Shigeo Morishima
This paper presents an algorithm to obtain an event-based video from noisy frames given by physics-based Monte Carlo path tracing over a synthetic 3D scene.
1 code implementation • 19 Jul 2022 • Tanqiu Qiao, Qianhui Men, Frederick W. B. Li, Yoshiki Kubotani, Shigeo Morishima, Hubert P. H. Shum
Consider that geometric features such as human pose and object position provide meaningful information to understand HOIs, we argue to combine the benefits of both visual and geometric features in HOI recognition, and propose a novel Two-level Geometric feature-informed Graph Convolutional Network (2G-GCN).
no code implementations • 30 Mar 2022 • Takashi Oya, Shohei Iwase, Shigeo Morishima
To tackle this problem, we propose a fully unsupervised method that learns to detect objects in an image and separate sound source simultaneously.
no code implementations • 17 Mar 2022 • Shintaro Yamamoto, Hirokatsu Kataoka, Ryota Suzuki, Seitaro Shinagawa, Shigeo Morishima
To alleviate this problem, we organized a group of non-native English speakers to write summaries of papers presented at a computer vision conference to share the knowledge of the papers read by the group.
1 code implementation • 16 Feb 2022 • Qi Feng, Hubert P. H. Shum, Shigeo Morishima
In this work, we first establish a large-scale dataset with varied settings called Depth360 to tackle the training data problem.
1 code implementation • 31 Jul 2021 • Yoshiki Kubotani, Yoshihiro Fukuhara, Shigeo Morishima
However, optimization using reinforcement learning requires a large number of interactions, and thus it cannot be applied directly to actual students.
no code implementations • 28 Jan 2021 • Takashi Oya, Shigeo Morishima
This paper introduces the 2nd place solution for the Riiid!
no code implementations • 21 Dec 2020 • Shintaro Yamamoto, Anne Lauscher, Simone Paolo Ponzetto, Goran Glavaš, Shigeo Morishima
Providing visual summaries of scientific publications can increase information access for readers and thereby help deal with the exponential growth in the number of scientific publications.
no code implementations • 11 Jul 2020 • Takashi Oya, Shohei Iwase, Ryota Natsume, Takahiro Itazuri, Shugo Yamaguchi, Shigeo Morishima
Moreover, we show that the majority of sound-producing objects within the samples in this dataset can be inherently identified using only visual information, and thus that the dataset is inadequate to evaluate a system's capability to leverage aural information.
no code implementations • 8 Apr 2020 • Takayuki Nakatsuka, Kazuyoshi Yoshii, Yuki Koyama, Satoru Fukayama, Masataka Goto, Shigeo Morishima
Specifically, we formulate a hierarchical generative model of poses and images by integrating a deep generative model of poses from pose features with that of images from poses and image features.
no code implementations • 25 Sep 2019 • Masahiro Kato, Yoshihiro Fukuhara, Hirokatsu Kataoka, Shigeo Morishima
Our main idea is to apply a framework of learning with rejection and adversarial examples to assist in the decision making for such suspicious samples.
1 code implementation • 19 May 2019 • Takahiro Itazuri, Yoshihiro Fukuhara, Hirokatsu Kataoka, Shigeo Morishima
In this paper, we address the open question: "What do adversarially robust models look at?"
1 code implementation • ICCV 2019 • Shunsuke Saito, Zeng Huang, Ryota Natsume, Shigeo Morishima, Angjoo Kanazawa, Hao Li
We introduce Pixel-aligned Implicit Function (PIFu), a highly effective implicit representation that locally aligns pixels of 2D images with the global context of their corresponding 3D object.
Ranked #1 on 3D Object Reconstruction on RenderPeople
1 code implementation • CVPR 2019 • Ryota Natsume, Shunsuke Saito, Zeng Huang, Weikai Chen, Chongyang Ma, Hao Li, Shigeo Morishima
The synthesized silhouettes which are the most consistent with the input segmentation are fed into a deep visual hull algorithm for robust 3D shape prediction.
no code implementations • 30 Nov 2018 • Ryota Natsume, Tatsuya Yatagawa, Shigeo Morishima
We herein represent the face region with a latent variable that is assigned with the proposed deep neural network (DNN) instead of facial textures.
no code implementations • 16 Nov 2018 • Shintaro Yamamoto, Yoshihiro Fukuhara, Ryota Suzuki, Shigeo Morishima, Hirokatsu Kataoka
Due to the recent boom in artificial intelligence (AI) research, including computer vision (CV), it has become impossible for researchers in these fields to keep up with the exponentially increasing number of manuscripts.
no code implementations • 22 Sep 2018 • Ryota Natsume, Kazuki Inoue, Yoshihiro Fukuhara, Shintaro Yamamoto, Shigeo Morishima, Hirokatsu Kataoka
Face recognition research is one of the most active topics in computer vision (CV), and deep neural networks (DNN) are now filling the gap between human-level and computer-driven performance levels in face verification algorithms.
no code implementations • 10 Apr 2018 • Ryota Natsume, Tatsuya Yatagawa, Shigeo Morishima
The proposed network independently handles face and hair appearances in the latent spaces, and then, face swapping is achieved by replacing the latent-space representations of the faces, and reconstruct the entire face image with them.