Search Results for author: Shigeo Morishima

Found 25 papers, 7 papers with code

Pointing out Human Answer Mistakes in a Goal-Oriented Visual Dialogue

no code implementations19 Sep 2023 Ryosuke Oshima, Seitaro Shinagawa, Hideki Tsunashima, Qi Feng, Shigeo Morishima

Effective communication between humans and intelligent agents has promising applications for solving complex problems.

Language Modelling

Enhancing Perception and Immersion in Pre-Captured Environments through Learning-Based Eye Height Adaptation

no code implementations24 Aug 2023 Qi Feng, Hubert P. H. Shum, Shigeo Morishima

To explore this influence, we first propose a pilot study that captures real environments with multiple eye heights and asks participants to judge the egocentric distances and immersion.

Semantic Segmentation

Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction

no code implementations10 Jun 2023 Tomoya Yoshinaga, Keitaro Tanaka, Shigeo Morishima

This paper describes an audio-visual speech enhancement (AV-SE) method that estimates from noisy input audio a mixture of the speech of the speaker appearing in an input video (on-screen target speech) and of a selected speaker not appearing in the video (off-screen target speech).

Computational Efficiency Speech Enhancement +1

Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning

no code implementations23 May 2023 Sara Kashiwagi, Keitaro Tanaka, Qi Feng, Shigeo Morishima

This paper presents a novel metric learning approach to address the performance gap between normal and silent speech in visual speech recognition (VSR).

Metric Learning speech-recognition +1

Memory Efficient Diffusion Probabilistic Models via Patch-based Generation

no code implementations14 Apr 2023 Shinei Arakawa, Hideki Tsunashima, Daichi Horita, Keitaro Tanaka, Shigeo Morishima

Second, we propose Global Content Conditioning (GCC) to ensure patches have coherent content when concatenated together.

Scapegoat Generation for Privacy Protection from Deepfake

no code implementations6 Mar 2023 Gido Kato, Yoshihiro Fukuhara, Mariko Isogawa, Hideki Tsunashima, Hirokatsu Kataoka, Shigeo Morishima

To protect privacy and prevent malicious use of deepfake, current studies propose methods that interfere with the generation process, such as detection and destruction approaches.

Face Swapping

Event-based Camera Simulation using Monte Carlo Path Tracing with Adaptive Denoising

1 code implementation5 Mar 2023 Yuta Tsuji, Tatsuya Yatagawa, Hiroyuki Kubo, Shigeo Morishima

This paper presents an algorithm to obtain an event-based video from noisy frames given by physics-based Monte Carlo path tracing over a synthetic 3D scene.

Denoising regression

Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos

1 code implementation19 Jul 2022 Tanqiu Qiao, Qianhui Men, Frederick W. B. Li, Yoshiki Kubotani, Shigeo Morishima, Hubert P. H. Shum

Consider that geometric features such as human pose and object position provide meaningful information to understand HOIs, we argue to combine the benefits of both visual and geometric features in HOI recognition, and propose a novel Two-level Geometric feature-informed Graph Convolutional Network (2G-GCN).

Human-Object Interaction Detection

The Sound of Bounding-Boxes

no code implementations30 Mar 2022 Takashi Oya, Shohei Iwase, Shigeo Morishima

To tackle this problem, we propose a fully unsupervised method that learns to detect objects in an image and separate sound source simultaneously.

Community-Driven Comprehensive Scientific Paper Summarization: Insight from cvpaper.challenge

no code implementations17 Mar 2022 Shintaro Yamamoto, Hirokatsu Kataoka, Ryota Suzuki, Seitaro Shinagawa, Shigeo Morishima

To alleviate this problem, we organized a group of non-native English speakers to write summaries of papers presented at a computer vision conference to share the knowledge of the papers read by the group.

360 Depth Estimation in the Wild -- The Depth360 Dataset and the SegFuse Network

1 code implementation16 Feb 2022 Qi Feng, Hubert P. H. Shum, Shigeo Morishima

In this work, we first establish a large-scale dataset with varied settings called Depth360 to tackle the training data problem.

Autonomous Driving Depth Estimation +2

RLTutor: Reinforcement Learning Based Adaptive Tutoring System by Modeling Virtual Student with Fewer Interactions

1 code implementation31 Jul 2021 Yoshiki Kubotani, Yoshihiro Fukuhara, Shigeo Morishima

However, optimization using reinforcement learning requires a large number of interactions, and thus it cannot be applied directly to actual students.

Decision Making reinforcement-learning +1

Self-Supervised Learning for Visual Summary Identification in Scientific Publications

no code implementations21 Dec 2020 Shintaro Yamamoto, Anne Lauscher, Simone Paolo Ponzetto, Goran Glavaš, Shigeo Morishima

Providing visual summaries of scientific publications can increase information access for readers and thereby help deal with the exponential growth in the number of scientific publications.

Self-Supervised Learning

Do We Need Sound for Sound Source Localization?

no code implementations11 Jul 2020 Takashi Oya, Shohei Iwase, Ryota Natsume, Takahiro Itazuri, Shugo Yamaguchi, Shigeo Morishima

Moreover, we show that the majority of sound-producing objects within the samples in this dataset can be inherently identified using only visual information, and thus that the dataset is inadequate to evaluate a system's capability to leverage aural information.

MirrorNet: A Deep Bayesian Approach to Reflective 2D Pose Estimation from Human Images

no code implementations8 Apr 2020 Takayuki Nakatsuka, Kazuyoshi Yoshii, Yuki Koyama, Satoru Fukayama, Masataka Goto, Shigeo Morishima

Specifically, we formulate a hierarchical generative model of poses and images by integrating a deep generative model of poses from pose features with that of images from poses and image features.

2D Pose Estimation Pose Estimation

Learning with Protection: Rejection of Suspicious Samples under Adversarial Environment

no code implementations25 Sep 2019 Masahiro Kato, Yoshihiro Fukuhara, Hirokatsu Kataoka, Shigeo Morishima

Our main idea is to apply a framework of learning with rejection and adversarial examples to assist in the decision making for such suspicious samples.

BIG-bench Machine Learning Binary Classification +3

PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization

1 code implementation ICCV 2019 Shunsuke Saito, Zeng Huang, Ryota Natsume, Shigeo Morishima, Angjoo Kanazawa, Hao Li

We introduce Pixel-aligned Implicit Function (PIFu), a highly effective implicit representation that locally aligns pixels of 2D images with the global context of their corresponding 3D object.

3D Human Pose Estimation 3D Human Reconstruction +3

SiCloPe: Silhouette-Based Clothed People

1 code implementation CVPR 2019 Ryota Natsume, Shunsuke Saito, Zeng Huang, Weikai Chen, Chongyang Ma, Hao Li, Shigeo Morishima

The synthesized silhouettes which are the most consistent with the input segmentation are fed into a deep visual hull algorithm for robust 3D shape prediction.

Generative Adversarial Network Image-to-Image Translation

FSNet: An Identity-Aware Generative Model for Image-based Face Swapping

no code implementations30 Nov 2018 Ryota Natsume, Tatsuya Yatagawa, Shigeo Morishima

We herein represent the face region with a latent variable that is assigned with the proposed deep neural network (DNN) instead of facial textures.

Face Swapping

Automatic Paper Summary Generation from Visual and Textual Information

no code implementations16 Nov 2018 Shintaro Yamamoto, Yoshihiro Fukuhara, Ryota Suzuki, Shigeo Morishima, Hirokatsu Kataoka

Due to the recent boom in artificial intelligence (AI) research, including computer vision (CV), it has become impossible for researchers in these fields to keep up with the exponentially increasing number of manuscripts.

Sentence

Understanding Fake Faces

no code implementations22 Sep 2018 Ryota Natsume, Kazuki Inoue, Yoshihiro Fukuhara, Shintaro Yamamoto, Shigeo Morishima, Hirokatsu Kataoka

Face recognition research is one of the most active topics in computer vision (CV), and deep neural networks (DNN) are now filling the gap between human-level and computer-driven performance levels in face verification algorithms.

Face Recognition Face Verification

RSGAN: Face Swapping and Editing using Face and Hair Representation in Latent Spaces

no code implementations10 Apr 2018 Ryota Natsume, Tatsuya Yatagawa, Shigeo Morishima

The proposed network independently handles face and hair appearances in the latent spaces, and then, face swapping is achieved by replacing the latent-space representations of the faces, and reconstruct the entire face image with them.

Attribute Face Swapping +1

Cannot find the paper you are looking for? You can Submit a new open access paper.