Search Results for author: Shigeo Morishima

Found 25 papers, 7 papers with code

Gaze-Driven Sentence Simplification for Language Learners: Enhancing Comprehension and Readability

no code implementations • 30 Sep 2023 • Taichi Higasa, Keitaro Tanaka, Qi Feng, Shigeo Morishima

Language learners should regularly engage in reading challenging materials as part of their study routine.

Reading Comprehension Sentence

Paper
Add Code

Pointing out Human Answer Mistakes in a Goal-Oriented Visual Dialogue

no code implementations • 19 Sep 2023 • Ryosuke Oshima, Seitaro Shinagawa, Hideki Tsunashima, Qi Feng, Shigeo Morishima

Effective communication between humans and intelligent agents has promising applications for solving complex problems.

Language Modelling

Paper
Add Code

Enhancing Perception and Immersion in Pre-Captured Environments through Learning-Based Eye Height Adaptation

no code implementations • 24 Aug 2023 • Qi Feng, Hubert P. H. Shum, Shigeo Morishima

To explore this influence, we first propose a pilot study that captures real environments with multiple eye heights and asks participants to judge the egocentric distances and immersion.

Semantic Segmentation

Paper
Add Code

Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction

no code implementations • 10 Jun 2023 • Tomoya Yoshinaga, Keitaro Tanaka, Shigeo Morishima

This paper describes an audio-visual speech enhancement (AV-SE) method that estimates from noisy input audio a mixture of the speech of the speaker appearing in an input video (on-screen target speech) and of a selected speaker not appearing in the video (off-screen target speech).

Computational Efficiency Speech Enhancement +1

Paper
Add Code

Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning

no code implementations • 23 May 2023 • Sara Kashiwagi, Keitaro Tanaka, Qi Feng, Shigeo Morishima

This paper presents a novel metric learning approach to address the performance gap between normal and silent speech in visual speech recognition (VSR).

Metric Learning speech-recognition +1

Paper
Add Code

Memory Efficient Diffusion Probabilistic Models via Patch-based Generation

no code implementations • 14 Apr 2023 • Shinei Arakawa, Hideki Tsunashima, Daichi Horita, Keitaro Tanaka, Shigeo Morishima

Second, we propose Global Content Conditioning (GCC) to ensure patches have coherent content when concatenated together.

Paper
Add Code

Scapegoat Generation for Privacy Protection from Deepfake

no code implementations • 6 Mar 2023 • Gido Kato, Yoshihiro Fukuhara, Mariko Isogawa, Hideki Tsunashima, Hirokatsu Kataoka, Shigeo Morishima

To protect privacy and prevent malicious use of deepfake, current studies propose methods that interfere with the generation process, such as detection and destruction approaches.

Face Swapping

Paper
Add Code

Event-based Camera Simulation using Monte Carlo Path Tracing with Adaptive Denoising

1 code implementation • 5 Mar 2023 • Yuta Tsuji, Tatsuya Yatagawa, Hiroyuki Kubo, Shigeo Morishima

This paper presents an algorithm to obtain an event-based video from noisy frames given by physics-based Monte Carlo path tracing over a synthetic 3D scene.

Denoising regression

Paper
Code

Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos

1 code implementation • 19 Jul 2022 • Tanqiu Qiao, Qianhui Men, Frederick W. B. Li, Yoshiki Kubotani, Shigeo Morishima, Hubert P. H. Shum

Consider that geometric features such as human pose and object position provide meaningful information to understand HOIs, we argue to combine the benefits of both visual and geometric features in HOI recognition, and propose a novel Two-level Geometric feature-informed Graph Convolutional Network (2G-GCN).

Human-Object Interaction Detection

Paper
Code

The Sound of Bounding-Boxes

no code implementations • 30 Mar 2022 • Takashi Oya, Shohei Iwase, Shigeo Morishima

To tackle this problem, we propose a fully unsupervised method that learns to detect objects in an image and separate sound source simultaneously.

Paper
Add Code

Community-Driven Comprehensive Scientific Paper Summarization: Insight from cvpaper.challenge

no code implementations • 17 Mar 2022 • Shintaro Yamamoto, Hirokatsu Kataoka, Ryota Suzuki, Seitaro Shinagawa, Shigeo Morishima

To alleviate this problem, we organized a group of non-native English speakers to write summaries of papers presented at a computer vision conference to share the knowledge of the papers read by the group.

Paper
Add Code

360 Depth Estimation in the Wild -- The Depth360 Dataset and the SegFuse Network

1 code implementation • 16 Feb 2022 • Qi Feng, Hubert P. H. Shum, Shigeo Morishima

In this work, we first establish a large-scale dataset with varied settings called Depth360 to tackle the training data problem.

Autonomous Driving Depth Estimation +2

Paper
Code

RLTutor: Reinforcement Learning Based Adaptive Tutoring System by Modeling Virtual Student with Fewer Interactions

1 code implementation • 31 Jul 2021 • Yoshiki Kubotani, Yoshihiro Fukuhara, Shigeo Morishima

However, optimization using reinforcement learning requires a large number of interactions, and thus it cannot be applied directly to actual students.

Decision Making reinforcement-learning +1

Paper
Code

LSTM-SAKT: LSTM-Encoded SAKT-like Transformer for Knowledge Tracing

no code implementations • 28 Jan 2021 • Takashi Oya, Shigeo Morishima

This paper introduces the 2nd place solution for the Riiid!

Knowledge Tracing

Paper
Add Code

Self-Supervised Learning for Visual Summary Identification in Scientific Publications

no code implementations • 21 Dec 2020 • Shintaro Yamamoto, Anne Lauscher, Simone Paolo Ponzetto, Goran Glavaš, Shigeo Morishima

Providing visual summaries of scientific publications can increase information access for readers and thereby help deal with the exponential growth in the number of scientific publications.

Self-Supervised Learning

Paper
Add Code

Do We Need Sound for Sound Source Localization?

no code implementations • 11 Jul 2020 • Takashi Oya, Shohei Iwase, Ryota Natsume, Takahiro Itazuri, Shugo Yamaguchi, Shigeo Morishima

Moreover, we show that the majority of sound-producing objects within the samples in this dataset can be inherently identified using only visual information, and thus that the dataset is inadequate to evaluate a system's capability to leverage aural information.

Paper
Add Code

MirrorNet: A Deep Bayesian Approach to Reflective 2D Pose Estimation from Human Images

no code implementations • 8 Apr 2020 • Takayuki Nakatsuka, Kazuyoshi Yoshii, Yuki Koyama, Satoru Fukayama, Masataka Goto, Shigeo Morishima

Specifically, we formulate a hierarchical generative model of poses and images by integrating a deep generative model of poses from pose features with that of images from poses and image features.

2D Pose Estimation Pose Estimation

Paper
Add Code

Learning with Protection: Rejection of Suspicious Samples under Adversarial Environment

no code implementations • 25 Sep 2019 • Masahiro Kato, Yoshihiro Fukuhara, Hirokatsu Kataoka, Shigeo Morishima

Our main idea is to apply a framework of learning with rejection and adversarial examples to assist in the decision making for such suspicious samples.

BIG-bench Machine Learning Binary Classification +3

Paper
Add Code

What Do Adversarially Robust Models Look At?

1 code implementation • 19 May 2019 • Takahiro Itazuri, Yoshihiro Fukuhara, Hirokatsu Kataoka, Shigeo Morishima

In this paper, we address the open question: "What do adversarially robust models look at?"

Adversarial Robustness Open-Ended Question Answering

Paper
Code

PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization

1 code implementation • ICCV 2019 • Shunsuke Saito, Zeng Huang, Ryota Natsume, Shigeo Morishima, Angjoo Kanazawa, Hao Li

We introduce Pixel-aligned Implicit Function (PIFu), a highly effective implicit representation that locally aligns pixels of 2D images with the global context of their corresponding 3D object.

Ranked #1 on 3D Object Reconstruction on RenderPeople

3D Human Pose Estimation 3D Human Reconstruction +3

1,734

Paper
Code

SiCloPe: Silhouette-Based Clothed People

1 code implementation • CVPR 2019 • Ryota Natsume, Shunsuke Saito, Zeng Huang, Weikai Chen, Chongyang Ma, Hao Li, Shigeo Morishima

The synthesized silhouettes which are the most consistent with the input segmentation are fed into a deep visual hull algorithm for robust 3D shape prediction.

Generative Adversarial Network Image-to-Image Translation

Paper
Code

FSNet: An Identity-Aware Generative Model for Image-based Face Swapping

no code implementations • 30 Nov 2018 • Ryota Natsume, Tatsuya Yatagawa, Shigeo Morishima

We herein represent the face region with a latent variable that is assigned with the proposed deep neural network (DNN) instead of facial textures.

Face Swapping

Paper
Add Code

Automatic Paper Summary Generation from Visual and Textual Information

no code implementations • 16 Nov 2018 • Shintaro Yamamoto, Yoshihiro Fukuhara, Ryota Suzuki, Shigeo Morishima, Hirokatsu Kataoka

Due to the recent boom in artificial intelligence (AI) research, including computer vision (CV), it has become impossible for researchers in these fields to keep up with the exponentially increasing number of manuscripts.

Sentence

Paper
Add Code

Understanding Fake Faces

no code implementations • 22 Sep 2018 • Ryota Natsume, Kazuki Inoue, Yoshihiro Fukuhara, Shintaro Yamamoto, Shigeo Morishima, Hirokatsu Kataoka

Face recognition research is one of the most active topics in computer vision (CV), and deep neural networks (DNN) are now filling the gap between human-level and computer-driven performance levels in face verification algorithms.

Face Recognition Face Verification

Paper
Add Code

RSGAN: Face Swapping and Editing using Face and Hair Representation in Latent Spaces

no code implementations • 10 Apr 2018 • Ryota Natsume, Tatsuya Yatagawa, Shigeo Morishima

The proposed network independently handles face and hair appearances in the latent spaces, and then, face swapping is achieved by replacing the latent-space representations of the faces, and reconstruct the entire face image with them.

Attribute Face Swapping +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.