Search Results for author: Yoshimitsu Aoki

Found 20 papers, 12 papers with code

3D Human Scan With A Moving Event Camera

no code implementations • 12 Apr 2024 • Kai Kohyama, Shintaro Shiba, Yoshimitsu Aoki

The experimental results show that the proposed method outperforms conventional frame-based methods in the estimation accuracy of both pose and body mesh.

3D Pose Estimation, Event-based vision, +1

TAG: Guidance-free Open-Vocabulary Semantic Segmentation

1 code implementation • 17 Mar 2024 • Yasufumi Kawano, Yoshimitsu Aoki

Unsupervised and open-vocabulary segmentation, proposed to tackle these issues, faces challenges, including the inability to assign specific class labels to clusters and the necessity of user-provided text queries for guidance.

Open Vocabulary Semantic Segmentation, Segmentation, +2

MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation

1 code implementation • 17 Mar 2024 • Yasufumi Kawano, Yoshimitsu Aoki

Semantic segmentation is essential in computer vision for various applications, yet traditional approaches face significant challenges, including the high cost of annotation and extensive training for supervised learning.

Open Vocabulary Semantic Segmentation, Proper Noun, +2

Event-based Background-Oriented Schlieren

2 code implementations • 1 Nov 2023 • Shintaro Shiba, Friedhelm Hamann, Yoshimitsu Aoki, Guillermo Gallego

Schlieren imaging is an optical technique to observe the flow of transparent media, such as air or water, without any particle seeding.

Event-based Optical Flow, Optical Flow Estimation

Boosting Semantic Segmentation with Semantic Boundaries

1 code implementation • 19 Apr 2023 • Haruya Ishikawa, Yoshimitsu Aoki

Motivated by the recent development in improving semantic segmentation by incorporating boundaries as auxiliary tasks, we propose a multi-task framework that uses semantic boundary detection (SBD) as an auxiliary task.

Boundary Detection, Segmentation, +1

FindView: Precise Target View Localization Task for Look Around Agents

1 code implementation • 16 Mar 2023 • Haruya Ishikawa, Yoshimitsu Aoki

With the increasing demand for service robots and automated inspection, agents need to localize themselves in their surrounding environment to achieve more natural communication with humans through shared contexts.

Listening Human Behavior: 3D Human Pose Estimation With Acoustic Signals

no code implementations • CVPR 2023 • Yuto Shibata, Yutaka Kawashima, Mariko Isogawa, Go Irie, Akisato Kimura, Yoshimitsu Aoki

Aiming to capture subtle sound changes to reveal detailed pose information, we explicitly extract phase features from the acoustic signals together with typical spectrum features and feed them into our human pose estimation network.

3D Human Pose Estimation

Fast Event-based Optical Flow Estimation by Triplet Matching

no code implementations • 23 Dec 2022 • Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego

Event cameras are novel bio-inspired sensors that offer advantages over traditional cameras (low latency, high dynamic range, low power, etc.).

Event-based Optical Flow, Motion Estimation, +1

Document Shadow Removal with Foreground Detection Learning From Fully Synthetic Images

1 code implementation • 2022 • Yuhi Matsuo, Naofumi Akimoto, Yoshimitsu Aoki

In this paper, we present a large-scale and diverse dataset called fully synthetic document shadow removal dataset (FSDSRD) that does not require capturing documents.

Document Shadow Removal

Event Collapse in Contrast Maximization Frameworks

1 code implementation • 8 Jul 2022 • Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego

Contrast maximization (CMax) is a framework that provides state-of-the-art results on several event-based computer vision tasks, such as ego-motion or optical flow estimation.

Event-based Motion Estimation Optical Flow Estimation

Alleviating Over-segmentation Errors by Detecting Action Boundaries

2 code implementations • 14 Jul 2020 • Yuchi Ishikawa, Seito Kasai, Yoshimitsu Aoki, Hirokatsu Kataoka

Our model architecture consists of a long-term feature extractor and two branches: the Action Segmentation Branch (ASB) and the Boundary Regression Branch (BRB).

Action Classification, Action Segmentation, +2

Retrieving and Highlighting Action with Spatiotemporal Reference

1 code implementation • 19 May 2020 • Seito Kasai, Yuchi Ishikawa, Masaki Hayashi, Yoshimitsu Aoki, Kensho Hara, Hirokatsu Kataoka

In this paper, we present a framework that jointly retrieves and spatiotemporally highlights actions in videos by enhancing current deep cross-modal retrieval methods.

Action Recognition, Cross-Modal Retrieval, +5

Fast Soft Color Segmentation

no code implementations • CVPR 2020 • Naofumi Akimoto, Huachun Zhu, Yanghua Jin, Yoshimitsu Aoki

We address the problem of soft color segmentation, defined as decomposing a given image into several RGBA layers, each containing only homogeneous color regions.

Segmentation, Video Editing

Anticipating Traffic Accidents with Adaptive Loss and Large-scale Incident DB

no code implementations • CVPR 2018 • Tomoyuki Suzuki, Hirokatsu Kataoka, Yoshimitsu Aoki, Yutaka Satoh

In this paper, we propose a novel approach for traffic accident anticipation through (i) Adaptive Loss for Early Anticipation (AdaLEA) and (ii) a large-scale self-annotated incident database for anticipation.

Accident Anticipation

Dominant Codewords Selection with Topic Model for Action Recognition

no code implementations • 1 May 2016 • Hirokatsu Kataoka, Masaki Hayashi, Kenji Iwata, Yutaka Satoh, Yoshimitsu Aoki, Slobodan Ilic

Latent Dirichlet allocation (LDA) is used to develop approximations of human motion primitives; these are mid-level representations, and they adaptively integrate dominant vectors when classifying human activities.

Action Recognition, Temporal Action Localization
