Search Results for author: Masatoshi Okutomi

Found 48 papers, 9 papers with code

CFDNet: A Generalizable Foggy Stereo Matching Network with Contrastive Feature Distillation

no code implementations • 28 Feb 2024 • Zihua Liu, Yizhou Li, Masatoshi Okutomi

Stereo matching under foggy scenes remains a challenging task since the scattering effect degrades the visibility and results in less distinctive features for dense correspondence matching.

Contrastive Learning Depth Estimation +1

Paper
Add Code

Digging Into Normal Incorporated Stereo Matching

1 code implementation • ACM International Conference on Multimedia 2022 • Zihua Liu, Songyan Zhang, Zhicheng Wang, Masatoshi Okutomi

To enhance geometric consistency, especially in low-texture regions, the estimated normal map is then leveraged to calculate a local affinity matrix, providing the residual learning with information about where the correction should refer and thus improving the residual learning efficiency.

Disparity Estimation Stereo Matching

Paper
Code

Self-Supervised Spatially Variant PSF Estimation for Aberration-Aware Depth-from-Defocus

no code implementations • 28 Feb 2024 • Zhuofeng Wu, Yusuke Monno, Masatoshi Okutomi

In this paper, we address the task of aberration-aware depth-from-defocus (DfD), which takes account of spatially variant point spread functions (PSFs) of a real camera.

Depth Estimation Self-Supervised Learning

Paper
Add Code

Reflection Removal Using Recurrent Polarization-to-Polarization Network

1 code implementation • 28 Feb 2024 • Wenjiao Bian, Yusuke Monno, Masatoshi Okutomi

This paper addresses reflection removal, which is the task of separating reflection components from a captured image and deriving the image with only transmission components.

Reflection Removal

Paper
Code

Global Occlusion-Aware Transformer for Robust Stereo Matching

1 code implementation • 22 Dec 2023 • Zihua Liu, Yizhou Li, Masatoshi Okutomi

Despite the remarkable progress facilitated by learning-based stereo-matching algorithms, the performance in the ill-conditioned regions, such as the occluded regions, remains a bottleneck.

Disparity Estimation Occlusion Estimation +1

Paper
Code

Polarimetric PatchMatch Multi-View Stereo

no code implementations • 11 Nov 2023 • Jinyu Zhao, Jumpei Oishi, Yusuke Monno, Masatoshi Okutomi

PatchMatch Multi-View Stereo (PatchMatch MVS) is one of the popular MVS approaches, owing to its balanced accuracy and efficiency.

Stereo Matching

Paper
Add Code

EMR-MSF: Self-Supervised Recurrent Monocular Scene Flow Exploiting Ego-Motion Rigidity

no code implementations • ICCV 2023 • Zijie Jiang, Masatoshi Okutomi

In this paper, we propose a superior model named EMR-MSF by borrowing the advantages of network architecture design under the scope of supervised learning.

Motion Estimation Scene Flow Estimation +1

Paper
Add Code

Polarimetric Multi-View Inverse Rendering

no code implementations • 24 Dec 2022 • Jinyu Zhao, Yusuke Monno, Masatoshi Okutomi

We then refine the initial model by optimizing photometric rendering errors and polarimetric errors using multi-view RGB, AoP, and DoP images, where we propose a novel polarimetric cost function that enables an effective constraint on the estimated surface normal of each vertex, while considering four possible ambiguous azimuth angles revealed from the AoP measurement.

3D Reconstruction Inverse Rendering

Paper
Add Code

Dual-Pixel Raindrop Removal

no code implementations • 24 Oct 2022 • Yizhou Li, Yusuke Monno, Masatoshi Okutomi

In this paper, we propose the first method using a Dual-Pixel (DP) sensor to better address the raindrop removal.

Rain Removal

Paper
Add Code

Two-Step Color-Polarization Demosaicking Network

no code implementations • 13 Sep 2022 • Vy Nguyen, Masayuki Tanaka, Yusuke Monno, Masatoshi Okutomi

Polarization information of light in a scene is valuable for various image processing and computer vision tasks.

Demosaicking Vocal Bursts Valence Prediction

Paper
Add Code

Deep Hyperspectral-Depth Reconstruction Using Single Color-Dot Projection

no code implementations • CVPR 2022 • Chunyu Li, Yusuke Monno, Masatoshi Okutomi

To jointly reconstruct the depth and the hyperspectral reflectance from a single color-dot image, we propose a novel end-to-end network architecture that effectively incorporates a geometric color-dot pattern loss and a photometric hyperspectral reflectance loss.

Paper
Add Code

Self-Supervised Ego-Motion Estimation Based on Multi-Layer Fusion of RGB and Inferred Depth

1 code implementation • 3 Mar 2022 • Zijie Jiang, Hajime Taira, Naoyuki Miyashita, Masatoshi Okutomi

In this paper, we investigate the effect of different fusion strategies for ego-motion estimation and propose a new framework for self-supervised learning of depth and ego-motion estimation, which performs ego-motion estimation by leveraging RGB and inferred depth information in a Multi-Layer Fusion manner.

Motion Estimation Self-Supervised Learning

Paper
Code

Single Image Deraining Network with Rain Embedding Consistency and Layered LSTM

1 code implementation • 5 Nov 2021 • Yizhou Li, Yusuke Monno, Masatoshi Okutomi

For this purpose, an encoder-decoder network draws wide attention, where the encoder is required to encode a high-quality rain embedding which determines the performance of the subsequent decoding stage to reconstruct the rain layer.

Single Image Deraining

Paper
Code

Learning-Based Depth and Pose Estimation for Monocular Endoscope with Loss Generalization

no code implementations • 28 Jul 2021 • Aji Resindra Widya, Yusuke Monno, Masatoshi Okutomi, Sho Suzuki, Takuji Gotoda, Kenji Miki

In addition, we propose a novel generalized photometric loss function to avoid the complicated process of finding proper weights for balancing the depth and the pose loss terms, which is required for existing direct depth and pose supervision approaches.

3D Reconstruction Pose Estimation

Paper
Add Code

Multi-Modal Pedestrian Detection with Large Misalignment Based on Modal-Wise Regression and Multi-Modal IoU

no code implementations • 23 Jul 2021 • Napat Wanchaitanawong, Masayuki Tanaka, Takashi Shibata, Masatoshi Okutomi

Due to this assumption's breakdown, the position of the bounding boxes does not match between the two modalities, resulting in a significant decrease in detection accuracy, especially in regions where the amount of misalignment is large.

Pedestrian Detection regression

Paper
Add Code

Geometric Data Augmentation Based on Feature Map Ensemble

no code implementations • 22 Jul 2021 • Takashi Shibata, Masayuki Tanaka, Masatoshi Okutomi

The performance of CNN is dramatically degraded by geometric transformation, such as large rotations.

Data Augmentation

Paper
Add Code

Video-Based Camera Localization Using Anchor View Detection and Recursive 3D Reconstruction

no code implementations • 7 Jul 2021 • Hajime Taira, Koki Onbe, Naoyuki Miyashita, Masatoshi Okutomi

In this paper we introduce a new camera localization strategy designed for image sequences captured in challenging industrial situations such as industrial parts inspection.

3D Reconstruction Camera Localization

Paper
Add Code

Spectral MVIR: Joint Reconstruction of 3D Shape and Spectral Reflectance

no code implementations • 15 Apr 2021 • Chunyu Li, Yusuke Monno, Masatoshi Okutomi

Reconstructing an object's high-quality 3D shape with inherent spectral reflectance property, beyond typical device-dependent RGB albedos, opens the door to applications requiring a high-fidelity 3D model in terms of both geometry and photometry.

Inverse Rendering

Paper
Add Code

VIO-Aided Structure from Motion Under Challenging Environments

no code implementations • 24 Jan 2021 • Zijie Jiang, Hajime Taira, Naoyuki Miyashita, Masatoshi Okutomi

In this paper, we present a robust and efficient Structure from Motion pipeline for accurate 3D reconstruction under challenging environments by leveraging the camera pose information from a visual-inertial odometry.

3D Reconstruction Image Registration

Paper
Add Code

Spectral Reflectance Estimation Using Projector with Unknown Spectral Power Distribution

no code implementations • 18 Dec 2020 • Hironori Hidaka, Yusuke Monno, Masatoshi Okutomi

A lighting-based multispectral imaging system using an RGB camera and a projector is one of the most practical and low-cost systems to acquire multispectral observations for estimating the scene's spectral reflectance information.

Paper
Add Code

Deep Snapshot HDR Imaging Using Multi-Exposure Color Filter Array

no code implementations • 20 Nov 2020 • Takeru Suda, Masayuki Tanaka, Yusuke Monno, Masatoshi Okutomi

In this paper, we propose a deep snapshot high dynamic range (HDR) imaging framework that can effectively reconstruct an HDR image from the RAW data captured using a multi-exposure color filter array (ME-CFA), which consists of a mosaic pattern of RGB filters with different exposure levels.

Image Reconstruction

Paper
Add Code

Adaptive Future Frame Prediction with Ensemble Network

no code implementations • 13 Nov 2020 • Wonjik Kim, Masayuki Tanaka, Masatoshi Okutomi, Yoko SASAKI

A common limitation of the existing learning-based approaches is a mismatch of training data and test data.

Paper
Add Code

Human Segmentation with Dynamic LiDAR Data

1 code implementation • 16 Oct 2020 • Tao Zhong, Wonjik Kim, Masayuki Tanaka, Masatoshi Okutomi

It has a two-branch structure, i. e., the spatial segmentation branch and the temporal velocity estimation branch.

Segmentation

Paper
Code

Monochrome and Color Polarization Demosaicking Using Edge-Aware Residual Interpolation

no code implementations • 28 Jul 2020 • Miki Morimatsu, Yusuke Monno, Masayuki Tanaka, Masatoshi Okutomi

Since the polarimeter consists of an image sensor equipped with a monochrome or color polarization filter array (MPFA or CPFA), the demosaicking process to interpolate missing pixel values plays a crucial role in obtaining high-quality polarization images.

Demosaicking

Paper
Add Code

Polarimetric Multi-View Inverse Rendering

no code implementations • ECCV 2020 • Jinyu Zhao, Yusuke Monno, Masatoshi Okutomi

A polarization camera has great potential for 3D reconstruction since the angle of polarization (AoP) of reflected light is related to an object's surface normal.

3D Reconstruction Inverse Rendering

Paper
Add Code

3D Pipe Network Reconstruction Based on Structure from Motion with Incremental Conic Shape Detection and Cylindrical Constraint

no code implementations • 18 Jun 2020 • Sho kagami, Hajime Taira, Naoyuki Miyashita, Akihiko Torii, Masatoshi Okutomi

Pipe inspection is a critical task for many industries and infrastructure of a city.

Position

Paper
Add Code

Classifying degraded images over various levels of degradation

no code implementations • 15 Jun 2020 • Kazuki Endo, Masayuki Tanaka, Masatoshi Okutomi

Classification for degraded images having various levels of degradation is very important in practical applications.

Classification Ensemble Learning +1

Paper
Add Code

Stomach 3D Reconstruction Based on Virtual Chromoendoscopic Image Generation

no code implementations • 26 Apr 2020 • Aji Resindra Widya, Yusuke Monno, Masatoshi Okutomi, Sho Suzuki, Takuji Gotoda, Kenji Miki

Gastric endoscopy is a standard clinical process that enables medical practitioners to diagnose various lesions inside a patient's stomach.

3D Reconstruction Image Generation +1

Paper
Add Code

Learning-Based Human Segmentation and Velocity Estimation Using Automatic Labeled LiDAR Sequence for Training

no code implementations • 11 Mar 2020 • Wonjik Kim, Masayuki Tanaka, Masatoshi Okutomi, Yoko SASAKI

In this paper, we propose an automatic labeled sequential data generation pipeline for human segmentation and velocity estimation with point clouds.

Segmentation

Paper
Add Code

Pro-Cam SSfM: Projector-Camera System for Structure and Spectral Reflectance from Motion

no code implementations • ICCV 2019 • Chunyu Li, Yusuke Monno, Hironori Hidaka, Masatoshi Okutomi

In this paper, we propose a novel projector-camera system for practical and low-cost acquisition of a dense object 3D model with the spectral reflectance property.

3D Reconstruction

Paper
Add Code

Is This The Right Place? Geometric-Semantic Pose Verification for Indoor Visual Localization

no code implementations • ICCV 2019 • Hajime Taira, Ignacio Rocco, Jiri Sedlar, Masatoshi Okutomi, Josef Sivic, Tomas Pajdla, Torsten Sattler, Akihiko Torii

The pose with the largest geometric consistency with the query image, e. g., in the form of an inlier count, is then selected in a second stage.

Visual Localization

Paper
Add Code

3D Reconstruction of Whole Stomach from Endoscope Video Using Structure-from-Motion

no code implementations • 30 May 2019 • Aji Resindra Widya, Yusuke Monno, Kosuke Imahori, Masatoshi Okutomi, Sho Suzuki, Takuji Gotoda, Kenji Miki

In this work, we investigated how to enable structure-from-motion (SfM) to reconstruct the whole shape of a stomach from a standard endoscope video.

3D Reconstruction

Paper
Add Code

Improving Transparency of Deep Neural Inference Process

no code implementations • 13 Mar 2019 • Hiroshi Kuwajima, Masayuki Tanaka, Masatoshi Okutomi

However, the inference process of deep learning is black-box, and not very suitable to safety-critical systems which must exhibit high transparency.

Paper
Add Code

Automatic Labeled LiDAR Data Generation based on Precise Human Model

1 code implementation • 14 Feb 2019 • Wonjik Kim, Masayuki Tanaka, Masatoshi Okutomi, Yoko SASAKI

We present 500k+ data generated by the proposed pipeline.

Paper
Code

Estimation and Restoration of Compositional Degradation Using Convolutional Neural Networks

no code implementations • 23 Dec 2018 • Kazutaka Uchida, Masayuki Tanaka, Masatoshi Okutomi

However, image degradations in practice are often a mixture of several types of degradation.

Image Restoration

Paper
Add Code

Gradient-Based Low-Light Image Enhancement

no code implementations • 25 Sep 2018 • Masayuki Tanaka, Takashi Shibata, Masatoshi Okutomi

A low-light image enhancement is a highly demanded image processing technique, especially for consumer digital cameras and cameras on mobile phones.

Low-Light Image Enhancement

Paper
Add Code

Non-blind Image Restoration Based on Convolutional Neural Network

no code implementations • 11 Sep 2018 • Kazutaka Uchida, Masayuki Tanaka, Masatoshi Okutomi

Blind image restoration processors based on convolutional neural network (CNN) are intensively researched because of their high performance.

Image Restoration

Paper
Add Code

Joint optimization for compressive video sensing and reconstruction under hardware constraints

no code implementations • ECCV 2018 • Michitaka Yoshida, Akihiko Torii, Masatoshi Okutomi, Kenta Endo, Yukinobu Sugiyama, Rin-ichiro Taniguchi, Hajime Nagahara

Compressive video sensing is the process of encoding multiple sub-frames into a single frame with controlled sensor exposures and reconstructing the sub-frames from the single compressed frame.

Compressive Sensing

Paper
Add Code

Structure-from-Motion using Dense CNN Features with Keypoint Relocalization

no code implementations • 10 May 2018 • Aji Resindra Widya, Akihiko Torii, Masatoshi Okutomi

Then, we demonstrate on the Aachen Day-Night dataset that the proposed SfM using dense CNN features with the keypoint relocalization outperforms a state-of-the-art SfM (COLMAP using RootSIFT) by a large margin.

Paper
Add Code

InLoc: Indoor Visual Localization with Dense Matching and View Synthesis

1 code implementation • CVPR 2018 • Hajime Taira, Masatoshi Okutomi, Torsten Sattler, Mircea Cimpoi, Marc Pollefeys, Josef Sivic, Tomas Pajdla, Akihiko Torii

We seek to predict the 6 degree-of-freedom (6DoF) pose of a query photograph with respect to a large indoor 3D map.

Indoor Localization Pose Estimation +2

Paper
Code

Misalignment-Robust Joint Filter for Cross-Modal Image Pairs

no code implementations • ICCV 2017 • Takashi Shibata, Masayuki Tanaka, Masatoshi Okutomi

Next, the joint-filter cost volume and a set of filtered images are computed from the target image and the set of the translated guidances.

Paper
Add Code

Benchmarking 6DOF Outdoor Visual Localization in Changing Conditions

2 code implementations • CVPR 2018 • Torsten Sattler, Will Maddern, Carl Toft, Akihiko Torii, Lars Hammarstrand, Erik Stenborg, Daniel Safari, Masatoshi Okutomi, Marc Pollefeys, Josef Sivic, Fredrik Kahl, Tomas Pajdla

Visual localization enables autonomous vehicles to navigate in their surroundings and augmented reality applications to link virtual to real worlds.

Autonomous Vehicles Benchmarking +3

745

Paper
Code

Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?

no code implementations • CVPR 2017 • Torsten Sattler, Akihiko Torii, Josef Sivic, Marc Pollefeys, Hajime Taira, Masatoshi Okutomi, Tomas Pajdla

3D structure-based methods employ 3D models of the scene to estimate the full 6DOF pose of a camera very accurately.

Autonomous Navigation Image Retrieval +2

Paper
Add Code

Gradient-Domain Image Reconstruction Framework With Intensity-Range and Base-Structure Constraints

no code implementations • CVPR 2016 • Takashi Shibata, Masayuki Tanaka, Masatoshi Okutomi

To generate detail-preserving and artifact-free output images, we combine the benefits of the two approaches into the proposed framework by introducing the intensity-range constraint and the base-structure constraint.

Image Reconstruction Image Restoration +1

Paper
Add Code

24/7 Place Recognition by View Synthesis

no code implementations • CVPR 2015 • Akihiko Torii, Relja Arandjelovic, Josef Sivic, Masatoshi Okutomi, Tomas Pajdla

We address the problem of large-scale visual place recognition for situations where the scene undergoes a major change in appearance, for example, due to illumination (day/night), change of seasons, aging, or structural modifications over time such as buildings built or destroyed.

Visual Place Recognition

Paper
Add Code

A General and Simple Method for Camera Pose and Focal Length Determination

no code implementations • CVPR 2014 • Yinqiang Zheng, Shigeki Sugimoto, Imari Sato, Masatoshi Okutomi

In this paper, we revisit the pose determination problem of a partially calibrated camera with unknown focal length, hereafter referred to as the PnPf problem, by using n (n ≥ 4) 3D-to-2D point correspondences.

Paper
Add Code

A Practical Rank-Constrained Eight-Point Algorithm for Fundamental Matrix Estimation

no code implementations • CVPR 2013 • Yinqiang Zheng, Shigeki Sugimoto, Masatoshi Okutomi

Due to its simplicity, the eight-point algorithm has been widely used in fundamental matrix estimation.

Paper
Add Code

Visual Place Recognition with Repetitive Structures

no code implementations • CVPR 2013 • Akihiko Torii, Josef Sivic, Tomas Pajdla, Masatoshi Okutomi

Even more importantly, they violate the feature independence assumed in the bag-of-visual-words representation which often leads to over-counting evidence and significant degradation of retrieval performance.

Retrieval Visual Place Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.