Search Results for author: Yuchao Dai

Found 120 papers, 39 papers with code

3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis

no code implementations • 9 Apr 2024 • Zhicheng Lu, Xiang Guo, Le Hui, Tianrui Chen, Min Yang, Xiao Tang, Feng Zhu, Yuchao Dai

In this way, our solution achieves 3D geometry-aware deformation modeling, which enables improved dynamic view synthesis and 3D dynamic reconstruction.

Dynamic Reconstruction

Paper
Add Code

LRRU: Long-short Range Recurrent Updating Networks for Depth Completion

no code implementations • ICCV 2023 • YuFei Wang, Bo Li, Ge Zhang, Qi Liu, Tao Gao, Yuchao Dai

Existing deep learning-based depth completion methods generally employ massive stacked layers to predict the dense depth map from sparse input data.

Depth Completion

Paper
Add Code

Multimodal Variational Auto-encoder based Audio-Visual Segmentation

1 code implementation • ICCV 2023 • Yuxin Mao, Jing Zhang, Mochu Xiang, Yiran Zhong, Yuchao Dai

To achieve this, our ECMVAE factorizes the representations of each modality with a modality-shared representation and a modality-specific representation.

Attribute Representation Learning

Paper
Code

Forward Flow for Novel View Synthesis of Dynamic Scenes

no code implementations • ICCV 2023 • Xiang Guo, Jiadai Sun, Yuchao Dai, GuanYing Chen, Xiaoqing Ye, Xiao Tan, Errui Ding, Yumeng Zhang, Jingdong Wang

This paper proposes a neural radiance field (NeRF) approach for novel view synthesis of dynamic scenes using forward warping.

Novel View Synthesis

Paper
Add Code

RPEFlow: Multimodal Fusion of RGB-PointCloud-Event for Joint Optical Flow and Scene Flow Estimation

1 code implementation • ICCV 2023 • Zhexiong Wan, Yuxin Mao, Jing Zhang, Yuchao Dai

Recently, the RGB images and point clouds fusion methods have been proposed to jointly estimate 2D optical flow and 3D scene flow.

Optical Flow Estimation Scene Flow Estimation

Paper
Code

Decomposed Guided Dynamic Filters for Efficient RGB-Guided Depth Completion

no code implementations • 5 Sep 2023 • YuFei Wang, Yuxin Mao, Qi Liu, Yuchao Dai

The decomposed filters not only maintain the favorable properties of guided dynamic filters as being content-dependent and spatially-variant, but also reduce model parameters and hardware costs, as the learned adaptors are decoupled with the number of feature channels.

Depth Completion object-detection +2

Paper
Add Code

Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling

no code implementations • 18 Aug 2023 • Haorui Ji, Hui Deng, Yuchao Dai, Hongdong Li

Most of the previous 3D human pose estimation work relied on the powerful memory capability of the network to obtain suitable 2D-3D mappings from the training data.

3D Human Pose Estimation 3D Pose Estimation

Paper
Add Code

Improving Audio-Visual Segmentation with Bidirectional Generation

no code implementations • 16 Aug 2023 • Dawei Hao, Yuxin Mao, Bowen He, Xiaodong Han, Yuchao Dai, Yiran Zhong

In this paper, inspired by the human ability to mentally simulate the sound of an object and its visual appearance, we introduce a bidirectional generation framework.

Motion Estimation Object +2

Paper
Add Code

Digging into Depth Priors for Outdoor Neural Radiance Fields

no code implementations • 8 Aug 2023 • Chen Wang, Jiadai Sun, Lina Liu, Chenming Wu, Zhelun Shen, Dayan Wu, Yuchao Dai, Liangjun Zhang

However, the shape-radiance ambiguity of radiance fields remains a challenge, especially in the sparse viewpoints setting.

Novel View Synthesis

Paper
Add Code

Digging Into Uncertainty-based Pseudo-label for Robust Stereo Matching

1 code implementation • 31 Jul 2023 • Zhelun Shen, Xibin Song, Yuchao Dai, Dingfu Zhou, Zhibo Rao, Liangjun Zhang

Due to the domain differences and unbalanced disparity distribution across multiple datasets, current stereo matching approaches are commonly limited to a specific dataset and generalize poorly to others.

Monocular Depth Estimation Pseudo Label +1

Paper
Code

Transferable Attack for Semantic Segmentation

1 code implementation • 31 Jul 2023 • Mengqi He, Jing Zhang, Zhaoyuan Yang, Mingyi He, Nick Barnes, Yuchao Dai

We analysis performance of semantic segmentation models wrt.

Data Augmentation Segmentation +1

Paper
Code

Contrastive Conditional Latent Diffusion for Audio-visual Segmentation

no code implementations • 31 Jul 2023 • Yuxin Mao, Jing Zhang, Mochu Xiang, Yunqiu Lv, Yiran Zhong, Yuchao Dai

We propose a latent diffusion model with contrastive learning for audio-visual segmentation (AVS) to extensively explore the contribution of audio.

Contrastive Learning Denoising +2

Paper
Add Code

Measuring and Modeling Uncertainty Degree for Monocular Depth Estimation

no code implementations • 19 Jul 2023 • Mochu Xiang, Jing Zhang, Nick Barnes, Yuchao Dai

Effectively measuring and modeling the reliability of a trained model is essential to the real-world deployment of monocular depth estimation (MDE) models.

Monocular Depth Estimation

Paper
Add Code

Linearized Relative Positional Encoding

no code implementations • 18 Jul 2023 • Zhen Qin, Weixuan Sun, Kaiyue Lu, Hui Deng, Dongxu Li, Xiaodong Han, Yuchao Dai, Lingpeng Kong, Yiran Zhong

Meanwhile, it emphasizes a general paradigm for designing broadly more relative positional encoding methods that are applicable to linear transformers.

Image Classification Language Modelling +2

Paper
Add Code

Joint Salient Object Detection and Camouflaged Object Detection via Uncertainty-aware Learning

no code implementations • 10 Jul 2023 • Aixuan Li, Jing Zhang, Yunqiu Lv, Tong Zhang, Yiran Zhong, Mingyi He, Yuchao Dai

In this case, salient objects are typically non-camouflaged, and camouflaged objects are usually not salient.

Attribute Contrastive Learning +5

Paper
Add Code

Weakly-supervised Contrastive Learning for Unsupervised Object Discovery

1 code implementation • 7 Jul 2023 • Yunqiu Lv, Jing Zhang, Nick Barnes, Yuchao Dai

Unsupervised object discovery (UOD) refers to the task of discriminating the whole region of objects from the background within a scene without relying on labeled datasets, which benefits the task of bounding-box-level localization and pixel-level segmentation.

Contrastive Learning Image Reconstruction +4

Paper
Code

Mutual Information Regularization for Weakly-supervised RGB-D Salient Object Detection

1 code implementation • 6 Jun 2023 • Aixuan Li, Yuxin Mao, Jing Zhang, Yuchao Dai

In particular, following the principle of disentangled representation learning, we introduce a mutual information upper bound with a mutual information minimization regularizer to encourage the disentangled representation of each modality for salient object detection.

Object object-detection +3

Paper
Code

Toeplitz Neural Network for Sequence Modeling

2 code implementations • 8 May 2023 • Zhen Qin, Xiaodong Han, Weixuan Sun, Bowen He, Dong Li, Dongxu Li, Yuchao Dai, Lingpeng Kong, Yiran Zhong

Sequence modeling has important applications in natural language processing and computer vision.

Language Modelling Position

Paper
Code

A Revisit of the Normalized Eight-Point Algorithm and A Self-Supervised Deep Solution

no code implementations • 21 Apr 2023 • Bin Fan, Yuchao Dai, Yongduek Seo, Mingyi He

The normalized eight-point algorithm has been widely viewed as the cornerstone in two-view geometry computation, where the seminal Hartley's normalization has greatly improved the performance of the direct linear transformation algorithm.

Self-Supervised Learning

Paper
Add Code

The Second Monocular Depth Estimation Challenge

no code implementations • 14 Apr 2023 • Jaime Spencer, C. Stella Qian, Michaela Trescakova, Chris Russell, Simon Hadfield, Erich W. Graf, Wendy J. Adams, Andrew J. Schofield, James Elder, Richard Bowden, Ali Anwar, Hao Chen, Xiaozhi Chen, Kai Cheng, Yuchao Dai, Huynh Thai Hoa, Sadat Hossain, Jianmian Huang, Mohan Jing, Bo Li, Chao Li, Baojun Li, Zhiwen Liu, Stefano Mattoccia, Siegfried Mercelis, Myungwoo Nam, Matteo Poggi, Xiaohua Qi, Jiahui Ren, Yang Tang, Fabio Tosi, Linh Trinh, S. M. Nadim Uddin, Khan Muhammad Umair, Kaixuan Wang, YuFei Wang, Yixing Wang, Mochu Xiang, Guangkai Xu, Wei Yin, Jun Yu, Qi Zhang, Chaoqiang Zhao

This paper discusses the results for the second edition of the Monocular Depth Estimation Challenge (MDEC).

Monocular Depth Estimation

Paper
Add Code

Fine-grained Audible Video Description

1 code implementation • CVPR 2023 • Xuyang Shen, Dong Li, Jinxing Zhou, Zhen Qin, Bowen He, Xiaodong Han, Aixuan Li, Yuchao Dai, Lingpeng Kong, Meng Wang, Yu Qiao, Yiran Zhong

We explore a new task for audio-visual-language modeling called fine-grained audible video description (FAVD).

Language Modelling Masked Language Modeling +5

Paper
Code

Event-guided Multi-patch Network with Self-supervision for Non-uniform Motion Deblurring

1 code implementation • 14 Feb 2023 • Hongguang Zhang, Limeng Zhang, Yuchao Dai, Hongdong Li, Piotr Koniusz

Contemporary deep learning multi-scale deblurring models suffer from many issues: 1) They perform poorly on non-uniformly blurred images/videos; 2) Simply increasing the model depth with finer-scale levels cannot improve deblurring; 3) Individual RGB frames contain a limited motion information for deblurring; 4) Previous models have a limited robustness to spatial transformations and noise.

Deblurring

186

Paper
Code

Efficient LiDAR Point Cloud Oversegmentation Network

no code implementations • ICCV 2023 • Le Hui, Linghua Tang, Yuchao Dai, Jin Xie, Jian Yang

Then, to generate homogeneous superpoints from the sparse LiDAR point cloud, we propose a LiDAR point grouping algorithm that simultaneously considers the similarity of point embeddings and the Euclidean distance of points in 3D space.

LIDAR Semantic Segmentation Semantic Segmentation

Paper
Add Code

Joint Appearance and Motion Learning for Efficient Rolling Shutter Correction

1 code implementation • CVPR 2023 • Bin Fan, Yuxin Mao, Yuchao Dai, Zhexiong Wan, Qi Liu

Rolling shutter correction (RSC) is becoming increasingly popular for RS cameras that are widely used in commercial and industrial applications.

Data Augmentation Rolling Shutter Correction

Paper
Code

Modeling the Distributional Uncertainty for Salient Object Detection Models

no code implementations • CVPR 2023 • Xinyu Tian, Jing Zhang, Mochu Xiang, Yuchao Dai

Most of the existing salient object detection (SOD) models focus on improving the overall model performance, without explicitly explaining the discrepancy between the training and testing distributions.

Long-tail Learning Object +3

Paper
Add Code

Masked Representation Learning for Domain Generalized Stereo Matching

no code implementations • CVPR 2023 • Zhibo Rao, Bangshu Xiong, Mingyi He, Yuchao Dai, Renjie He, Zhelun Shen, Xing Li

Experimental results on multi-datasets show that: (1) our method can be easily plugged into the current various stereo matching models to improve generalization performance; (2) our method can reduce the significant volatility of generalization performance among different training epochs; (3) we find that the current methods prefer to choose the best results among different training epochs as generalization performance, but it is impossible to select the best performance by ground truth in practice.

Image Reconstruction Multi-Task Learning +2

Paper
Add Code

Learning Dense and Continuous Optical Flow from an Event Camera

1 code implementation • 16 Nov 2022 • Zhexiong Wan, Yuchao Dai, Yuxin Mao

In this paper, we propose a novel deep learning-based dense and continuous optical flow estimation framework from a single image with event streams, which facilitates the accurate perception of high-speed motion.

Optical Flow Estimation

Paper
Code

CU-Net: LiDAR Depth-Only Completion With Coupled U-Net

1 code implementation • 26 Oct 2022 • YuFei Wang, Yuchao Dai, Qi Liu, Peng Yang, Jiadai Sun, Bo Li

We find that existing depth-only methods can obtain satisfactory results in the areas where the measurement points are almost accurate and evenly distributed (denoted as normal areas), while the performance is limited in the areas where the foreground and background points are overlapped due to occlusion (denoted as overlap areas) and the areas where there are no measurement points around (denoted as blank areas) since the methods have no reliable input information in these areas.

Paper
Code

Searching Dense Point Correspondences via Permutation Matrix Learning

no code implementations • 26 Oct 2022 • Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Bin Fan, Qi Liu

In response, this paper presents a novel end-to-end learning-based method to estimate the dense correspondence of 3D point clouds, in which the problem of point matching is formulated as a zero-one assignment problem to achieve a permutation matching matrix to implement the one-to-one principle fundamentally.

Paper
Add Code

Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds

no code implementations • 26 Oct 2022 • Zhiyuan Zhang, Yuchao Dai, Bin Fan, Jiadai Sun, Mingyi He

In this paper, we propose to learn a robust task-specific feature descriptor to consistently describe the correct point correspondence under interference.

Paper
Add Code

Linear Video Transformer with Feature Fixation

no code implementations • 15 Oct 2022 • Kaiyue Lu, Zexiang Liu, Jianyuan Wang, Weixuan Sun, Zhen Qin, Dong Li, Xuyang Shen, Hui Deng, Xiaodong Han, Yuchao Dai, Yiran Zhong

Therefore, we propose a feature fixation module to reweight the feature importance of the query and key before computing linear attention.

Feature Importance Video Classification

Paper
Add Code

Deep Idempotent Network for Efficient Single Image Blind Deblurring

no code implementations • 13 Oct 2022 • Yuxin Mao, Zhexiong Wan, Yuchao Dai, Xin Yu

Single image blind deblurring is highly ill-posed as neither the latent sharp image nor the blur kernel is known.

Single-Image Blind Deblurring

Paper
Add Code

Rolling Shutter Inversion: Bring Rolling Shutter Images to High Framerate Global Shutter Video

1 code implementation • 6 Oct 2022 • Bin Fan, Yuchao Dai, Hongdong Li

The RSSR is a very challenging task, and to our knowledge, no practical solution exists to date.

Optical Flow Estimation Super-Resolution

Paper
Code

Efficient Spatial-Temporal Information Fusion for LiDAR-Based 3D Moving Object Segmentation

1 code implementation • 5 Jul 2022 • Jiadai Sun, Yuchao Dai, Xianjing Zhang, Jintao Xu, Rui Ai, Weihao Gu, Xieyuanli Chen

We also use a point refinement module via 3D sparse convolution to fuse the information from both LiDAR range image and point cloud representations and reduce the artifacts on the borders of the objects.

Autonomous Driving Collision Avoidance +1

234

Paper
Code

Neural Deformable Voxel Grid for Fast Optimization of Dynamic View Synthesis

no code implementations • 15 Jun 2022 • Xiang Guo, GuanYing Chen, Yuchao Dai, Xiaoqing Ye, Jiadai Sun, Xiao Tan, Errui Ding

The second module contains a density and a color grid to model the geometry and density of the scene.

Novel View Synthesis

Paper
Add Code

Context-Aware Video Reconstruction for Rolling Shutter Cameras

1 code implementation • CVPR 2022 • Bin Fan, Yuchao Dai, Zhiyuan Zhang, Qi Liu, Mingyi He

Then, a refinement scheme is proposed to guide the GS frame synthesis along with bilateral occlusion masks to produce high-fidelity GS video frames at arbitrary times.

Motion Compensation Video Reconstruction

Paper
Code

Towards Deeper Understanding of Camouflaged Object Detection

1 code implementation • 23 May 2022 • Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Nick Barnes, Deng-Ping Fan

With the above understanding about camouflaged objects, we present the first triple-task learning framework to simultaneously localize, segment, and rank camouflaged objects, indicating the conspicuousness level of camouflage.

Object object-detection +1

Paper
Code

Deep Non-rigid Structure-from-Motion: A Sequence-to-Sequence Translation Perspective

no code implementations • 10 Apr 2022 • Hui Deng, Tong Zhang, Yuchao Dai, Jiawei Shi, Yiran Zhong, Hongdong Li

In this paper, we propose to model deep NRSfM from a sequence-to-sequence translation perspective, where the input 2D frame sequence is taken as a whole to reconstruct the deforming 3D non-rigid shape sequence.

3D Reconstruction Translation

Paper
Add Code

VRNet: Learning the Rectified Virtual Corresponding Points for 3D Point Cloud Registration

no code implementations • 24 Mar 2022 • Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Bin Fan, Mingyi He

3D point cloud registration is fragile to outliers, which are labeled as the points without corresponding points.

Point Cloud Registration

Paper
Add Code

A Representation Separation Perspective to Correspondences-free Unsupervised 3D Point Cloud Registration

no code implementations • 24 Mar 2022 • Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Dingfu Zhou, Xibin Song, Mingyi He

Existing correspondences-free methods generally learn the holistic representation of the entire point cloud, which is fragile for partial and noisy point clouds.

Point Cloud Registration

Paper
Add Code

Efficient Multi-View Stereo by Iterative Dynamic Cost Volume

1 code implementation • CVPR 2022 • Shaoqian Wang, Bo Li, Yuchao Dai

Specifically, a lightweight 3D CNN is utilized to generate the coarsest initial depth map which is essential to launch the GRU and guarantee a fast convergence.

Paper
Code

MUNet: Motion Uncertainty-aware Semi-supervised Video Object Segmentation

no code implementations • 29 Nov 2021 • Jiadai Sun, Yuxin Mao, Yuchao Dai, Yiran Zhong, Jianyuan Wang

The task of semi-supervised video object segmentation (VOS) has been greatly advanced and state-of-the-art performance has been made by dense matching-based methods.

Object Semantic Segmentation +2

Paper
Add Code

A General Divergence Modeling Strategy for Salient Object Detection

no code implementations • 23 Nov 2021 • Xinyu Tian, Jing Zhang, Yuchao Dai

Given multiple saliency annotations, we introduce a general divergence modeling strategy via random sampling, and apply our strategy to an ensemble based framework and three latent variable model based solutions to explore the subjective nature of saliency.

Object object-detection +2

Paper
Add Code

Dense Uncertainty Estimation via an Ensemble-based Conditional Latent Variable Model

no code implementations • 22 Nov 2021 • Jing Zhang, Yuchao Dai, Mehrtash Harandi, Yiran Zhong, Nick Barnes, Richard Hartley

Uncertainty estimation has been extensively studied in recent literature, which can usually be classified as aleatoric uncertainty and epistemic uncertainty.

Attribute object-detection +1

Paper
Add Code

End-to-end Learning the Partial Permutation Matrix for Robust 3D Point Cloud Registration

no code implementations • 28 Oct 2021 • Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Dingfu Zhou, Xibin Song, Mingyi He

Even though considerable progress has been made in deep learning-based 3D point cloud processing, how to obtain accurate correspondences for robust registration remains a major challenge because existing hard assignment methods cannot deal with outliers naturally.

Point Cloud Registration

Paper
Add Code

Dense Uncertainty Estimation

1 code implementation • 13 Oct 2021 • Jing Zhang, Yuchao Dai, Mochu Xiang, Deng-Ping Fan, Peyman Moghadam, Mingyi He, Christian Walder, Kaihao Zhang, Mehrtash Harandi, Nick Barnes

Deep neural networks can be roughly divided into deterministic neural networks and stochastic neural networks. The former is usually trained to achieve a mapping from input space to output space via maximum likelihood estimation for the weights, which leads to deterministic predictions during testing.

Decision Making

Paper
Code

RGB-D Saliency Detection via Cascaded Mutual Information Minimization

1 code implementation • ICCV 2021 • Jing Zhang, Deng-Ping Fan, Yuchao Dai, Xin Yu, Yiran Zhong, Nick Barnes, Ling Shao

In this paper, we introduce a novel multi-stage cascaded learning framework via mutual information minimization to "explicitly" model the multi-modal information between RGB image and depth data.

Ranked #5 on Thermal Image Segmentation on RGB-T-Glass-Segmentation

Saliency Detection Thermal Image Segmentation

Paper
Code

PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-rigid Structure-from-Motion

no code implementations • ICCV 2021 • Haitian Zeng, Yuchao Dai, Xin Yu, Xiaohan Wang, Yi Yang

As NRSfM is a highly under-constrained problem, we propose two new pairwise regularization to further regularize the reconstruction.

Paper
Add Code

SUNet: Symmetric Undistortion Network for Rolling Shutter Correction

1 code implementation • ICCV 2021 • Bin Fan, Yuchao Dai, Mingyi He

The vast majority of modern consumer-grade cameras employ a rolling shutter mechanism, leading to image distortions if the camera moves during image acquisition.

Rolling Shutter Correction

Paper
Code

Complementary Patch for Weakly Supervised Semantic Segmentation

1 code implementation • ICCV 2021 • Fei Zhang, Chaochen Gu, Chenyue Zhang, Yuchao Dai

Therefore, a CAM with more information related to object seeds can be obtained by narrowing down the gap between the sum of CAMs generated by the CP Pair and the original CAM.

Ranked #51 on Weakly-Supervised Semantic Segmentation on PASCAL VOC 2012 test

Segmentation Weakly supervised Semantic Segmentation +1

Paper
Code

Exploring Depth Contribution for Camouflaged Object Detection

no code implementations • 24 Jun 2021 • Mochu Xiang, Jing Zhang, Yunqiu Lv, Aixuan Li, Yiran Zhong, Yuchao Dai

In this paper, we study the depth contribution for camouflaged object detection, where the depth maps are generated with existing monocular depth estimation (MDE) methods.

Generative Adversarial Network Monocular Depth Estimation +5

Paper
Add Code

Generative Transformer for Accurate and Reliable Salient Object Detection

2 code implementations • 20 Apr 2021 • Yuxin Mao, Jing Zhang, Zhexiong Wan, Yuchao Dai, Aixuan Li, Yunqiu Lv, Xinyu Tian, Deng-Ping Fan, Nick Barnes

For the former, we apply transformer to a deterministic model, and explain that the effective structure modeling and global context modeling abilities lead to its superior performance compared with the CNN based frameworks.

Attribute Camouflaged Object Segmentation +8

Paper
Code

CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching

3 code implementations • CVPR 2021 • Zhelun Shen, Yuchao Dai, Zhibo Rao

In this paper, we propose CFNet, a Cascade and Fused cost volume based network to improve the robustness of the stereo matching network.

Disparity Estimation Stereo Matching

149

Paper
Code

Uncertainty-aware Joint Salient Object and Camouflaged Object Detection

2 code implementations • CVPR 2021 • Aixuan Li, Jing Zhang, Yunqiu Lv, Bowen Liu, Tong Zhang, Yuchao Dai

Visual salient object detection (SOD) aims at finding the salient object(s) that attract human attention, while camouflaged object detection (COD) on the contrary intends to discover the camouflaged object(s) that hidden in the surrounding.

Object object-detection +2

Paper
Code

Deep Two-View Structure-from-Motion Revisited

1 code implementation • CVPR 2021 • Jianyuan Wang, Yiran Zhong, Yuchao Dai, Stan Birchfield, Kaihao Zhang, Nikolai Smolyanskiy, Hongdong Li

Two-view structure-from-motion (SfM) is the cornerstone of 3D reconstruction and visual SLAM.

Ranked #24 on Monocular Depth Estimation on KITTI Eigen split

3D Reconstruction Monocular Depth Estimation +3

173

Paper
Code

Simultaneously Localize, Segment and Rank the Camouflaged Objects

1 code implementation • CVPR 2021 • Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, Deng-Ping Fan

With the above understanding about camouflaged objects, we present the first ranking based COD network (Rank-Net) to simultaneously localize, segment and rank camouflaged objects.

object-detection Object Detection

Paper
Code

IAFA: Instance-aware Feature Aggregation for 3D Object Detection from a Single Image

no code implementations • 5 Mar 2021 • Dingfu Zhou, Xibin Song, Yuchao Dai, Junbo Yin, Feixiang Lu, Jin Fang, Miao Liao, Liangjun Zhang

3D object detection from a single image is an important task in Autonomous Driving (AD), where various approaches have been proposed.

Ranked #19 on Monocular 3D Object Detection on KITTI Cars Moderate

Autonomous Driving Depth Estimation +2

Paper
Add Code

Inverting a Rolling Shutter Camera: Bring Rolling Shutter Images to High Framerate Global Shutter Video

no code implementations • ICCV 2021 • Bin Fan, Yuchao Dai

In this paper, we propose to invert the above RS imaging mechanism, i. e., recovering a high framerate GS video from consecutive RS images to achieve RS temporal super-resolution (RSSR).

Optical Flow Estimation Super-Resolution

Paper
Add Code

UASNet: Uncertainty Adaptive Sampling Network for Deep Stereo Matching

no code implementations • ICCV 2021 • Yamin Mao, Zhihua Liu, Weiming Li, Yuchao Dai, Qiang Wang, Yun-Tae Kim, Hong-Seok Lee

Extensive experiments show that the proposed method achieves the highest ground truth covering ratio compared with other cascade cost volume based stereo matching methods.

Stereo Matching

Paper
Add Code

Neural Image Compression via Attentional Multi-Scale Back Projection and Frequency Decomposition

no code implementations • ICCV 2021 • Ge Gao, Pei You, Rong pan, Shunyuan Han, Yuanyuan Zhang, Yuchao Dai, Hojae Lee

In recent years, neural image compression emerges as a rapidly developing topic in computer vision, where the state-of-the-art approaches now exhibit superior compression performance than their conventional counterparts.

Image Compression MS-SSIM +1

Paper
Add Code

Class Attention Network for Semantic Segmentation of Remote Sensing Images

no code implementations • 31 Dec 2020 • Zhibo Rao, Mingyi He, Yuchao Dai

In this paper, we proposed a novel class attention module and decomposition-fusion strategy to cope with imbalanced labels.

Earth Observation Scene Parsing +2

Paper
Add Code

Uncertainty-Aware Deep Calibrated Salient Object Detection

no code implementations • 10 Dec 2020 • Jing Zhang, Yuchao Dai, Xin Yu, Mehrtash Harandi, Nick Barnes, Richard Hartley

Existing deep neural network based salient object detection (SOD) methods mainly focus on pursuing high network accuracy.

Object object-detection +2

Paper
Add Code

Depth Completion using Piecewise Planar Model

no code implementations • 6 Dec 2020 • Yiran Zhong, Yuchao Dai, Hongdong Li

More specifically, we represent the desired depth map as a collection of 3D planar and the reconstruction problem is formulated as the optimization of planar parameters.

Depth Completion Visual Odometry

Paper
Add Code

Efficient Depth Completion Using Learned Bases

no code implementations • 2 Dec 2020 • Yiran Zhong, Yuchao Dai, Hongdong Li

The given sparse depth points are served as a data term to constrain the weighting process.

Depth Completion

Paper
Add Code

Displacement-Invariant Cost Computation for Efficient Stereo Matching

no code implementations • 1 Dec 2020 • Yiran Zhong, Charles Loop, Wonmin Byeon, Stan Birchfield, Yuchao Dai, Kaihao Zhang, Alexey Kamenev, Thomas Breuel, Hongdong Li, Jan Kautz

A common way to speed up the computation is to downsample the feature volume, but this loses high-frequency details.

Autonomous Driving Stereo Matching

Paper
Add Code

Displacement-Invariant Matching Cost Learning for Accurate Optical Flow Estimation

3 code implementations • NeurIPS 2020 • Jianyuan Wang, Yiran Zhong, Yuchao Dai, Kaihao Zhang, Pan Ji, Hongdong Li

Learning matching costs has been shown to be critical to the success of the state-of-the-art deep stereo matching methods, in which 3D convolutions are applied on a 4D feature volume to learn a 3D cost volume.

Optical Flow Estimation Stereo Matching

145

Paper
Code

Hierarchical Neural Architecture Search for Deep Stereo Matching

1 code implementation • NeurIPS 2020 • Xuelian Cheng, Yiran Zhong, Mehrtash Harandi, Yuchao Dai, Xiaojun Chang, Tom Drummond, Hongdong Li, ZongYuan Ge

To reduce the human efforts in neural network design, Neural Architecture Search (NAS) has been applied with remarkable success to various high-level vision tasks such as classification and semantic segmentation.

Ranked #2 on Stereo Disparity Estimation on Scene Flow

Neural Architecture Search Semantic Segmentation +3

252

Paper
Code

Novel View Synthesis from only a 6-DoF Camera Pose by Two-stage Networks

no code implementations • 22 Oct 2020 • Xiang Guo, Bo Li, Yuchao Dai, Tongxin Zhang, Hui Deng

That is, we synthesize the novel view from only a 6-DoF camera pose directly.

Generative Adversarial Network Novel View Synthesis

Paper
Add Code

PRAFlow_RVC: Pyramid Recurrent All-Pairs Field Transforms for Optical Flow Estimation in Robust Vision Challenge 2020

no code implementations • 14 Sep 2020 • Zhexiong Wan, Yuxin Mao, Yuchao Dai

Optical flow estimation is an important computer vision task, which aims at estimating the dense correspondences between two frames.

Optical Flow Estimation

Paper
Add Code

Uncertainty Inspired RGB-D Saliency Detection

4 code implementations • 7 Sep 2020 • Jing Zhang, Deng-Ping Fan, Yuchao Dai, Saeed Anwar, Fatemeh Saleh, Sadegh Aliakbarian, Nick Barnes

Our framework includes two main models: 1) a generator model, which maps the input image and latent variable to stochastic saliency prediction, and 2) an inference model, which gradually updates the latent variable by sampling it from the true or approximate posterior distribution.

Ranked #1 on RGB-D Salient Object Detection on LFSD

RGB-D Salient Object Detection RGB Salient Object Detection +1

317

Paper
Code

PCW-Net: Pyramid Combination and Warping Cost Volume for Stereo Matching

2 code implementations • 23 Jun 2020 • Zhelun Shen, Yuchao Dai, Xibin Song, Zhibo Rao, Dingfu Zhou, Liangjun Zhang

First, we construct combination volumes on the upper levels of the pyramid and develop a cost volume fusion module to integrate them for initial disparity estimation.

Disparity Estimation Domain Generalization +1

Paper
Code

Dense Non-Rigid Structure from Motion: A Manifold Viewpoint

no code implementations • 15 Jun 2020 • Suryansh Kumar, Luc van Gool, Carlos E. P. de Oliveira, Anoop Cherian, Yuchao Dai, Hongdong Li

Assuming that a deforming shape is composed of a union of local linear subspace and, span a global low-rank space over multiple frames enables us to efficiently model complex non-rigid deformations.

Clustering

Paper
Add Code

Relative Pose Estimation for Stereo Rolling Shutter Cameras

no code implementations • 14 Jun 2020 • Ke Wang, Bin Fan, Yuchao Dai

In this paper, we present a novel linear algorithm to estimate the 6 DoF relative pose from consecutive frames of stereo rolling shutter (RS) cameras.

Pose Estimation

Paper
Add Code

Channel Attention based Iterative Residual Learning for Depth Map Super-Resolution

no code implementations • CVPR 2020 • Xibin Song, Yuchao Dai, Dingfu Zhou, Liu Liu, Wei Li, Hongdng Li, Ruigang Yang

Second, we propose a new framework for real-world DSR, which consists of four modules : 1) An iterative residual learning module with deep supervision to learn effective high-frequency components of depth maps in a coarse-to-fine manner; 2) A channel attention strategy to enhance channels with abundant high-frequency components; 3) A multi-stage fusion module to effectively re-exploit the results in the coarse-to-fine process; and 4) A depth refinement module to improve the depth map by TGV regularization and input loss.

Benchmarking Depth Map Super-Resolution

Paper
Add Code

UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders

1 code implementation • CVPR 2020 • Jing Zhang, Deng-Ping Fan, Yuchao Dai, Saeed Anwar, Fatemeh Sadat Saleh, Tong Zhang, Nick Barnes

In this paper, we propose the first framework (UCNet) to employ uncertainty for RGB-D saliency detection by learning from the data labeling process.

Ranked #4 on RGB-D Salient Object Detection on LFSD

RGB-D Salient Object Detection Saliency Detection +1

174

Paper
Code

Weakly-Supervised Salient Object Detection via Scribble Annotations

1 code implementation • CVPR 2020 • Jing Zhang, Xin Yu, Aixuan Li, Peipei Song, Bowen Liu, Yuchao Dai

In this paper, we propose a weakly-supervised salient object detection model to learn saliency from such annotations.

Edge Detection Object +3

144

Paper
Code

Superpixel Soup: Monocular Dense 3D Reconstruction of a Complex Dynamic Scene

no code implementations • 19 Nov 2019 • Suryansh Kumar, Yuchao Dai, Hongdong Li

We assume that a dynamic scene can be approximated by numerous piecewise planar surfaces, where each planar surface enjoys its own rigid motion, and the global change in the scene between two frames is as-rigid-as-possible (ARAP).

3D Reconstruction

Paper
Add Code

Joint Stereo Video Deblurring, Scene Flow Estimation and Moving Object Segmentation

no code implementations • 6 Oct 2019 • Liyuan Pan, Yuchao Dai, Miaomiao Liu, Fatih Porikli, Quan Pan

Under our model, these three tasks are naturally connected and expressed as the parameter estimation of 3D scene structure and camera motion (structure and motion for the dynamic scenes).

Deblurring Scene Flow Estimation +1

Paper
Add Code

MVS^2: Deep Unsupervised Multi-view Stereo with Multi-View Symmetry

no code implementations • 30 Aug 2019 • Yuchao Dai, Zhidong Zhu, Zhibo Rao, Bo Li

The success of existing deep-learning based multi-view stereo (MVS) approaches greatly depends on the availability of large-scale supervision in the form of dense depth maps.

Benchmarking

Paper
Add Code

IoU Loss for 2D/3D Object Detection

1 code implementation • 11 Aug 2019 • Dingfu Zhou, Jin Fang, Xibin Song, Chenye Guan, Junbo Yin, Yuchao Dai, Ruigang Yang

In 2D/3D object detection task, Intersection-over-Union (IoU) has been widely employed as an evaluation metric to evaluate the performance of different detectors in the testing stage.

3D Object Detection Object +1

399

Paper
Code

Multi-scale Cross-form Pyramid Network for Stereo Matching

no code implementations • 25 Apr 2019 • Zhidong Zhu, Mingyi He, Yuchao Dai, Zhibo Rao, Bo Li

The network consists of three modules: Multi-Scale 2D local feature extraction module, Cross-form spatial pyramid module and Multi-Scale 3D Feature Matching and Fusion module.

3D Feature Matching 3D Scene Reconstruction +3

Paper
Add Code

MSDC-Net: Multi-Scale Dense and Contextual Networks for Automated Disparity Map for Stereo Matching

no code implementations • 25 Apr 2019 • Zhibo Rao, Mingyi He, Yuchao Dai, Zhidong Zhu, Bo Li, Renjie He

The multi-scale residual 3D convolution module learns the different scale geometry context from the cost volume which aggregated by the multi-scale fusion 2D convolution module.

Autonomous Driving object-detection +3

Paper
Add Code

Unsupervised Deep Epipolar Flow for Stationary or Dynamic Scenes

no code implementations • CVPR 2019 • Yiran Zhong, Pan Ji, Jianyuan Wang, Yuchao Dai, Hongdong Li

In this paper, we propose Deep Epipolar Flow, an unsupervised optical flow method which incorporates global geometric constraints into network learning.

Benchmarking Optical Flow Estimation

Paper
Add Code

Deep Stacked Hierarchical Multi-patch Network for Image Deblurring

1 code implementation • CVPR 2019 • Hongguang Zhang, Yuchao Dai, Hongdong Li, Piotr Koniusz

depth, we propose a stacked version of our multi-patch model.

Ranked #9 on Deblurring on RealBlur-R (trained on GoPro) (SSIM (sRGB) metric)

Deblurring Image Deblurring

186

Paper
Code

High Frame Rate Video Reconstruction based on an Event Camera

1 code implementation • 12 Mar 2019 • Liyuan Pan, Richard Hartley, Cedric Scheerlinck, Miaomiao Liu, Xin Yu, Yuchao Dai

Based on the abundant event data alongside a low frame rate, easily blurred images, we propose a simple yet effective approach to reconstruct high-quality and high frame rate sharp videos.

Video Generation Video Reconstruction +1

Paper
Code

Ground Plane based Absolute Scale Estimation for Monocular Visual Odometry

no code implementations • 3 Mar 2019 • Dingfu Zhou, Yuchao Dai, Hongdong Li

Recovering the absolute metric scale from a monocular camera is a challenging but highly desirable problem for monocular camera-based systems.

Monocular Visual Odometry

Paper
Add Code

Single Image Deblurring and Camera Motion Estimation with Depth Map

no code implementations • 1 Mar 2019 • Liyuan Pan, Yuchao Dai, Miaomiao Liu

Camera shake during exposure is a major problem in hand-held photography, as it causes image blur that destroys details in the captured images.~In the real world, such blur is mainly caused by both the camera motion and the complex scene structure.~While considerable existing approaches have been proposed based on various assumptions regarding the scene structure or the camera motion, few existing methods could handle the real 6 DoF camera motion.~In this paper, we propose to jointly estimate the 6 DoF camera motion and remove the non-uniform blur caused by camera motion by exploiting their underlying geometric relationships, with a single blurry image and its depth map (either direct depth measurements, or a learned depth map) as input.~We formulate our joint deblurring and 6 DoF camera motion estimation as an energy minimization problem which is solved in an alternative manner.

Deblurring Image Deblurring +1

Paper
Add Code

Dense Depth Estimation of a Complex Dynamic Scene without Explicit 3D Motion Estimation

no code implementations • 11 Feb 2019 • Suryansh Kumar, Ram Srivatsav Ghorakavi, Yuchao Dai, Hongdong Li

Given per-pixel optical flow correspondences between two consecutive frames and, the sparse depth prior for the reference frame, we show that, we can effectively recover the dense depth map for the successive frames without solving for 3D motion parameters.

Depth Estimation Motion Estimation +1

Paper
Add Code

ApolloCar3D: A Large 3D Car Instance Understanding Benchmark for Autonomous Driving

no code implementations • CVPR 2019 • Xibin Song, Peng Wang, Dingfu Zhou, Rui Zhu, Chenye Guan, Yuchao Dai, Hao Su, Hongdong Li, Ruigang Yang

Specifically, we first segment each car with a pre-trained Mask R-CNN, and then regress towards its 3D pose and shape based on a deformable 3D car model with or without using semantic keypoints.

3D Car Instance Understanding Autonomous Driving

Paper
Add Code

Phase-only Image Based Kernel Estimation for Single-image Blind Deblurring

no code implementations • 26 Nov 2018 • Liyuan Pan, Richard Hartley, Miaomiao Liu, Yuchao Dai

The image blurring process is generally modelled as the convolution of a blur kernel with a latent image.

Blind Image Deblurring Image Deblurring +1

Paper
Add Code

Bringing a Blurry Frame Alive at High Frame-Rate with an Event Camera

1 code implementation • CVPR 2019 • Liyuan Pan, Cedric Scheerlinck, Xin Yu, Richard Hartley, Miaomiao Liu, Yuchao Dai

In this paper, we propose a simple and effective approach, the \textbf{Event-based Double Integral (EDI)} model, to reconstruct a high frame-rate, sharp video from a single blurry frame and its event data.

Video Generation

Paper
Code

Stochastic Attraction-Repulsion Embedding for Large Scale Image Localization

5 code implementations • ICCV 2019 • Liu Liu, Hongdong Li, Yuchao Dai

This paper tackles the problem of large-scale image-based localization (IBL) where the spatial location of a query image is determined by finding out the most similar reference images in a large database.

Image-Based Localization Representation Learning +1

264

Paper
Code

Stereo Computation for a Single Mixture Image

no code implementations • ECCV 2018 • Yiran Zhong, Yuchao Dai, Hongdong Li

This paper proposes an original problem of \emph{stereo computation from a single mixture image}-- a challenging problem that had not been researched before.

Stereo Matching Stereo Matching Hand +1

Paper
Add Code

Deeply Supervised Depth Map Super-Resolution as Novel View Synthesis

no code implementations • 27 Aug 2018 • Xibin Song, Yuchao Dai, Xueying Qin

However, there still exist two major issues with these DCNN based depth map super-resolution methods that hinder the performance: i) The low-resolution depth maps either need to be up-sampled before feeding into the network or substantial deconvolution has to be used; and ii) The supervision (high-resolution depth maps) is only applied at the end of the network, thus it is difficult to handle large up-sampling factors, such as $\times 8, \times 16$.

Benchmarking Blocking +2

Paper
Add Code

3D Geometry-Aware Semantic Labeling of Outdoor Street Scenes

no code implementations • 13 Aug 2018 • Yiran Zhong, Yuchao Dai, Hongdong Li

This paper is concerned with the problem of how to better exploit 3D geometric information for dense semantic image labeling.

Paper
Add Code

Open-World Stereo Video Matching with Deep RNN

no code implementations • ECCV 2018 • Yiran Zhong, Hongdong Li, Yuchao Dai

Deep Learning based stereo matching methods have shown great successes and achieved top scores across different benchmarks.

Stereo Matching Stereo Matching Hand

Paper
Add Code

Occluded Joints Recovery in 3D Human Pose Estimation based on Distance Matrix

no code implementations • 30 Jul 2018 • Xiang Guo, Yuchao Dai

In this paper, we propose to address the problem of single image 3D human pose estimation with occluded measurements by exploiting the Euclidean distance matrix (EDM).

3D Human Pose Estimation

Paper
Add Code

Deep Unsupervised Saliency Detection: A Multiple Noisy Labeling Perspective

no code implementations • CVPR 2018 • Jing Zhang, Tong Zhang, Yuchao Dai, Mehrtash Harandi, Richard Hartley

Such supervision, while labor-intensive and not always possible, tends to hinder the generalization ability of the learned models.

Benchmarking Saliency Prediction +1

Paper
Add Code

Scalable Dense Non-rigid Structure-from-Motion: A Grassmannian Perspective

no code implementations • CVPR 2018 • Suryansh Kumar, Anoop Cherian, Yuchao Dai, Hongdong Li

To address these issues, in this paper, we propose a new approach for dense NRSfM by modeling the problem on a Grassmann manifold.

Paper
Add Code

Depth Map Completion by Jointly Exploiting Blurry Color Images and Sparse Depth Maps

no code implementations • 27 Nov 2017 • Liyuan Pan, Yuchao Dai, Miaomiao Liu, Fatih Porikli

In this paper, we propose to tackle the problem of depth map completion by jointly exploiting the blurry color image sequences and the sparse depth map measurements, and present an energy minimization based formulation to simultaneously complete the depth maps, estimate the scene flow and deblur the color images.

Paper
Add Code

Efficient Global 2D-3D Matching for Camera Localization in a Large-Scale 3D Map

no code implementations • ICCV 2017 • Liu Liu, Hongdong Li, Yuchao Dai

In this paper, we introduce a global method which harnesses global contextual information exhibited both within the query image and among all the 3D points in the map.

3D Feature Matching Camera Localization

Paper
Add Code

Self-Supervised Learning for Stereo Matching with Self-Improving Ability

no code implementations • 4 Sep 2017 • Yiran Zhong, Yuchao Dai, Hongdong Li

Exiting deep-learning based dense stereo matching methods often rely on ground-truth disparity maps as the training signals, which are however not always available in many situations.

Self-Supervised Learning Stereo Matching +1

Paper
Add Code

Deep Edge-Aware Saliency Detection

no code implementations • 15 Aug 2017 • Jing Zhang, Yuchao Dai, Fatih Porikli, Mingyi He

There has been profound progress in visual saliency thanks to the deep learning architectures, however, there still exist three major challenges that hinder the detection performance for scenes with complex compositions, multiple salient objects, and salient objects of diverse scales.

Descriptive Saliency Detection

Paper
Add Code

Monocular Dense 3D Reconstruction of a Complex Dynamic Scene from Two Perspective Frames

no code implementations • ICCV 2017 • Suryansh Kumar, Yuchao Dai, Hongdong Li

This paper proposes a new approach for monocular dense 3D reconstruction of a complex dynamic scene from two perspective frames.

3D Reconstruction Dynamic Reconstruction +3

Paper
Add Code

Monocular Depth Estimation with Hierarchical Fusion of Dilated CNNs and Soft-Weighted-Sum Inference

1 code implementation • 2 Aug 2017 • Bo Li, Yuchao Dai, Mingyi He

Extensive experiments on the NYU Depth V2 and KITTI datasets show the superiority of our method compared with current state-of-the-art methods.

Monocular Depth Estimation Quantization +1

Paper
Code

"Maximizing rigidity" revisited: a convex programming approach for generic 3D shape reconstruction from multiple perspective views

no code implementations • ICCV 2017 • Pan Ji, Hongdong Li, Yuchao Dai, Ian Reid

Rigid structure-from-motion (RSfM) and non-rigid structure-from-motion (NRSfM) have long been treated in the literature as separate (different) problems.

3D Reconstruction 3D Shape Reconstruction

Paper
Add Code

Pixel-variant Local Homography for Fisheye Stereo Rectification Minimizing Resampling Distortion

no code implementations • 12 Jul 2017 • Dingfu Zhou, Yuchao Dai, Hongdong Li

First, we prove that there indeed exist enough degrees of freedom to apply pixel-wise local homography for stereo rectification.

3D Reconstruction Stereo Matching +1

Paper
Add Code

Dense Non-rigid Structure-from-Motion Made Easy - A Spatial-Temporal Smoothness based Solution

no code implementations • 27 Jun 2017 • Yuchao Dai, Huizhong Deng, Mingyi He

Second, we propose to exploit the spatial smoothness by resorting to the Laplacian of the 3D non-rigid shape.

Paper
Add Code

Integrated Deep and Shallow Networks for Salient Object Detection

no code implementations • 2 Jun 2017 • Jing Zhang, Bo Li, Yuchao Dai, Fatih Porikli, Mingyi He

Then the results from deep FCNN and RBD are concatenated to feed into a shallow network to map the concatenated feature maps to saliency maps.

Object object-detection +3

Paper
Add Code

Spatial-Temporal Union of Subspaces for Multi-body Non-rigid Structure-from-Motion

no code implementations • 14 May 2017 • Suryansh Kumar, Yuchao Dai, Hongdong Li

This spatio-temporal representation not only provides competitive 3D reconstruction but also outputs robust segmentation of multiple non-rigid objects.

3D Reconstruction

Paper
Add Code

Single image depth estimation by dilated deep residual convolutional neural network and soft-weight-sum inference

1 code implementation • 27 Apr 2017 • Bo Li, Yuchao Dai, Huahui Chen, Mingyi He

This paper proposes a new residual convolutional neural network (CNN) architecture for single image depth estimation.

Depth Estimation

Paper
Code

Skeleton based action recognition using translation-scale invariant image mapping and multi-scale deep cnn

no code implementations • 19 Apr 2017 • Bo Li, Mingyi He, Xuelian Cheng, Yu-cheng Chen, Yuchao Dai

Especially on the largest and challenge NTU RGB+D, UTD-MHAD, and MSRC-12 dataset, our method outperforms other methods by a large margion, which proves the efficacy of the proposed method.

Ranked #80 on Skeleton Based Action Recognition on NTU RGB+D

Action Recognition Image Classification +3

Paper
Add Code

Skeleton Boxes: Solving skeleton based action detection with a single deep convolutional neural network

no code implementations • 19 Apr 2017 • Bo Li, Huahui Chen, Yu-cheng Chen, Yuchao Dai, Mingyi He

However, due to the difficulty in representing the 3D skeleton video and the lack of training data, action detection from streaming 3D skeleton video still lags far behind its recognition counterpart and image based object detection.

Action Detection Action Recognition +3

Paper
Add Code

Simultaneous Stereo Video Deblurring and Scene Flow Estimation

no code implementations • CVPR 2017 • Liyuan Pan, Yuchao Dai, Miaomiao Liu, Fatih Porikli

Unlike the existing approach [31] which used a pre-computed scene flow, we propose a single framework to jointly estimate the scene flow and deblur the image, where the motion cues from scene flow estimation and blur information could reinforce each other, and produce superior results than the conventional scene flow estimation or stereo deblurring methods.

Deblurring Scene Flow Estimation

Paper
Add Code

Multi-body Non-rigid Structure-from-Motion

no code implementations • 15 Jul 2016 • Suryansh Kumar, Yuchao Dai, Hongdong Li

Recent progress have extended SFM to the areas of {multi-body SFM} (where there are {multiple rigid} relative motions in the scene), as well as {non-rigid SFM} (where there is a single non-rigid, deformable object or scene).

3D Reconstruction Clustering

Paper
Add Code

Deep Depth Super-Resolution : Learning Depth Super-Resolution using Deep Convolutional Neural Network

no code implementations • 7 Jul 2016 • Xibin Song, Yuchao Dai, Xueying Qin

In this paper, we bridge up the gap and extend the success of deep convolutional neural network to depth super-resolution.

Image Super-Resolution

Paper
Add Code

Robust and Efficient Relative Pose with a Multi-camera System for Autonomous Vehicle in Highly Dynamic Environments

no code implementations • 12 May 2016 • Liu Liu, Hongdong Li, Yuchao Dai

When the solver is used in combination with RANSAC, we are able to quickly prune unpromising hypotheses, significantly improve the chance of finding inliers.

Motion Estimation

Paper
Add Code

Robust Optical Flow Estimation of Double-Layer Images under Transparency or Reflection

no code implementations • CVPR 2016 • Jiaolong Yang, Hongdong Li, Yuchao Dai, Robby T. Tan

This paper deals with a challenging, frequently encountered, yet not properly investigated problem in two-frame optical flow estimation.

Optical Flow Estimation valid

Paper
Add Code

Rolling Shutter Camera Relative Pose: Generalized Epipolar Geometry

no code implementations • CVPR 2016 • Yuchao Dai, Hongdong Li, Laurent Kneip

The vast majority of modern consumer-grade cameras employ a rolling shutter mechanism.

Paper
Add Code

Depth and Surface Normal Estimation From Monocular Images Using Regression on Deep Features and Hierarchical CRFs

no code implementations • CVPR 2015 • Bo Li, Chunhua Shen, Yuchao Dai, Anton Van Den Hengel, Mingyi He

Predicting the depth (or surface normal) of a scene from single monocular color images is a challenging task.

regression Surface Normal Estimation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.