1 code implementation • 31 Jul 2023 • Mengqi He, Jing Zhang, Zhaoyuan Yang, Mingyi He, Nick Barnes, Yuchao Dai
We analysis performance of semantic segmentation models wrt.
no code implementations • 10 Jul 2023 • Aixuan Li, Jing Zhang, Yunqiu Lv, Tong Zhang, Yiran Zhong, Mingyi He, Yuchao Dai
In this case, salient objects are typically non-camouflaged, and camouflaged objects are usually not salient.
no code implementations • 21 Apr 2023 • Bin Fan, Yuchao Dai, Yongduek Seo, Mingyi He
The normalized eight-point algorithm has been widely viewed as the cornerstone in two-view geometry computation, where the seminal Hartley's normalization has greatly improved the performance of the direct linear transformation algorithm.
no code implementations • CVPR 2023 • Zhibo Rao, Bangshu Xiong, Mingyi He, Yuchao Dai, Renjie He, Zhelun Shen, Xing Li
Experimental results on multi-datasets show that: (1) our method can be easily plugged into the current various stereo matching models to improve generalization performance; (2) our method can reduce the significant volatility of generalization performance among different training epochs; (3) we find that the current methods prefer to choose the best results among different training epochs as generalization performance, but it is impossible to select the best performance by ground truth in practice.
no code implementations • 26 Oct 2022 • Zhiyuan Zhang, Yuchao Dai, Bin Fan, Jiadai Sun, Mingyi He
In this paper, we propose to learn a robust task-specific feature descriptor to consistently describe the correct point correspondence under interference.
1 code implementation • CVPR 2022 • Bin Fan, Yuchao Dai, Zhiyuan Zhang, Qi Liu, Mingyi He
Then, a refinement scheme is proposed to guide the GS frame synthesis along with bilateral occlusion masks to produce high-fidelity GS video frames at arbitrary times.
no code implementations • 24 Mar 2022 • Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Dingfu Zhou, Xibin Song, Mingyi He
Existing correspondences-free methods generally learn the holistic representation of the entire point cloud, which is fragile for partial and noisy point clouds.
no code implementations • 24 Mar 2022 • Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Bin Fan, Mingyi He
3D point cloud registration is fragile to outliers, which are labeled as the points without corresponding points.
no code implementations • 28 Oct 2021 • Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Dingfu Zhou, Xibin Song, Mingyi He
Even though considerable progress has been made in deep learning-based 3D point cloud processing, how to obtain accurate correspondences for robust registration remains a major challenge because existing hard assignment methods cannot deal with outliers naturally.
1 code implementation • 13 Oct 2021 • Jing Zhang, Yuchao Dai, Mochu Xiang, Deng-Ping Fan, Peyman Moghadam, Mingyi He, Christian Walder, Kaihao Zhang, Mehrtash Harandi, Nick Barnes
Deep neural networks can be roughly divided into deterministic neural networks and stochastic neural networks. The former is usually trained to achieve a mapping from input space to output space via maximum likelihood estimation for the weights, which leads to deterministic predictions during testing.
1 code implementation • ICCV 2021 • Bin Fan, Yuchao Dai, Mingyi He
The vast majority of modern consumer-grade cameras employ a rolling shutter mechanism, leading to image distortions if the camera moves during image acquisition.
no code implementations • 25 Jan 2021 • Shivam Pathak, Mingyi He, Sergey Malinchik, Stanislav Sobolevsky
Digital sensing provides an unprecedented opportunity to assess and understand mobility.
no code implementations • 31 Dec 2020 • Zhibo Rao, Mingyi He, Yuchao Dai
In this paper, we proposed a novel class attention module and decomposition-fusion strategy to cope with imbalanced labels.
no code implementations • 1 Nov 2020 • Zhibo Rao, Mingyi He, Bo Li, Renjie He
The network architecture used in this RVC, called as NLCA-Net v2, is consists of four parts: feature extraction, cost volume construction, feature matching, and refinement, as shown in Fig.
no code implementations • 2 Jun 2020 • Yu-cheng Chen, YingLi Tian, Mingyi He
Vision-based monocular human pose estimation, as one of the most fundamental and challenging problems in computer vision, aims to obtain posture of the human body from input images or video sequences.
no code implementations • 28 May 2020 • Longlong Jing, Yu-cheng Chen, Ling Zhang, Mingyi He, YingLi Tian
By exploring the inherent multi-modality attributes of 3D objects, in this paper, we propose to jointly learn modal-invariant and view-invariant features from different modalities including image, point cloud, and mesh with heterogeneous networks for 3D data.
no code implementations • 13 Apr 2020 • Longlong Jing, Yu-cheng Chen, Ling Zhang, Mingyi He, YingLi Tian
Specifically, 2D image features of rendered images from different views are extracted by a 2D convolutional neural network, and 3D point cloud features are extracted by a graph convolution neural network.
no code implementations • 25 Apr 2019 • Zhibo Rao, Mingyi He, Yuchao Dai, Zhidong Zhu, Bo Li, Renjie He
The multi-scale residual 3D convolution module learns the different scale geometry context from the cost volume which aggregated by the multi-scale fusion 2D convolution module.
no code implementations • 25 Apr 2019 • Zhidong Zhu, Mingyi He, Yuchao Dai, Zhibo Rao, Bo Li
The network consists of three modules: Multi-Scale 2D local feature extraction module, Cross-form spatial pyramid module and Multi-Scale 3D Feature Matching and Fusion module.
no code implementations • 15 Aug 2017 • Jing Zhang, Yuchao Dai, Fatih Porikli, Mingyi He
There has been profound progress in visual saliency thanks to the deep learning architectures, however, there still exist three major challenges that hinder the detection performance for scenes with complex compositions, multiple salient objects, and salient objects of diverse scales.
1 code implementation • 2 Aug 2017 • Bo Li, Yuchao Dai, Mingyi He
Extensive experiments on the NYU Depth V2 and KITTI datasets show the superiority of our method compared with current state-of-the-art methods.
no code implementations • 27 Jun 2017 • Yuchao Dai, Huizhong Deng, Mingyi He
Second, we propose to exploit the spatial smoothness by resorting to the Laplacian of the 3D non-rigid shape.
no code implementations • 2 Jun 2017 • Jing Zhang, Bo Li, Yuchao Dai, Fatih Porikli, Mingyi He
Then the results from deep FCNN and RBD are concatenated to feed into a shallow network to map the concatenated feature maps to saliency maps.
1 code implementation • 27 Apr 2017 • Bo Li, Yuchao Dai, Huahui Chen, Mingyi He
This paper proposes a new residual convolutional neural network (CNN) architecture for single image depth estimation.
no code implementations • 19 Apr 2017 • Bo Li, Huahui Chen, Yu-cheng Chen, Yuchao Dai, Mingyi He
However, due to the difficulty in representing the 3D skeleton video and the lack of training data, action detection from streaming 3D skeleton video still lags far behind its recognition counterpart and image based object detection.
no code implementations • 19 Apr 2017 • Bo Li, Mingyi He, Xuelian Cheng, Yu-cheng Chen, Yuchao Dai
Especially on the largest and challenge NTU RGB+D, UTD-MHAD, and MSRC-12 dataset, our method outperforms other methods by a large margion, which proves the efficacy of the proposed method.
Ranked #84 on Skeleton Based Action Recognition on NTU RGB+D
no code implementations • CVPR 2015 • Bo Li, Chunhua Shen, Yuchao Dai, Anton Van Den Hengel, Mingyi He
Predicting the depth (or surface normal) of a scene from single monocular color images is a challenging task.