1 code implementation • 2 Aug 2024 • Yang Jin, Lei Zhang, Shi Yan, Bin Fan, Binglu Wang
Gaze object prediction (GOP) aims to predict the category and location of the object that a human is looking at.
no code implementations • 25 Mar 2024 • Yinke Dong, Haifeng Yuan, Hongkun Liu, Wei Jing, Fangzhen Li, Hongmin Liu, Bin Fan
In this work, a progressive interaction network is proposed to enable the agent's feature to progressively focus on relevant maps, in order to better learn agents' feature representation capturing the relevant map constraints.
no code implementations • 22 Mar 2024 • Shixiong Xu, Gaofeng Meng, Xing Nie, Bolin Ni, Bin Fan, Shiming Xiang
This intriguing phenomenon, discovered in replay-based Class Incremental Learning (CIL), highlights the imbalanced forgetting of learned classes, as their accuracy is similar before the occurrence of catastrophic forgetting.
no code implementations • CVPR 2024 • Yakun Chang, Yeliduosi Xiaokaiti, Yujia Liu, Bin Fan, Zhaojun Huang, Tiejun Huang, Boxin Shi
However reconstructing HDR videos in high-speed conditions using single-bit spikings presents challenges due to the limited bit depth.
no code implementations • 21 Apr 2023 • Bin Fan, Yuchao Dai, Yongduek Seo, Mingyi He
The normalized eight-point algorithm has been widely viewed as the cornerstone in two-view geometry computation, where the seminal Hartley's normalization has greatly improved the performance of the direct linear transformation algorithm.
1 code implementation • journal 2023 • Changwei Wang, Rongtao Xu, Ke Lu, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang
Local features detection and description are widely used in many vision applications with high industrial and commercial demands.
no code implementations • 17 Jan 2023 • Sen Pei, Jiaxi Sun, Richard Yi Da Xu, Bin Fan, Shiming Xiang, Gaofeng Meng
Generally, existing approaches in dealing with out-of-distribution (OOD) detection mainly focus on the statistical difference between the features of OOD and in-distribution (ID) data extracted by the classifiers.
no code implementations • ICCV 2023 • Yongjie Chen, Hongmin Liu, Haoran Yin, Bin Fan
Thanks to the excellent global modeling capability of attention mechanisms, the Vision Transformer has achieved better results than ConvNet in many computer tasks.
1 code implementation • CVPR 2023 • Bin Fan, Yuxin Mao, Yuchao Dai, Zhexiong Wan, Qi Liu
Rolling shutter correction (RSC) is becoming increasingly popular for RS cameras that are widely used in commercial and industrial applications.
no code implementations • 26 Oct 2022 • Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Bin Fan, Qi Liu
In response, this paper presents a novel end-to-end learning-based method to estimate the dense correspondence of 3D point clouds, in which the problem of point matching is formulated as a zero-one assignment problem to achieve a permutation matching matrix to implement the one-to-one principle fundamentally.
no code implementations • 26 Oct 2022 • Zhiyuan Zhang, Yuchao Dai, Bin Fan, Jiadai Sun, Mingyi He
In this paper, we propose to learn a robust task-specific feature descriptor to consistently describe the correct point correspondence under interference.
1 code implementation • 6 Oct 2022 • Bin Fan, Yuchao Dai, Hongdong Li
The RSSR is a very challenging task, and to our knowledge, no practical solution exists to date.
1 code implementation • 1 Aug 2022 • Hu Su, Yonghao He, Rui Jiang, Jiabin Zhang, Wei Zou, Bin Fan
The dynamic smooth label is assigned to supervise the classification branch.
1 code implementation • CVPR 2022 • Bin Fan, Yuchao Dai, Zhiyuan Zhang, Qi Liu, Mingyi He
Then, a refinement scheme is proposed to guide the GS frame synthesis along with bilateral occlusion masks to produce high-fidelity GS video frames at arbitrary times.
no code implementations • 24 Mar 2022 • Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Bin Fan, Mingyi He
3D point cloud registration is fragile to outliers, which are labeled as the points without corresponding points.
1 code implementation • 14 Mar 2022 • Changwei Wang, Rongtao Xu, Yuyang Zhang, Shibiao Xu, Weiliang Meng, Bin Fan, Xiaopeng Zhang
Limited by the locality of convolutional neural networks, most existing local features description methods only learn local descriptors with local information and lack awareness of global and surrounding spatial context.
1 code implementation • CVPR 2022 • Chenghao Zhang, Kun Tian, Bin Fan, Gaofeng Meng, Zhaoxiang Zhang, Chunhong Pan
The deep stereo models have achieved state-of-the-art performance on driving scenes, but they suffer from severe performance degradation when tested on unseen scenes.
1 code implementation • ICCV 2021 • Bin Fan, Yuchao Dai, Mingyi He
The vast majority of modern consumer-grade cameras employ a rolling shutter mechanism, leading to image distortions if the camera moves during image acquisition.
no code implementations • ICCV 2021 • Bin Fan, Yuchao Dai
In this paper, we propose to invert the above RS imaging mechanism, i. e., recovering a high framerate GS video from consecutive RS images to achieve RS temporal super-resolution (RSSR).
no code implementations • ECCV 2020 • Haibao Yu, Qi Han, Jianbo Li, Jianping Shi, Guangliang Cheng, Bin Fan
Learning to find an optimal mixed precision model that can preserve accuracy and satisfy the specific constraints on model size and computation is extremely challenge due to the difficult in training a mixed precision model and the huge space of all possible bit quantizations.
no code implementations • 14 Jun 2020 • Ke Wang, Bin Fan, Yuchao Dai
In this paper, we present a novel linear algorithm to estimate the 6 DoF relative pose from consecutive frames of stereo rolling shutter (RS) cameras.
2 code implementations • 1 Mar 2020 • Yifan Chen, Han Wang, Xiaolu Sun, Bin Fan, Chu Tang
Visual attention has proven to be effective in improving the performance of person re-identification.
Ranked #11 on Person Re-Identification on DukeMTMC-reID
2 code implementations • CVPR 2020 • Chaoxu Guo, Bin Fan, Qian Zhang, Shiming Xiang, Chunhong Pan
In this paper, we begin by first analyzing the design defects of feature pyramid in FPN, and then introduce a new feature pyramid architecture named AugFPN to address these problems.
no code implementations • IJCNLP 2019 • Xinyu Xiao, Lingfeng Wang, Bin Fan, Shinming Xiang, Chunhong Pan
To address these problems, we propose an Adaptive Semantic Guidance Network (ASGN), which instantiates the whole video semantics to different POS-aware semantics with the supervision of part of speech (POS) tag.
1 code implementation • ICCV 2019 • Yongcheng Liu, Bin Fan, Gaofeng Meng, Jiwen Lu, Shiming Xiang, Chunhong Pan
Point cloud processing is very challenging, as the diverse shapes formed by irregular points are often indistinguishable.
Ranked #23 on 3D Part Segmentation on ShapeNet-Part
4 code implementations • CVPR 2019 • Yongcheng Liu, Bin Fan, Shiming Xiang, Chunhong Pan
Specifically, the convolutional weight for local point set is forced to learn a high-level relation expression from predefined geometric priors, between a sampled point from this point set and the others.
Ranked #11 on 3D Point Cloud Classification on ModelNet40-C
2 code implementations • CVPR 2019 • Yurun Tian, Xin Yu, Bin Fan, Fuchao Wu, Huub Heijnen, Vassileios Balntas
Despite the fact that Second Order Similarity (SOS) has been used with significant success in tasks such as graph matching and clustering, it has not been exploited for learning local descriptors.
no code implementations • ICCV 2019 • Chaoxu Guo, Bin Fan, Jie Gu, Qian Zhang, Shiming Xiang, Veronique Prinet, Chunhong Pan
Instead of relying on optical flow, this paper proposes a novel module called Progressive Sparse Local Attention (PSLA), which establishes the spatial correspondence between features across frames in a local region with progressively sparser stride and uses the correspondence to propagate features.
1 code implementation • 30 Jul 2018 • Yongcheng Liu, Bin Fan, Lingfeng Wang, Jun Bai, Shiming Xiang, Chunhong Pan
Specifically, for confusing manmade objects, ScasNet improves the labeling coherence with sequential global-to-local contexts aggregation.
no code implementations • 14 Dec 2017 • Bin Fan, Qingqun Kong, Xinchao Wang, Zhiheng Wang, Shiming Xiang, Chunhong Pan, Pascal Fua
To obtain a comprehensive evaluation, we choose to include both float type features and binary ones.
1 code implementation • CVPR 2017 • Yurun Tian, Bin Fan, Fuchao Wu
In this paper, we propose to learn high per- formance descriptor in Euclidean space via the Convolu- tional Neural Network (CNN).
no code implementations • 14 Feb 2016 • Bin Fan, Qingqun Kong, Wei Sui, Zhiheng Wang, Xinchao Wang, Shiming Xiang, Chunhong Pan, Pascal Fua
Binary features have been incrementally popular in the past few years due to their low memory footprints and the efficient computation of Hamming distance between binary descriptors.
no code implementations • ICCV 2015 • Kun Ding, Chunlei Huo, Bin Fan, Chunhong Pan
Hashing is very effective for many tasks in reducing the processing time and in compressing massive databases.
no code implementations • CVPR 2015 • Yanhong Bi, Bin Fan, Fuchao Wu
Experiments have shown the superiority of Cayley-Klein metric over Mahalanobis ones and the effectiveness of our Cayley-Klein metric learning methods.
no code implementations • 12 Sep 2014 • Feiyun Zhu, Bin Fan, Xinliang Zhu, Ying Wang, Shiming Xiang, Chunhong Pan
Subset selection from massive data with noised information is increasingly popular for various applications.
no code implementations • 2 Sep 2014 • Feiyun Zhu, Ying Wang, Bin Fan, Gaofeng Meng, Chunhong Pan
Based on this observation, we exploit a learning-based sparsity method to simultaneously learn the HU results and a sparse guidance map.
no code implementations • 18 Jul 2014 • Zhenhua Wang, Bin Fan, Fuchao Wu
This paper proposes a novel Affine Subspace Representation (ASR) descriptor to deal with affine distortions induced by viewpoint changes.
no code implementations • 19 Mar 2014 • Feiyun Zhu, Ying Wang, Shiming Xiang, Bin Fan, Chunhong Pan
With this constraint, our method can learn a compact space, where highly similar pixels are grouped to share correlated sparse representations.
no code implementations • 13 Mar 2014 • Feiyun Zhu, Ying Wang, Bin Fan, Gaofeng Meng, Shiming Xiang, Chunhong Pan
Hyperspectral unmixing, the process of estimating a common set of spectral bases and their corresponding composite percentages at each pixel, is an important task for hyperspectral analysis, visualization and understanding.