Search Results for author: Bin Fan

Found 39 papers, 16 papers with code

Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model

1 code implementation2 Aug 2024 Yang Jin, Lei Zhang, Shi Yan, Bin Fan, Binglu Wang

Gaze object prediction (GOP) aims to predict the category and location of the object that a human is looking at.

Object object-detection +3

ProIn: Learning to Predict Trajectory Based on Progressive Interactions for Autonomous Driving

no code implementations25 Mar 2024 Yinke Dong, Haifeng Yuan, Hongkun Liu, Wei Jing, Fangzhen Li, Hongmin Liu, Bin Fan

In this work, a progressive interaction network is proposed to enable the agent's feature to progressively focus on relevant maps, in order to better learn agents' feature representation capturing the relevant map constraints.

Autonomous Driving motion prediction

Defying Imbalanced Forgetting in Class Incremental Learning

no code implementations22 Mar 2024 Shixiong Xu, Gaofeng Meng, Xing Nie, Bolin Ni, Bin Fan, Shiming Xiang

This intriguing phenomenon, discovered in replay-based Class Incremental Learning (CIL), highlights the imbalanced forgetting of learned classes, as their accuracy is similar before the occurrence of catastrophic forgetting.

class-incremental learning Class Incremental Learning +2

Towards HDR and HFR Video from Rolling-Mixed-Bit Spikings

no code implementations CVPR 2024 Yakun Chang, Yeliduosi Xiaokaiti, Yujia Liu, Bin Fan, Zhaojun Huang, Tiejun Huang, Boxin Shi

However reconstructing HDR videos in high-speed conditions using single-bit spikings presents challenges due to the limited bit depth.

A Revisit of the Normalized Eight-Point Algorithm and A Self-Supervised Deep Solution

no code implementations21 Apr 2023 Bin Fan, Yuchao Dai, Yongduek Seo, Mingyi He

The normalized eight-point algorithm has been widely viewed as the cornerstone in two-view geometry computation, where the seminal Hartley's normalization has greatly improved the performance of the direct linear transformation algorithm.

Self-Supervised Learning

Attention Weighted Local Descriptors

1 code implementation journal 2023 Changwei Wang, Rongtao Xu, Ke Lu, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang

Local features detection and description are widely used in many vision applications with high industrial and commercial demands.

3D Reconstruction Homography Estimation +2

Free Lunch for Generating Effective Outlier Supervision

no code implementations17 Jan 2023 Sen Pei, Jiaxi Sun, Richard Yi Da Xu, Bin Fan, Shiming Xiang, Gaofeng Meng

Generally, existing approaches in dealing with out-of-distribution (OOD) detection mainly focus on the statistical difference between the features of OOD and in-distribution (ID) data extracted by the classifiers.

Out of Distribution (OOD) Detection

Building Vision Transformers with Hierarchy Aware Feature Aggregation

no code implementations ICCV 2023 Yongjie Chen, Hongmin Liu, Haoran Yin, Bin Fan

Thanks to the excellent global modeling capability of attention mechanisms, the Vision Transformer has achieved better results than ConvNet in many computer tasks.

Image Classification object-detection +2

Joint Appearance and Motion Learning for Efficient Rolling Shutter Correction

1 code implementation CVPR 2023 Bin Fan, Yuxin Mao, Yuchao Dai, Zhexiong Wan, Qi Liu

Rolling shutter correction (RSC) is becoming increasingly popular for RS cameras that are widely used in commercial and industrial applications.

Data Augmentation Decoder +1

Searching Dense Point Correspondences via Permutation Matrix Learning

no code implementations26 Oct 2022 Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Bin Fan, Qi Liu

In response, this paper presents a novel end-to-end learning-based method to estimate the dense correspondence of 3D point clouds, in which the problem of point matching is formulated as a zero-one assignment problem to achieve a permutation matching matrix to implement the one-to-one principle fundamentally.

Learning a Task-specific Descriptor for Robust Matching of 3D Point Clouds

no code implementations26 Oct 2022 Zhiyuan Zhang, Yuchao Dai, Bin Fan, Jiadai Sun, Mingyi He

In this paper, we propose to learn a robust task-specific feature descriptor to consistently describe the correct point correspondence under interference.

Context-Aware Video Reconstruction for Rolling Shutter Cameras

1 code implementation CVPR 2022 Bin Fan, Yuchao Dai, Zhiyuan Zhang, Qi Liu, Mingyi He

Then, a refinement scheme is proposed to guide the GS frame synthesis along with bilateral occlusion masks to produce high-fidelity GS video frames at arbitrary times.

Motion Compensation Video Reconstruction

VRNet: Learning the Rectified Virtual Corresponding Points for 3D Point Cloud Registration

no code implementations24 Mar 2022 Zhiyuan Zhang, Jiadai Sun, Yuchao Dai, Bin Fan, Mingyi He

3D point cloud registration is fragile to outliers, which are labeled as the points without corresponding points.

Point Cloud Registration

MTLDesc: Looking Wider to Describe Better

1 code implementation14 Mar 2022 Changwei Wang, Rongtao Xu, Yuyang Zhang, Shibiao Xu, Weiliang Meng, Bin Fan, Xiaopeng Zhang

Limited by the locality of convolutional neural networks, most existing local features description methods only learn local descriptors with local information and lack awareness of global and surrounding spatial context.

Indoor Localization Triplet

Continual Stereo Matching of Continuous Driving Scenes With Growing Architecture

1 code implementation CVPR 2022 Chenghao Zhang, Kun Tian, Bin Fan, Gaofeng Meng, Zhaoxiang Zhang, Chunhong Pan

The deep stereo models have achieved state-of-the-art performance on driving scenes, but they suffer from severe performance degradation when tested on unseen scenes.

Continual Learning RAG +1

SUNet: Symmetric Undistortion Network for Rolling Shutter Correction

1 code implementation ICCV 2021 Bin Fan, Yuchao Dai, Mingyi He

The vast majority of modern consumer-grade cameras employ a rolling shutter mechanism, leading to image distortions if the camera moves during image acquisition.

Decoder Rolling Shutter Correction

Inverting a Rolling Shutter Camera: Bring Rolling Shutter Images to High Framerate Global Shutter Video

no code implementations ICCV 2021 Bin Fan, Yuchao Dai

In this paper, we propose to invert the above RS imaging mechanism, i. e., recovering a high framerate GS video from consecutive RS images to achieve RS temporal super-resolution (RSSR).

Optical Flow Estimation Super-Resolution

Search What You Want: Barrier Panelty NAS for Mixed Precision Quantization

no code implementations ECCV 2020 Haibao Yu, Qi Han, Jianbo Li, Jianping Shi, Guangliang Cheng, Bin Fan

Learning to find an optimal mixed precision model that can preserve accuracy and satisfy the specific constraints on model size and computation is extremely challenge due to the difficult in training a mixed precision model and the huge space of all possible bit quantizations.

Quantization valid

Relative Pose Estimation for Stereo Rolling Shutter Cameras

no code implementations14 Jun 2020 Ke Wang, Bin Fan, Yuchao Dai

In this paper, we present a novel linear algorithm to estimate the 6 DoF relative pose from consecutive frames of stereo rolling shutter (RS) cameras.

Pose Estimation

AugFPN: Improving Multi-scale Feature Learning for Object Detection

2 code implementations CVPR 2020 Chaoxu Guo, Bin Fan, Qian Zhang, Shiming Xiang, Chunhong Pan

In this paper, we begin by first analyzing the design defects of feature pyramid in FPN, and then introduce a new feature pyramid architecture named AugFPN to address these problems.

Object object-detection +1

Guiding the Flowing of Semantics: Interpretable Video Captioning via POS Tag

no code implementations IJCNLP 2019 Xinyu Xiao, Lingfeng Wang, Bin Fan, Shinming Xiang, Chunhong Pan

To address these problems, we propose an Adaptive Semantic Guidance Network (ASGN), which instantiates the whole video semantics to different POS-aware semantics with the supervision of part of speech (POS) tag.

POS TAG +1

Relation-Shape Convolutional Neural Network for Point Cloud Analysis

4 code implementations CVPR 2019 Yongcheng Liu, Bin Fan, Shiming Xiang, Chunhong Pan

Specifically, the convolutional weight for local point set is forced to learn a high-level relation expression from predefined geometric priors, between a sampled point from this point set and the others.

3D Part Segmentation 3D Point Cloud Classification +2

SOSNet: Second Order Similarity Regularization for Local Descriptor Learning

2 code implementations CVPR 2019 Yurun Tian, Xin Yu, Bin Fan, Fuchao Wu, Huub Heijnen, Vassileios Balntas

Despite the fact that Second Order Similarity (SOS) has been used with significant success in tasks such as graph matching and clustering, it has not been exploited for learning local descriptors.

Clustering Graph Matching +1

Progressive Sparse Local Attention for Video object detection

no code implementations ICCV 2019 Chaoxu Guo, Bin Fan, Jie Gu, Qian Zhang, Shiming Xiang, Veronique Prinet, Chunhong Pan

Instead of relying on optical flow, this paper proposes a novel module called Progressive Sparse Local Attention (PSLA), which establishes the spatial correspondence between features across frames in a local region with progressively sparser stride and uses the correspondence to propagate features.

Object object-detection +2

Semantic Labeling in Very High Resolution Images via a Self-Cascaded Convolutional Neural Network

1 code implementation30 Jul 2018 Yongcheng Liu, Bin Fan, Lingfeng Wang, Jun Bai, Shiming Xiang, Chunhong Pan

Specifically, for confusing manmade objects, ScasNet improves the labeling coherence with sequential global-to-local contexts aggregation.

L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space

1 code implementation CVPR 2017 Yurun Tian, Bin Fan, Fuchao Wu

In this paper, we propose to learn high per- formance descriptor in Euclidean space via the Convolu- tional Neural Network (CNN).

Deep Learning Patch Matching

Do We Need Binary Features for 3D Reconstruction?

no code implementations14 Feb 2016 Bin Fan, Qingqun Kong, Wei Sui, Zhiheng Wang, Xinchao Wang, Shiming Xiang, Chunhong Pan, Pascal Fua

Binary features have been incrementally popular in the past few years due to their low memory footprints and the efficient computation of Hamming distance between binary descriptors.

3D Reconstruction

kNN Hashing With Factorized Neighborhood Representation

no code implementations ICCV 2015 Kun Ding, Chunlei Huo, Bin Fan, Chunhong Pan

Hashing is very effective for many tasks in reducing the processing time and in compressing massive databases.

Retrieval

Beyond Mahalanobis Metric: Cayley-Klein Metric Learning

no code implementations CVPR 2015 Yanhong Bi, Bin Fan, Fuchao Wu

Experiments have shown the superiority of Cayley-Klein metric over Mahalanobis ones and the effectiveness of our Cayley-Klein metric learning methods.

Metric Learning

10,000+ Times Accelerated Robust Subset Selection (ARSS)

no code implementations12 Sep 2014 Feiyun Zhu, Bin Fan, Xinliang Zhu, Ying Wang, Shiming Xiang, Chunhong Pan

Subset selection from massive data with noised information is increasingly popular for various applications.

Action Recognition Collaborative Filtering +16

Effective Spectral Unmixing via Robust Representation and Learning-based Sparsity

no code implementations2 Sep 2014 Feiyun Zhu, Ying Wang, Bin Fan, Gaofeng Meng, Chunhong Pan

Based on this observation, we exploit a learning-based sparsity method to simultaneously learn the HU results and a sparse guidance map.

Hyperspectral Unmixing

Affine Subspace Representation for Feature Description

no code implementations18 Jul 2014 Zhenhua Wang, Bin Fan, Fuchao Wu

This paper proposes a novel Affine Subspace Representation (ASR) descriptor to deal with affine distortions induced by viewpoint changes.

Structured Sparse Method for Hyperspectral Unmixing

no code implementations19 Mar 2014 Feiyun Zhu, Ying Wang, Shiming Xiang, Bin Fan, Chunhong Pan

With this constraint, our method can learn a compact space, where highly similar pixels are grouped to share correlated sparse representations.

Hyperspectral Unmixing

Spectral Unmixing via Data-guided Sparsity

no code implementations13 Mar 2014 Feiyun Zhu, Ying Wang, Bin Fan, Gaofeng Meng, Shiming Xiang, Chunhong Pan

Hyperspectral unmixing, the process of estimating a common set of spectral bases and their corresponding composite percentages at each pixel, is an important task for hyperspectral analysis, visualization and understanding.

Hyperspectral Unmixing

Cannot find the paper you are looking for? You can Submit a new open access paper.