Search Results for author: Wenfei Yang

Found 16 papers, 9 papers with code

Structure-Aware Correspondence Learning for Relative Pose Estimation

no code implementations24 Mar 2025 Yihan Chen, Wenfei Yang, Huan Ren, Shifeng Zhang, Tianzhu Zhang, Feng Wu

Despite the success of existing 3D correspondence-based methods, the reliance on explicit feature matching suffers from small overlaps in visible regions and unreliable feature estimation for invisible regions.

Image Reconstruction Pose Estimation

State Space Model Meets Transformer: A New Paradigm for 3D Object Detection

1 code implementation International Conference on Learning Representations 2025 Chuxin Wang, Wenfei Yang, Xiang Liu, Tianzhu Zhang

To the best of our knowledge, this is the first method to model queries as system states and scene points as system inputs, which can simultaneously update scene point features and query features with linear complexity.

3D Object Detection Decoder +2

Rethinking Masked Representation Learning for 3D Point Cloud Understanding

1 code implementation IEEE Transactions on Image Processing 2024 Chuxin Wang, Yixin Zha, Jianfeng He, Wenfei Yang, Tianzhu Zhang

Recently, masked point modeling-based methods have shown significant performance improvements for point cloud understanding, yet these methods rely on overlapping grouping strategies (k-nearest neighbor algorithm) resulting in early leakage of structural information of mask groups, and overlook the semantic modeling of object components resulting in parts with the same semantics having obvious feature differences due to position differences.

3D Part Segmentation Few-Shot 3D Point Cloud Classification +1

MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting

1 code implementation10 Oct 2024 Ruijie Zhu, Yanzhe Liang, Hanzhi Chang, Jiacheng Deng, Jiahao Lu, Wenfei Yang, Tianzhu Zhang, Yongdong Zhang

Specifically, we first introduce an optical flow decoupling module that decouples optical flow into camera flow and motion flow, corresponding to camera movement and object motion respectively.

3D Reconstruction Dynamic Reconstruction +3

Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation

2 code implementations4 Sep 2024 Li Liu, Ruijie Zhu, Jiacheng Deng, Ziyang Song, Wenfei Yang, Tianzhu Zhang

Specifically, in the proposed plane guided depth generator (PGDG), we design a set of plane queries as prototypes to softly model planes in the scene and predict per-pixel plane coefficients.

Depth Prediction Monocular Depth Estimation

Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation

1 code implementation CVPR 2024 Xiao Lin, Wenfei Yang, Yuan Gao, Tianzhu Zhang

(2) The second design is a Geometric-Aware Feature Aggregation module, which can efficiently integrate the local and global geometric information into keypoint features.

6D Pose Estimation using RGB Keypoint Detection

Unifying Visual and Vision-Language Tracking via Contrastive Learning

1 code implementation20 Jan 2024 Yinchao Ma, Yuyang Tang, Wenfei Yang, Tianzhu Zhang, Jinpeng Zhang, Mengxue Kang

Single object tracking aims to locate the target object in a video sequence according to the state specified by different modal references, including the initial bounding box (BBOX), natural language (NL), or both (NL+BBOX).

Contrastive Learning Visual Grounding +2

Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Object Detection

1 code implementation ICCV 2023 Chuxin Wang, Wenfei Yang, Tianzhu Zhang

Semi-supervised 3D object detection from point cloud aims to train a detector with a small number of labeled data and a large number of unlabeled data.

3D Object Detection object-detection +1

Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection

no code implementations CVPR 2021 Wenfei Yang, Tianzhu Zhang, Xiaoyuan Yu, Tian Qi, Yongdong Zhang, Feng Wu

To alleviate this problem, we propose a novel Uncertainty Guided Collaborative Training (UGCT) strategy, which mainly includes two key designs: (1) The first design is an online pseudo label generation module, in which the RGB and FLOW streams work collaboratively to learn from each other.

Action Detection Pseudo Label

Action Unit Memory Network for Weakly Supervised Temporal Action Localization

no code implementations CVPR 2021 Wang Luo, Tianzhu Zhang, Wenfei Yang, Jingen Liu, Tao Mei, Feng Wu, Yongdong Zhang

In this paper, we present an Action Unit Memory Network (AUMN) for weakly supervised temporal action localization, which can mitigate the above two challenges by learning an action unit memory bank.

Diversity Weakly Supervised Action Localization +2

Cannot find the paper you are looking for? You can Submit a new open access paper.