Search Results for author: Yaonan Wang

Found 48 papers, 14 papers with code

Unsupervised Deformable Image Registration with Structural Nonparametric Smoothing

no code implementations12 Jun 2025 Hang Zhang, Xiang Chen, Renjiu Hu, Rongguang Wang, Jinwei Zhang, Min Liu, Yaonan Wang, Gaolei Li, Xinxing Cheng, Jinming Duan

Learning-based deformable image registration (DIR) accelerates alignment by amortizing traditional optimization via neural networks.

Image Registration

Reconsider the Template Mesh in Deep Learning-based Mesh Reconstruction

no code implementations21 May 2025 Fengting Zhang, Boxu Liang, Qinghao Liu, Min Liu, Xiang Chen, Yaonan Wang

Mesh reconstruction is a cornerstone process across various applications, including in-silico trials, digital twins, surgical planning, and navigation.

Deep Learning

Coordinated Energy-Trajectory Economic Model Predictive Control for Autonomous Surface Vehicles under Disturbances

no code implementations10 Mar 2025 Zhongqi Deng, YuAn Wang, Jian Huang, HUI ZHANG, Yaonan Wang

The paper proposes a novel Economic Model Predictive Control (EMPC) scheme for Autonomous Surface Vehicles (ASVs) to simultaneously address path following accuracy and energy constraints under environmental disturbances.

CPU Model Predictive Control

Prompt-driven Transferable Adversarial Attack on Person Re-Identification with Attribute-aware Textual Inversion

no code implementations27 Feb 2025 Yuan Bian, Min Liu, Yunqi Yi, Xueping Wang, Yaonan Wang

Person re-identification (re-id) models are vital in security surveillance systems, requiring transferable adversarial attacks to explore the vulnerabilities of them.

Adversarial Attack Attribute +2

Multi-Keypoint Affordance Representation for Functional Dexterous Grasping

1 code implementation27 Feb 2025 Fan Yang, Dongsheng Luo, Wenrui Chen, Jiacheng Lin, Junjie Cai, Kailun Yang, Zhiyong Li, Yaonan Wang

Additionally, we present a Keypoint-based Grasp matrix Transformation (KGT) method, ensuring spatial consistency between hand keypoints and object contact points, thus providing a direct link between visual perception and dexterous grasping actions.

Multiview Point Cloud Registration Based on Minimum Potential Energy for Free-Form Blade Measurement

no code implementations11 Feb 2025 Zijie Wu, Yaonan Wang, Yang Mo, Qing Zhu, He Xie, Haotian Wu, Mingtao Feng, Ajmal Mian

In this paper, we propose a novel global registration method that is based on the minimum potential energy (MPE) method to address these problems.

Form Point Cloud Registration

Modality Unified Attack for Omni-Modality Person Re-Identification

no code implementations22 Jan 2025 Yuan Bian, Min Liu, Yunqi Yi, Xueping Wang, Yunfeng Ma, Yaonan Wang

Specifically, we propose a novel Modality Unified Attack method to train modality-specific adversarial generators to generate AEs that effectively attack different omni-modality models.

Person Re-Identification

Feature Information Driven Position Gaussian Distribution Estimation for Tiny Object Detection

no code implementations CVPR 2025 Jinghao Bian, Mingtao Feng, Weisheng Dong, Fangfang Wu, Jianqiao Luo, Yaonan Wang, Guangming Shi

Taking the information map as prior knowledge guidance, we construct a multi-scale position gaussian distribution map prediction module, simultaneously modulating the information map and distribution map to focus on tiny objects during training.

object-detection Object Detection +1

Tacit Learning with Adaptive Information Selection for Cooperative Multi-Agent Reinforcement Learning

no code implementations20 Dec 2024 Lunjun Liu, Weilai Jiang, Yaonan Wang

In multi-agent reinforcement learning (MARL), the centralized training with decentralized execution (CTDE) framework has gained widespread adoption due to its strong performance.

Decision Making Multi-agent Reinforcement Learning

EADReg: Probabilistic Correspondence Generation with Efficient Autoregressive Diffusion Model for Outdoor Point Cloud Registration

no code implementations22 Nov 2024 Linrui Gong, Jiuming Liu, Junyi Ma, Lihao Liu, Yaonan Wang, Hesheng Wang

To address this issue, we propose a novel framework named EADReg for efficient and robust registration of LiDAR point clouds based on autoregressive diffusion models.

Point Cloud Registration

A Polarization Image Dehazing Method Based on the Principle of Physical Diffusion

no code implementations15 Nov 2024 Zhenjun Zhang, Lijun Tang, Hongjin Wang, Lilian Zhang, Yunze He, Yaonan Wang

Computer vision is increasingly used in areas such as unmanned vehicles, surveillance systems and remote sensing.

Image Dehazing

Fidelity-Imposed Displacement Editing for the Learn2Reg 2024 SHG-BF Challenge

no code implementations28 Oct 2024 Jiacheng Wang, Xiang Chen, Renjiu Hu, Rongguang Wang, Jiazheng Wang, Min Liu, Yaonan Wang, Hang Zhang

Co-examination of second-harmonic generation (SHG) and bright-field (BF) microscopy enables the differentiation of tissue components and collagen fibers, aiding the analysis of human breast and pancreatic cancer tissues.

Contrastive Learning

Unsupervised Multimodal 3D Medical Image Registration with Multilevel Correlation Balanced Optimization

1 code implementation8 Sep 2024 Jiazheng Wang, Xiang Chen, Yuxi Zhang, Min Liu, Yaonan Wang, Hang Zhang

However, due to the differences between multimodal images and intraoperative image deformation caused by tissue displacement and removal during surgery, effective registration of preoperative and intraoperative multimodal images faces significant challenges.

global-optimization Image Registration +1

Learning to Learn Transferable Generative Attack for Person Re-Identification

no code implementations6 Sep 2024 Yuan Bian, Min Liu, Xueping Wang, Yunfeng Ma, Yaonan Wang

Deep learning-based person re-identification (re-id) models are widely employed in surveillance systems and inevitably inherit the vulnerability of deep networks to adversarial attacks.

Meta-Learning Person Re-Identification

Large Scale Unsupervised Brain MRI Image Registration Solution for Learn2Reg 2024

no code implementations2 Sep 2024 Yuxi Zhang, Xiang Chen, Jiazheng Wang, Min Liu, Yaonan Wang, Dongdong Liu, Renjiu Hu, Hang Zhang

In this paper, we summarize the methods and experimental results we proposed for Task 2 in the learn2reg 2024 Challenge.

Image Registration Task 2

CVPT: Cross-Attention help Visual Prompt Tuning adapt visual task

1 code implementation27 Aug 2024 Lingyun Huang, Jianxu Mao, Yaonan Wang, Junfei Yi, Ziming Tao

These methods optimize large-scale pre-trained models for specific tasks by fine-tuning a select group of parameters.

parameter-efficient fine-tuning Visual Prompt Tuning

Learning Granularity-Aware Affordances from Human-Object Interaction for Tool-Based Functional Grasping in Dexterous Robotics

1 code implementation30 Jun 2024 Fan Yang, Wenrui Chen, Kailun Yang, Haoran Lin, Dongsheng Luo, Conghui Tang, Zhiyong Li, Yaonan Wang

To address this, we propose a granularity-aware affordance feature extraction method for locating functional affordance areas and predicting dexterous coarse gestures.

Human-Object Interaction Detection Object

Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection

no code implementations11 Jun 2024 Junfei Yi, Jianxu Mao, Tengfei Liu, Mingjie Li, Hanyu Gu, HUI ZHANG, Xiaojun Chang, Yaonan Wang

In this paper, we propose a novel feature-based distillation paradigm with knowledge uncertainty for object detection, termed "Uncertainty Estimation-Discriminative Knowledge Extraction-Knowledge Transfer (UET)", which can seamlessly integrate with existing distillation methods.

Knowledge Distillation object-detection +2

Quantum Adjoint Convolutional Layers for Effective Data Representation

no code implementations26 Apr 2024 Ren-xin Zhao, Shi Wang, Yaonan Wang

Quantum Convolutional Layer (QCL) is considered as one of the core of Quantum Convolutional Neural Networks (QCNNs) due to its efficient data feature extraction capability.

3D Object Detection from Point Cloud via Voting Step Diffusion

no code implementations21 Mar 2024 Haoran Hou, Mingtao Feng, Zijie Wu, Weisheng Dong, Qing Zhu, Yaonan Wang, Ajmal Mian

In this work, we focus on the distributional properties of point clouds and formulate the voting process as generating new points in the high-density region of the distribution of object centers.

3D Object Detection Object +2

External Knowledge Enhanced 3D Scene Generation from Sketch

no code implementations21 Mar 2024 Zijie Wu, Mingtao Feng, Yaonan Wang, He Xie, Weisheng Dong, Bo Miao, Ajmal Mian

Generating realistic 3D scenes is challenging due to the complexity of room layouts and object geometries. We propose a sketch based knowledge enhanced diffusion architecture (SEK) for generating customized, diverse, and plausible 3D scenes.

Denoising Object +1

Towards Real-World Aerial Vision Guidance with Categorical 6D Pose Tracker

1 code implementation9 Jan 2024 Jingtao Sun, Yaonan Wang, Danwei Wang

In this paper, we investigate the real-world robot task of aerial vision guidance for aerial robotics manipulation, utilizing category-level 6-DoF pose tracking.

Pose Tracking

L4D-Track: Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream

no code implementations CVPR 2024 Jingtao Sun, Yaonan Wang, Mingtao Feng, Yulan Guo, Ajmal Mian, Mike Zheng Shou

To this end we present a generic Language-to-4D modeling paradigm termed L4D-Track that tackles zero-shot 6-DoF \underline Track ing and shape reconstruction by learning pairwise implicit 3D representation and multi-level multi-modal alignment.

3D Shape Reconstruction Pose Tracking

Sketch and Text Guided Diffusion Model for Colored Point Cloud Generation

no code implementations ICCV 2023 Zijie Wu, Yaonan Wang, Mingtao Feng, He Xie, Ajmal Mian

In this paper, we propose a sketch and text guided probabilistic diffusion model for colored point cloud generation that conditions the denoising process jointly with a hand drawn sketch of the object and its textual description.

Denoising Image Generation +1

OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation

2 code implementations28 Jul 2023 Fei Teng, Jiaming Zhang, Kunyu Peng, Yaonan Wang, Rainer Stiefelhagen, Kailun Yang

To simultaneously streamline the redundant information from the light field cameras and avoid feature loss during network propagation, we present a simple yet very effective Sub-Aperture Fusion Module (SAFM).

Autonomous Driving Scene Understanding +1

Towards Anytime Optical Flow Estimation with Event Cameras

1 code implementation11 Jul 2023 Yaozu Ye, Hao Shi, Kailun Yang, Ze Wang, Xiaoting Yin, Lei Sun, Yaonan Wang, Kaiwei Wang

Event cameras respond to changes in log-brightness at the millisecond level, making them ideal for optical flow estimation.

Autonomous Driving Motion Estimation +1

A Graph Reconstruction by Dynamic Signal Coefficient for Fault Classification

no code implementations30 May 2023 Wenbin He, Jianxu Mao, Yaonan Wang, Zhe Li, Qiu Fang, Haotian Wu

To improve the performance in identifying the faults under strong noise for rotating machinery, this paper presents a dynamic feature reconstruction signal graph method, which plays the key role of the proposed end-to-end fault diagnosis model.

Fault Diagnosis feature selection +1

A Signed Subgraph Encoding Approach via Linear Optimization for Link Sign Prediction

no code implementations17 May 2023 Zhihong Fang, Shaolin Tan, Yaonan Wang

In this paper, we propose a different link sign prediction architecture call SELO (Subgraph Encoding via Linear Optimization), which obtains overall leading prediction performances compared the state-of-the-art algorithm SDGNN.

Link Sign Prediction Prediction

Learning from Pixel-Level Noisy Label : A New Perspective for Light Field Saliency Detection

1 code implementation28 Apr 2022 Mingtao Feng, Kendong Liu, Liang Zhang, Hongshan Yu, Yaonan Wang, Ajmal Mian

Saliency detection with light field images is becoming attractive given the abundant cues available, however, this comes at the expense of large-scale pixel level annotated data which is expensive to generate.

Saliency Prediction

TransKD: Transformer Knowledge Distillation for Efficient Semantic Segmentation

2 code implementations27 Feb 2022 Ruiping Liu, Kailun Yang, Alina Roitberg, Jiaming Zhang, Kunyu Peng, Huayao Liu, Yaonan Wang, Rainer Stiefelhagen

Furthermore, we introduce two optimization modules to enhance the patch embedding distillation from different perspectives: (1) Global-Local Context Mixer (GL-Mixer) extracts both global and local information of a representative embedding; (2) Embedding Assistant (EA) acts as an embedding method to seamlessly bridge teacher and student models with the teacher's number of channels.

Autonomous Driving Knowledge Distillation +3

Learning From Pixel-Level Noisy Label: A New Perspective for Light Field Saliency Detection

1 code implementation CVPR 2022 Mingtao Feng, Kendong Liu, Liang Zhang, Hongshan Yu, Yaonan Wang, Ajmal Mian

Saliency detection with light field images is becoming attractive given the abundant cues available, however, this comes at the expense of large-scale pixel level annotated data which is expensive to generate.

Saliency Prediction

Projected Sliced Wasserstein Autoencoder-based Hyperspectral Images Anomaly Detection

no code implementations20 Dec 2021 Yurong Chen, HUI ZHANG, Yaonan Wang, Q. M. Jonathan Wu, Yimin Yang

In this case, the Wasserstein distance can be calculated with the closed-form, even the prior distribution is not Gaussian.

Anomaly Detection

Free-form Description Guided 3D Visual Graph Network for Object Grounding in Point Cloud

1 code implementation ICCV 2021 Mingtao Feng, Zhen Li, Qi Li, Liang Zhang, Xiangdong Zhang, Guangming Zhu, HUI ZHANG, Yaonan Wang, Ajmal Mian

There are three main challenges in 3D object grounding: to find the main focus in the complex and diverse description; to understand the point cloud scene; and to locate the target object.

Form Object

Minimum Potential Energy of Point Cloud for Robust Global Registration

no code implementations11 Jun 2020 Zijie Wu, Yaonan Wang, Qing Zhu, Jianxu Mao, Haotian Wu, Mingtao Feng, Ajmal Mian

Different from the most existing point set registration methods which usually extract the descriptors to find correspondences between point sets, our proposed MPE alignment method is able to handle large scale raw data offset without depending on traditional descriptors extraction, whether for the local or global registration methods.

Relation Graph Network for 3D Object Detection in Point Clouds

no code implementations30 Nov 2019 Mingtao Feng, Syed Zulqarnain Gilani, Yaonan Wang, Liang Zhang, Ajmal Mian

Convolutional Neural Networks (CNNs) have emerged as a powerful strategy for most object detection tasks on 2D images.

3D Object Detection Object +3

Exploiting Global Camera Network Constraints for Unsupervised Video Person Re-identification

no code implementations27 Aug 2019 Xueping Wang, Rameswar Panda, Min Liu, Yaonan Wang, Amit K. Roy-Chowdhury

Additionally, a cross-view matching strategy followed by global camera network constraints is proposed to explore the matching relationships across the entire camera network.

Graph Matching Metric Learning +2

Partial Labeled Gastric Tumor Segmentation via patch-based Reiterative Learning

no code implementations20 Dec 2017 Yang Nan, Gianmarc Coppola, Qiaokang Liang, Kunglin Zou, Wei Sun, Dan Zhang, Yaonan Wang, Guanzhen Yu

Gastric cancer is the second leading cause of cancer-related deaths worldwide, and the major hurdle in biomedical image analysis is the determination of the cancer extent.

Image Segmentation Tumor Segmentation

Pulling back error to the hidden-node parameter technology: Single-hidden-layer feedforward network without output weight

no code implementations6 May 2014 Yimin Yang, Q. M. Jonathan Wu, Guang-Bin Huang, Yaonan Wang

SLFNs are universal approximators when at least the parameters of the networks including hidden-node parameter and output weight are exist.

Cannot find the paper you are looking for? You can Submit a new open access paper.