Search Results for author: Jianfei Yang

Found 70 papers, 22 papers with code

Mind the Discriminability: Asymmetric Adversarial Domain Adaptation

no code implementations ECCV 2020 Jianfei Yang, Han Zou, Yuxun Zhou, Zhaoyang Zeng, Lihua Xie

Adversarial domain adaptation has achieved tremendous success by learning domain-invariant feature representations.

Domain Adaptation

REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning?

no code implementations 16 May 2025 Chenxi Jiang, Chuhao Zhou, Jianfei Yang

To this end, we propose the first robot task planning benchmark with vague REs (REI-Bench), where we discover that the vagueness of REs can severely degrade robot planning performance, leading to success rate drops of up to 77.9%.

Generative Dataset Distillation using Min-Max Diffusion Model

no code implementations 24 Mar 2025 Junqiao Fan, Yunjiao Zhou, Min Chang Jordan Ren, Jianfei Yang

In this work, we leverage the popular diffusion model as the generator to compute a surrogate dataset, boosted by a min-max loss to control the dataset's diversity and representativeness during training.

Dataset Distillation Diversity

RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images and A Benchmark

1 code implementation 21 Mar 2025 Ziteng Cui, Jianfei Yang, Tatsuya Harada

Despite this, most existing visual perception methods that utilize RAW data directly integrate image signal processing (ISP) stages with subsequent network modules, often overlooking potential synergies at the model level.

Data Augmentation

Emergence of Painting Ability via Recognition-Driven Evolution

no code implementations 9 Jan 2025 Yi Lin, Lin Gu, Ziteng Cui, Shenghan Su, Yumo Hao, Yingtao Tian, Tatsuya Harada, Jianfei Yang

The palette branch learns a limited colour palette, while the stroke branch parameterises each stroke using Bézier curves to render an image, subsequently evaluated by a high-level recognition module.

Image Compression

Unsupervised UAV 3D Trajectories Estimation with Sparse Point Clouds

1 code implementation 17 Dec 2024 Hanfang Liang, Yizhuo Yang, Jinming Hu, Jianfei Yang, Fen Liu, Shenghai Yuan

Compact UAV systems, while advancing delivery and surveillance, pose significant security challenges due to their small size, which hinders detection by traditional methods.

NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries

no code implementations 14 Dec 2024 Tao Wu, Chuhao Zhou, Yen Heng Wong, Lin Gu, Jianfei Yang

Additionally, we propose a 'Self-Correction' prompting mechanism and a new evaluation metric to enhance and measure both noise detection capability and answer quality.

Benchmarking Embodied Question Answering +2

Look a Group at Once: Multi-Slide Modeling for Survival Prediction

no code implementations 18 Nov 2024 Xinyang Li, Yi Zhang, Yi Xie, Jianfei Yang, Xi Wang, Hao Chen, Haixian Zhang

In this paper, we introduce GroupMIL, a novel framework inspired by the clinical practice of collective analysis, which models multiple slides as a single sample and organizes groups of patches and slides sequentially to capture cross-slide prognostic features.

Survival Prediction

Enhancing Dataset Distillation via Label Inconsistency Elimination and Learning Pattern Refinement

1 code implementation 17 Oct 2024 Chuhao Zhou, Chenxi Jiang, Yi Xie, Haozhi Cao, Jianfei Yang

In the final evaluation, our M-DATM achieved accuracies of 0.4061 and 0.1831 on the CIFAR-100 and Tiny ImageNet datasets, ranking 1st in the Fixed Images Per Class (IPC) Track.

Dataset Distillation

X-Fi: A Modality-Invariant Foundation Model for Multimodal Human Sensing

no code implementations 14 Oct 2024 Xinyan Chen, Jianfei Yang

Human sensing, which employs various sensors and advanced deep learning technologies to accurately capture and interpret human body information, has significantly impacted fields like public security and robotics.

Human Activity Recognition Pose Estimation

Feedback Favors the Generalization of Neural ODEs

no code implementations 14 Oct 2024 Jindou Jia, Zihan Yang, Meng Wang, Kexin Guo, Jianfei Yang, Xiang Yu, Lei Guo

Inspired by the feedback philosophy, we present feedback neural networks, showing that a feedback loop can flexibly correct the learned latent dynamics of neural ordinary differential equations (neural ODEs), leading to a prominent generalization improvement.

Model Predictive Control Philosophy +1

RecurFormer: Not All Transformer Heads Need Self-Attention

no code implementations 10 Oct 2024 Ruiqing Yan, Linghan Zheng, Xingbo Du, Han Zou, Yufeng Guo, Jianfei Yang

Transformer-based large language models (LLMs) excel in modeling complex language patterns but face significant computational costs during inference, especially with long inputs due to the attention mechanism's memory overhead.

All Mamba

IoT-LLM: Enhancing Real-World IoT Task Reasoning with Large Language Models

no code implementations 3 Oct 2024 Tuo An, Yunjiao Zhou, Han Zou, Jianfei Yang

Large Language Models (LLMs) have demonstrated remarkable capabilities across textual and visual domains but often generate outputs that violate physical laws, revealing a gap in their understanding of the physical world.

Benchmarking In-Context Learning

GERA: Geometric Embedding for Efficient Point Registration Analysis

no code implementations 1 Oct 2024 Geng Li, Haozhi Cao, Mingyang Liu, Shenghai Yuan, Jianfei Yang

Point cloud registration aims to provide estimated transformations to align point clouds, which plays a crucial role in pose estimation of various navigation systems, such as surgical guidance systems and autonomous vehicles.

Autonomous Vehicles Point Cloud Registration +1

Editable Fairness: Fine-Grained Bias Mitigation in Language Models

no code implementations 7 Aug 2024 Ruizhe Chen, Yichen Li, Jianfei Yang, Joey Tianyi Zhou, Zuozhu Liu

Then, we propose a novel debiasing approach, Fairness Stamp (FAST), which enables fine-grained calibration of individual social biases.

Fairness

More Than Positive and Negative: Communicating Fine Granularity in Medical Diagnosis

no code implementations 5 Aug 2024 Xiangyu Peng, Kai Wang, Jianfei Yang, Yingying Zhu, Yang You

Specifically, we devise a division rule based on medical knowledge to divide positive cases into two subcategories, namely atypical positive and typical positive.

Medical Diagnosis

A Trustworthy AIoT-enabled Localization System via Federated Learning and Blockchain

no code implementations 8 Jul 2024 Junfei Wang, He Huang, Jingze Feng, Steven Wong, Lihua Xie, Jianfei Yang

There is a significant demand for indoor localization technology in smart buildings, and the most promising solution in this field is using RF sensors and fingerprinting-based methods that employ machine learning models trained on crowd-sourced user data gathered from IoT devices.

Federated Learning Indoor Localization

Salient Sparse Visual Odometry With Pose-Only Supervision

no code implementations 6 Apr 2024 Siyu Chen, Kangcheng Liu, Chen Wang, Shenghai Yuan, Jianfei Yang, Lihua Xie

Visual Odometry (VO) is vital for the navigation of autonomous systems, providing accurate position and orientation estimates at reasonable costs.

Optical Flow Estimation Visual Odometry

Diffusion Model is a Good Pose Estimator from 3D RF-Vision

no code implementations 24 Mar 2024 Junqiao Fan, Jianfei Yang, Yuecong Xu, Lihua Xie

However, the mmWave radar has a limited resolution with severe noise, leading to inaccurate and inconsistent human pose estimation.

Pose Estimation

Compact 3D Gaussian Splatting For Dense Visual SLAM

no code implementations 17 Mar 2024 Tianchen Deng, Yaohui Chen, Leyan Zhang, Jianfei Yang, Shenghai Yuan, Jiuming Liu, Danwei Wang, Hesheng Wang, Weidong Chen

Recent work has shown that 3D Gaussian-based SLAM enables high-quality reconstruction, accurate pose estimation, and real-time rendering of scenes.

Pose Estimation

Reliable Spatial-Temporal Voxels For Multi-Modal Test-Time Adaptation

1 code implementation 11 Mar 2024 Haozhi Cao, Yuecong Xu, Jianfei Yang, Pengyu Yin, Xingyu Ji, Shenghai Yuan, Lihua Xie

Motivated by the fact that reliable predictions should be consistent with their spatial-temporal correspondences, Latte aggregates consecutive frames in a sliding-window manner and constructs Spatial-Temporal (ST) voxels to capture temporally local prediction consistency for each modality.

Test-time Adaptation

PowerSkel: A Device-Free Framework Using CSI Signal for Human Skeleton Estimation in Power Station

1 code implementation 4 Mar 2024 Cunyi Yin, Xiren Miao, Jing Chen, Hao Jiang, Jianfei Yang, Yunjiao Zhou, Min Wu, Zhenghua Chen

WiFi-based human pose estimation is a suitable method for monitoring power operations due to its low cost, device-free operation, and robustness to various illumination conditions. In this paper, a novel Channel State Information (CSI)-based pose estimation framework, namely PowerSkel, is developed to address these challenges.

Knowledge Distillation Pose Estimation

MaskFi: Unsupervised Learning of WiFi and Vision Representations for Multimodal Human Activity Recognition

no code implementations 29 Feb 2024 Jianfei Yang, Shijie Tang, Yuecong Xu, Yunjiao Zhou, Lihua Xie

Benefiting from our unsupervised learning procedure, the network requires only a small amount of annotated data for finetuning and can adapt to the new environment with better performance.

Human Activity Recognition Representation Learning

MMAUD: A Comprehensive Multi-Modal Anti-UAV Dataset for Modern Miniature Drone Threats

1 code implementation 6 Feb 2024 Shenghai Yuan, Yizhuo Yang, Thien Hoang Nguyen, Thien-Minh Nguyen, Jianfei Yang, Fen Liu, Jianping Li, Han Wang, Lihua Xie

In response to the evolving challenges posed by small unmanned aerial vehicles (UAVs), which possess the potential to transport harmful payloads or independently cause damage, we introduce MMAUD: a comprehensive Multi-Modal Anti-UAV Dataset.

TENT: Connect Language Models with IoT Sensors for Zero-Shot Activity Recognition

no code implementations 14 Nov 2023 Yunjiao Zhou, Jianfei Yang, Han Zou, Lihua Xie

Through the IoT-language contrastive learning, we derive a unified semantic feature space that aligns multi-modal features with language embeddings, so that the IoT data corresponds to specific words that describe the IoT data.

Contrastive Learning Human Activity Recognition

AdaPose: Towards Cross-Site Device-Free Human Pose Estimation with Commodity WiFi

no code implementations 29 Sep 2023 Yunjiao Zhou, Jianfei Yang, He Huang, Lihua Xie

The results demonstrate the effectiveness and robustness of AdaPose in eliminating domain shift, thereby facilitating the widespread application of WiFi-based pose estimation in smart cities.

Domain Adaptation Pose Estimation

Fully-Connected Spatial-Temporal Graph for Multivariate Time-Series Data

1 code implementation 11 Sep 2023 Yucheng Wang, Yuecong Xu, Jianfei Yang, Min Wu, XiaoLi Li, Lihua Xie, Zhenghua Chen

For graph construction, we design a decay graph to connect sensors across all timestamps based on their temporal distances, enabling us to fully model the ST dependencies by considering the correlations between DEDT.

graph construction Graph Neural Network +1

Graph-Aware Contrasting for Multivariate Time-Series Classification

1 code implementation 11 Sep 2023 Yucheng Wang, Yuecong Xu, Jianfei Yang, Min Wu, XiaoLi Li, Lihua Xie, Zhenghua Chen

As MTS data typically originate from multiple sensors, ensuring spatial consistency becomes essential for the overall performance of contrastive learning on MTS data.

Classification Contrastive Learning +3

Can We Evaluate Domain Adaptation Models Without Target-Domain Labels?

no code implementations 30 May 2023 Jianfei Yang, Hanjie Qian, Yuecong Xu, Kai Wang, Lihua Xie

Unsupervised domain adaptation (UDA) involves adapting a model trained on a label-rich source domain to an unlabeled target domain.

Unsupervised Domain Adaptation

Augmenting and Aligning Snippets for Few-Shot Video Domain Adaptation

no code implementations ICCV 2023 Yuecong Xu, Jianfei Yang, Yunjiao Zhou, Zhenghua Chen, Min Wu, XiaoLi Li

We thus consider a more realistic Few-Shot Video-based Domain Adaptation (FSVDA) scenario where we adapt video models with only a few target video samples.

Action Recognition Unsupervised Domain Adaptation

Confidence Attention and Generalization Enhanced Distillation for Continuous Video Domain Adaptation

no code implementations 18 Mar 2023 Xiyu Wang, Yuecong Xu, Jianfei Yang, Bihan Wen, Alex C. Kot

The second module compares the outputs of augmented data from the current model to the outputs of weakly augmented data from the source model, forming a novel consistency regularization on the model to alleviate the accumulation of prediction errors.

Autonomous Driving Self-Knowledge Distillation +1

Video Unsupervised Domain Adaptation with Deep Learning: A Comprehensive Survey

1 code implementation 17 Nov 2022 Yuecong Xu, Haozhi Cao, Zhenghua Chen, XiaoLi Li, Lihua Xie, Jianfei Yang

Video analysis tasks such as action recognition have received increasing research interest with growing applications in fields such as smart healthcare, thanks to the introduction of large-scale datasets and deep learning-based representations.

Action Recognition Deep Learning +2

AirFi: Empowering WiFi-based Passive Human Gesture Recognition to Unseen Environment via Domain Generalization

no code implementations 21 Sep 2022 Dazhuo Wang, Jianfei Yang, Wei Cui, Lihua Xie, Sumei Sun

AirFi is a novel domain generalization framework that learns the critical part of CSI regardless of the environment and generalizes the model to unseen scenarios, without requiring any data to be collected for adaptation to the new environment.

Domain Generalization Few-Shot Learning +1

GaitFi: Robust Device-Free Human Identification via WiFi and Vision Multimodal Learning

1 code implementation 30 Aug 2022 Lang Deng, Jianfei Yang, Shenghai Yuan, Han Zou, Chris Xiaoxuan Lu, Lihua Xie

As an important biomarker for human identification, human gait can be collected at a distance by passive sensors without subject cooperation, which plays an essential role in crime prevention, security detection and other human identification applications.

Gait Recognition Retrieval +1

MetaFi: Device-Free Pose Estimation via Commodity WiFi for Metaverse Avatar Simulation

no code implementations 22 Aug 2022 Jianfei Yang, Yunjiao Zhou, He Huang, Han Zou, Lihua Xie

An avatar is a representative of a physical user in the virtual world that can engage in different activities and interact with other objects in the metaverse.

Pose Estimation

Leveraging Endo- and Exo-Temporal Regularization for Black-box Video Domain Adaptation

no code implementations 10 Aug 2022 Yuecong Xu, Jianfei Yang, Haozhi Cao, Min Wu, XiaoLi Li, Lihua Xie, Zhenghua Chen

To enable video models to be applied seamlessly across video tasks in different environments, various Video Unsupervised Domain Adaptation (VUDA) methods have been proposed to improve the robustness and transferability of video models.

Action Recognition Unsupervised Domain Adaptation

Divide to Adapt: Mitigating Confirmation Bias for Domain Adaptation of Black-Box Predictors

1 code implementation 28 May 2022 Jianfei Yang, Xiangyu Peng, Kai Wang, Zheng Zhu, Jiashi Feng, Lihua Xie, Yang You

Domain Adaptation of Black-box Predictors (DABP) aims to learn a model on an unlabeled target domain supervised by a black-box predictor trained on a source domain.

Domain Adaptation Knowledge Distillation

Reliable Label Correction is a Good Booster When Learning with Extremely Noisy Labels

1 code implementation 30 Apr 2022 Kai Wang, Xiangyu Peng, Shuo Yang, Jianfei Yang, Zheng Zhu, Xinchao Wang, Yang You

This paradigm, however, is prone to significant degeneration under heavy label noise, as the number of clean samples is too small for conventional methods to behave well.

Learning with noisy labels

Calibrating Class Weights with Multi-Modal Information for Partial Video Domain Adaptation

no code implementations 13 Apr 2022 Xiyu Wang, Yuecong Xu, Kezhi Mao, Jianfei Yang

It utilizes a novel class weight calibration method to alleviate the negative transfer caused by incorrect class weights.

Domain Adaptation Video Classification

AutoFi: Towards Automatic WiFi Human Sensing via Geometric Self-Supervised Learning

1 code implementation 12 Apr 2022 Jianfei Yang, Xinyan Chen, Han Zou, Dazhuo Wang, Lihua Xie

AutoFi transfers knowledge from randomly collected CSI samples to human gait recognition and achieves state-of-the-art performance.

Activity Recognition Domain Adaptation +4

EfficientFi: Towards Large-Scale Lightweight WiFi Sensing via CSI Compression

no code implementations 8 Apr 2022 Jianfei Yang, Xinyan Chen, Han Zou, Dazhuo Wang, Qianwen Xu, Lihua Xie

WiFi technology has been deployed in a wide range of places due to the increasing demand for high-speed Internet access.

Cloud Computing Edge-computing +2

SecureSense: Defending Adversarial Attack for Secure Device-Free Human Activity Recognition

no code implementations 4 Apr 2022 Jianfei Yang, Han Zou, Lihua Xie

The results validate that our method works well on wireless human activity recognition and person identification systems.

Adversarial Attack Human Activity Recognition +1

AI-enabled Automatic Multimodal Fusion of Cone-Beam CT and Intraoral Scans for Intelligent 3D Tooth-Bone Reconstruction and Clinical Applications

no code implementations 11 Mar 2022 Jin Hao, Jiaxiang Liu, Jin Li, Wei Pan, Ruizhe Chen, Huimin Xiong, Kaiwei Sun, Hangzheng Lin, Wanlu Liu, Wanghui Ding, Jianfei Yang, Haoji Hu, Yueling Zhang, Yang Feng, Zeyu Zhao, Huikai Wu, Youyi Zheng, Bing Fang, Zuozhu Liu, Zhihe Zhao

Here, we present a Deep Dental Multimodal Analysis (DDMA) framework consisting of a CBCT segmentation model, an intraoral scan (IOS) segmentation model (the most accurate digital dental model), and a fusion model to generate 3D fused crown-root-bone structures with high fidelity and accurate occlusal and dentition information.

Segmentation

Source-free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition

1 code implementation 9 Mar 2022 Yuecong Xu, Jianfei Yang, Haozhi Cao, Keyu Wu, Wu Min, Zhenghua Chen

Video-based Unsupervised Domain Adaptation (VUDA) methods improve the robustness of video models, enabling them to be applied to action recognition tasks across different environments.

Action Recognition Source-Free Domain Adaptation +1

Going Deeper into Recognizing Actions in Dark Environments: A Comprehensive Benchmark Study

no code implementations 19 Feb 2022 Yuecong Xu, Jianfei Yang, Haozhi Cao, Jianxiong Yin, Zhenghua Chen, XiaoLi Li, Zhengguo Li, Qianwen Xu

While action recognition (AR) has gained large improvements with the introduction of large-scale video datasets and the development of deep neural networks, AR models robust to challenging environments in real-world scenarios are still under-explored.

Action Recognition Autonomous Driving

Shuffle Augmentation of Features from Unlabeled Data for Unsupervised Domain Adaptation

no code implementations 28 Jan 2022 Changwei Xu, Jianfei Yang, Haoran Tang, Han Zou, Cheng Lu, Tianshuo Zhang

Unsupervised Domain Adaptation (UDA), a branch of transfer learning where labels for target samples are unavailable, has been widely researched and developed in recent years with the help of adversarially trained models.

Transfer Learning Unsupervised Domain Adaptation

Towards Realistic Visual Dubbing with Heterogeneous Sources

no code implementations 17 Jan 2022 Tianyi Xie, Liucheng Liao, Cheng Bi, Benlai Tang, Xiang Yin, Jianfei Yang, Mingjie Wang, Jiali Yao, Yang Zhang, Zejun Ma

The task of few-shot visual dubbing focuses on synchronizing the lip movements with arbitrary speech input for any talking head video.

Disentanglement Talking Head Generation

Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation

no code implementations CVPR 2022 Minghui Hu, Yujie Wang, Tat-Jen Cham, Jianfei Yang, P. N. Suganthan

We show that with the help of a content-rich discrete visual codebook from VQ-VAE, the discrete diffusion model can also generate high fidelity images with global context, which compensates for the deficiency of the classical autoregressive model along pixel space.

Denoising Image Inpainting +1

Self-Supervised Video Representation Learning by Video Incoherence Detection

no code implementations 26 Sep 2021 Haozhi Cao, Yuecong Xu, Jianfei Yang, Kezhi Mao, Lihua Xie, Jianxiong Yin, Simon See

This paper introduces a novel self-supervised method that leverages incoherence detection for video representation learning.

Action Recognition Contrastive Learning +3

Multi-Source Video Domain Adaptation with Temporal Attentive Moment Alignment

no code implementations 21 Sep 2021 Yuecong Xu, Jianfei Yang, Haozhi Cao, Keyu Wu, Min Wu, Rui Zhao, Zhenghua Chen

Multi-Source Domain Adaptation (MSDA) is a more practical domain adaptation scenario for real-world applications.

Unsupervised Domain Adaptation

Partial Video Domain Adaptation with Partial Adversarial Temporal Attentive Network

no code implementations ICCV 2021 Yuecong Xu, Jianfei Yang, Haozhi Cao, Qi Li, Kezhi Mao, Zhenghua Chen

For videos, such negative transfer could be triggered by both spatial and temporal features, which leads to a more challenging Partial Video Domain Adaptation (PVDA) problem.

Partial Domain Adaptation

Suppressing Mislabeled Data via Grouping and Self-Attention

1 code implementation ECCV 2020 Xiaojiang Peng, Kai Wang, Zhaoyang Zeng, Qing Li, Jianfei Yang, Yu Qiao

Specifically, this plug-and-play AFM first leverages a group-to-attend module to construct groups and assign attention weights for group-wise samples, and then uses a mixup module with the attention weights to interpolate massive noise-suppressed samples.

Image Classification

PNL: Efficient Long-Range Dependencies Extraction with Pyramid Non-Local Module for Action Recognition

no code implementations 9 Jun 2020 Yuecong Xu, Haozhi Cao, Jianfei Yang, Kezhi Mao, Jianxiong Yin, Simon See

Empirical results prove the effectiveness and efficiency of our PNL module, which achieves state-of-the-art performance of 83.09% on the Mini-Kinetics dataset, with decreased computation cost compared to the non-local block.

Action Recognition

ARID: A New Dataset for Recognizing Action in the Dark

1 code implementation 6 Jun 2020 Yuecong Xu, Jianfei Yang, Haozhi Cao, Kezhi Mao, Jianxiong Yin, Simon See

We address the lack of data for this task by collecting a new dataset: the Action Recognition in the Dark (ARID) dataset.

Action Recognition

Towards Stable and Comprehensive Domain Alignment: Max-Margin Domain-Adversarial Training

no code implementations ICLR 2020 Jianfei Yang, Han Zou, Yuxun Zhou, Lihua Xie

The proposed MDAT stabilizes the gradient reversing in ARN by replacing the domain classifier with a reconstruction network, and in this manner ARN conducts both feature-level and pixel-level domain alignment without involving extra network structures.

Domain Adaptation Model Selection

Suppressing Uncertainties for Large-Scale Facial Expression Recognition

2 code implementations CVPR 2020 Kai Wang, Xiaojiang Peng, Jianfei Yang, Shijian Lu, Yu Qiao

Annotating a qualitative large-scale facial expression dataset is extremely difficult due to the uncertainties caused by ambiguous facial expressions, low-quality facial images, and the subjectiveness of annotators.

Facial Expression Recognition Facial Expression Recognition (FER)

Bootstrap Model Ensemble and Rank Loss for Engagement Intensity Regression

no code implementations 8 Jul 2019 Kai Wang, Jianfei Yang, Da Guo, Kaipeng Zhang, Xiaojiang Peng, Yu Qiao

Based on our winning solution from last year, we mainly explore head features and body features with a bootstrap strategy and two novel loss functions in this paper.

regression

Kervolutional Neural Networks

6 code implementations CVPR 2019 Chen Wang, Jianfei Yang, Lihua Xie, Junsong Yuan

Convolutional neural networks (CNNs) have enabled state-of-the-art performance in many computer vision tasks.
