no code implementations • 12 Jun 2025 • Yuhang Zhang, Haosheng Yu, Jiaping Xiao, Mir Feroskhan
Moreover, real-world VLN tasks in indoor and outdoor environments under direct and indirect instructions demonstrate that VLFly achieves robust open-vocabulary goal understanding and generalized navigation capabilities, even in the presence of abstract language input.
1 code implementation • 27 May 2025 • Jiaping Xiao, Cheng Wen Tsao, Yuhang Zhang, Mir Feroskhan
Path planning is a critical component in autonomous drone operations, enabling safe and efficient navigation through complex environments.
no code implementations • 11 May 2025 • Binbin Wei, Yuhang Zhang, Shishun Tian, Muxin Liao, Wei Li, Wenbin Zou
Hence, we propose a novel framework, namely Depth-Sensitive Soft Suppression with RGB-D inter-modal stylization flow (DSSS), focusing on learning domain-invariant features from depth maps for domain generalization (DG) semantic segmentation.
no code implementations • 14 Mar 2025 • Yixiao Sun, Haitian Xie, Yuhang Zhang
Our integrated approach provides a doubly robust identification strategy for causal effects in panel data with a group structure, identifying the average treatment effect on the treated (ATT) under either the parallel trends assumption or the group-level SC assumption.
1 code implementation • 2 Mar 2025 • Yuhang Zhang, Zhiyao Zhang, Junyi Ji, Marcos Quiñones-Grueiro, William Barbour, Derek Gloudemans, Gergely Zachár, Clay Weston, Gautam Biswas, Daniel B. Work
We evaluate the performance of the MARL-based algorithm in comparison to a previously deployed non-RL VSL benchmark algorithm on I-24.
no code implementations • 1 Feb 2025 • Jie Ren, Yuhang Zhang, Dongrui Liu, Xiaopeng Zhang, Qi Tian
Direct preference optimization (DPO) has shown success in aligning diffusion models with human preference.
no code implementations • 13 Jan 2025 • Yuhang Zhang, Joshua Maraval, Zhengyu Zhang, Nicolas Ramin, Shishun Tian, Lu Zhang
To address these challenges, we conducted two subjective experiments for the quality assessment of NVS technologies containing both GS-based and NeRF-based methods, focusing on dynamic and real-world scenes.
no code implementations • 2 Jan 2025 • Lixiong Qin, Shilong Ou, Miaoxuan Zhang, Jiangning Wei, Yuhang Zhang, Xiaoshuai Song, Yuchen Liu, Mei Wang, Weiran Xu
Faces and humans are crucial elements in social interaction and are widely included in everyday photos and videos.
no code implementations • 29 Nov 2024 • Yuhang Zhang, Yuan Zhou, Zeyu Liu, Yuxuan Cai, Qiuyue Wang, Aidong Men, Huan Yang
Current methods for generating human motion videos rely on extracting pose sequences from reference videos, which restricts flexibility and control.
no code implementations • 6 Nov 2024 • Tao Liu, Wu Yang, Chen Xu, Jiguang Lv, Huanran Wang, Yuhang Zhang, Shuchun Xu, Dapeng Man
We propose a more practical threat model for federated learning: the distributed multi-target backdoor.
1 code implementation • 1 Sep 2024 • Xiuqi Zheng, Yuhang Zhang, Haoran Zhang, Hongrui Liang, Xueqi Bao, Zhuqing Jiang, Qicheng Lao
Adapting large pre-trained foundation models, e.g., SAM, for medical image segmentation remains a significant challenge.
1 code implementation • 20 Aug 2024 • Yuhang Zhang, Xiuqi Zheng, Chenyi Liang, Jiani Hu, Weihong Deng
To preserve the generalization ability of CLIP and the high precision of the FER model, we design a novel approach that learns sigmoid masks based on the fixed CLIP face features to extract expression features.
no code implementations • 20 Jul 2024 • Zhiyao Zhang, George Gunter, Marcos Quinones-Grueiro, Yuhang Zhang, William Barbour, Gautam Biswas, Daniel Work
If necessary, a temporary phase re-service is inserted before the next regular phase.
no code implementations • 17 Jul 2024 • Huiguo He, Huan Yang, Zixi Tuo, Yuan Zhou, Qiuyue Wang, Yuhang Zhang, Zeyu Liu, Wenhao Huang, Hongyang Chao, Jian Yin
DreamStory consists of (1) an LLM acting as a story director and (2) an innovative Multi-Subject consistent Diffusion model (MSD) for generating consistent multiple subjects across the images.
1 code implementation • 21 Jun 2024 • Austin Coursey, Junyi Ji, Marcos Quinones-Grueiro, William Barbour, Yuhang Zhang, Tyler Derr, Gautam Biswas, Daniel B. Work
In this paper, we introduce the first large-scale lane-level freeway traffic dataset for anomaly detection.
2 code implementations • 9 May 2024 • Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xi, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, Hongsheng Li
Sora unveils the potential of scaling Diffusion Transformer for generating photorealistic images and videos at arbitrary resolutions, aspect ratios, and durations, yet it still lacks sufficient implementation details.
1 code implementation • 26 Apr 2024 • Tao Liu, Yuhang Zhang, Zhu Feng, Zhiqin Yang, Chen Xu, Dapeng Man, Wu Yang
The trained backdoored global model is more resilient to benign updates, leading to a higher attack success rate on the test set.
1 code implementation • 14 Mar 2024 • Lixiong Qin, Mei Wang, Xuannan Liu, Yuhang Zhang, Wei Deng, Xiaoshuai Song, Weiran Xu, Weihong Deng
This design enhances the unification of model structure while improving application efficiency in terms of storage overhead.
no code implementations • 23 Jan 2024 • Yuhang Zhang, Yue Yao, Xuannan Liu, Lixiong Qin, Wenjing Wang, Weihong Deng
Facial expression recognition (FER) models are typically trained on datasets with a fixed number of seven basic classes.
Facial Expression Recognition
Facial Expression Recognition (FER)
1 code implementation • 12 Jan 2024 • Bowen Shi, Peisen Zhao, Zichen Wang, Yuhang Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian, Xiaopeng Zhang
Vision-language foundation models, represented by Contrastive Language-Image Pre-training (CLIP), have gained increasing attention for jointly understanding both vision and textual tasks.
Ranked #1 on Open Vocabulary Panoptic Segmentation on ADE20K
Open Vocabulary Panoptic Segmentation
Open Vocabulary Semantic Segmentation
no code implementations • 28 Dec 2023 • Yuhang Zhang, Yuang Deng, Xiaopeng Zhang, Jie Li, Robert C. Qiu, Qi Tian
In DeLR, the query is region-level, and we annotate only the object region that is queried; 2) Instead of directly providing both localization and recognition annotations, we query the two components separately, thus reducing the recognition budget with the pseudo class labels provided by the model.
no code implementations • 22 Dec 2023 • Xuannan Liu, Yaoyao Zhong, Xing Cui, Yuhang Zhang, Peipei Li, Weihong Deng
This strategy initially focuses on adapting the masks to the unique individual faces via image-specific training and then enhances their feature-level generalization ability to diverse facial variations of individuals via person-specific training.
no code implementations • 8 Dec 2023 • Jiaping Xiao, Rangya Zhang, Yuhang Zhang, Mir Feroskhan
Drones as advanced cyber-physical systems are undergoing a transformative shift with the advent of vision-based learning, a field that is rapidly gaining prominence due to its profound impact on drone autonomy and functionality.
no code implementations • 18 Oct 2023 • Yuhang Zhang, Marcos Quinones-Grueiro, Zhiyao Zhang, Yanbing Wang, William Barbour, Gautam Biswas, Daniel Work
Variable Speed Limit (VSL) control acts as a promising highway traffic management strategy with worldwide deployment, which can enhance traffic safety by dynamically adjusting speed limits according to real-time traffic conditions.
no code implementations • 12 Oct 2023 • Zhao Ning Zou, Yuhang Zhang, Robert Wijaya
We studied this issue by measuring the performance of DETR in different experiments and benchmarking the network against convolutional neural network (CNN) based detectors like YOLO and Faster-RCNN.
no code implementations • 28 Sep 2023 • Yuhang Zhang, Yue Liu, Zhihua Zhang
Motivated by the synthetic control method, we construct a synthetic treatment group for the target population by a weighted mixture of treatment groups of source populations.
1 code implementation • 27 Sep 2023 • Wenjie Li, Mei Wang, Kai Zhang, Juncheng Li, Xiaoming Li, Yuhang Zhang, Guangwei Gao, Weihong Deng, Chia-Wen Lin
We also discuss notable benchmarks commonly utilized in the field.
1 code implementation • 25 Sep 2023 • Muxin Liao, Shishun Tian, Yuhang Zhang, Guoguang Hua, Wenbin Zou, Xia Li
Based on these observations, a calibration-based dual prototypical contrastive learning (CDPCL) approach is proposed to reduce the domain discrepancy between the learned class-wise features and the prototypes of different domains for domain generalization semantic segmentation.
1 code implementation • journal 2023 • Saining Zhang, Yuhang Zhang, Ye Zhang, YuFei Wang, Zhigang Song
In recent years, facial expression recognition (FER) has garnered significant attention within the realm of computer vision research.
Ranked #2 on Facial Expression Recognition (FER) on AffectNet
Facial Expression Recognition
Facial Expression Recognition (FER)
1 code implementation • ICCV 2023 • Xuannan Liu, Yaoyao Zhong, Yuhang Zhang, Lixiong Qin, Weihong Deng
Deep neural networks are vulnerable to universal adversarial perturbation (UAP), an instance-agnostic perturbation capable of fooling the target model for most samples.
no code implementations • 17 Jun 2023 • Suyash C. Vishnoi, Junyi Ji, MirSaleh Bahavarnia, Yuhang Zhang, Ahmad F. Taha, Christian G. Claudel, Daniel B. Work
The effectiveness of the proposed traffic control algorithms is tested using a traffic control example and compared with existing proportional-integral (PI) and model predictive control (MPC) based controllers from the literature.
1 code implementation • 23 Apr 2023 • Yue Hu, Yuhang Zhang, Yanbing Wang, Daniel Work
In this work, we consider the problem of detecting a variety of socially abnormal driving behaviors, i.e., behaviors that do not conform to the behavior of other nearby drivers.
no code implementations • 16 Feb 2023 • Yuhang Zhang, Weihong Deng, Liang Zheng
We further provide interesting analyses of the effects of backbones and IND/OOD datasets on OOD detection performance.
Out-of-Distribution Detection
Out of Distribution (OOD) Detection
no code implementations • 29 Dec 2022 • Yuhang Zhang, Shishun Tian, Muxin Liao, Zhengyu Zhang, Wenbin Zou, Chen Xu
In this paper, we propose a class-wise non-salient region generalized (CNSG) framework for the VGSS task.
1 code implementation • 2 Dec 2022 • Yuhang Zhang, Weihong Deng, Xingchen Cui, Yunfeng Yin, Hongzhi Shi, Dongchao Wen
We introduce mean point ensemble to utilize a more robust loss function and more information from unselected samples to reduce error accumulation from the model perspective.
1 code implementation • 21 Jul 2022 • Yuhang Zhang, Chengrui Wang, Xu Ling, Weihong Deng
We find that FER models remember noisy samples by focusing on a part of the features that can be considered related to the noisy labels instead of learning from the whole features that lead to the latent truth.
no code implementations • 29 Jun 2022 • Yuhang Zhang, Yulian Jiang, Shenquan Wang
In this article, the observer-based coordinated tracking control problem for a class of nonlinear multi-agent systems (MASs) with intermittent communication and information constraints is studied under dynamic switching topology.
no code implementations • 22 Jan 2022 • Siyan Li, Yue Xiao, Yuhang Zhang, Lei Chu, Robert C. Qiu
It is a challenging problem to detect and recognize targets on complex large-scene Synthetic Aperture Radar (SAR) images.
no code implementations • CVPR 2022 • Yuhang Zhang, Xiaopeng Zhang, Lingxi Xie, Jie Li, Robert C. Qiu, Hengtong Hu, Qi Tian
The Yes query is treated as positive pairs of the queried category for contrastive pulling, while the No query is treated as hard negative pairs for contrastive repelling.
1 code implementation • NeurIPS 2021 • Yuhang Zhang, Chengrui Wang, Weihong Deng
To quantify these uncertainties and achieve good performance under noisy data, we regard uncertainty as a relative concept and propose an innovative uncertainty learning method called Relative Uncertainty Learning (RUL).
Ranked #19 on Facial Expression Recognition (FER) on RAF-DB
Facial Expression Recognition
Facial Expression Recognition (FER)
no code implementations • 16 May 2021 • Yuhang Zhang, Xiaopeng Zhang, Robert C. Qiu, Jie Li, Haohang Xu, Qi Tian
Semi-supervised learning acts as an effective way to leverage massive unlabeled data.
no code implementations • 16 Feb 2021 • Yuhang Zhang, Yao Mu, Yujie Yang, Yang Guan, Shengbo Eben Li, Qi Sun, Jianyu Chen
Reinforcement learning has shown great potential in developing high-level autonomous driving.
1 code implementation • IEEE Biomedical Circuits and Systems (BIOCAS) 2019 • Yi Ma, Xinzi Xu, Qing Yu, Yuhang Zhang, Yongfu Li, Jian Zhao, Guoxing Wang
Improving access to health care services for the medically under-served population is vital to ensure that critical illness can be addressed immediately.
Ranked #23 on Audio Classification on ICBHI Respiratory Sound Database
no code implementations • 25 Sep 2019 • Yuhang Zhang, Zhenwei Miao, Tiebin Mi, Robert Caiming Qiu
Three-dimensional data, such as point clouds, are often composed of three coordinates with few features.
no code implementations • 16 Jan 2018 • Zenan Ling, Robert C. Qiu, Zhijian Jin, Yuhang Zhang, Xing He, Haichun Liu, Lei Chu
Locating broken insulators in aerial images is a challenging task.