no code implementations • 10 Mar 2025 • Chuanming Wang, Henming Mao, Huanhuan Zhang, Huiyuan Fu, Huadong Ma
To further enhance discriminative capability, we propose a cross relationship modeling pattern that combines visual features with all class prompt features, enabling a deeper exploration of the relationships between these two modalities.
2 code implementations • 18 Feb 2025 • Mengshi Qi, Changsheng Lv, Huadong Ma
Furthermore, we introduce a counterfactual learning module to augment the model's reasoning ability by modeling physical knowledge relationships among different objects under counterfactual intervention.
no code implementations • 28 Jan 2025 • Pengfei Zhu, Peng Shu, Mengshi Qi, Liang Liu, Huadong Ma
This involves first training a fully observed model and then using a distillation process to create the final model.
no code implementations • 25 Jan 2025 • Mengshi Qi, Xiaoyang Bi, Pengfei Zhu, Huadong Ma
Robustly predicting attention regions of interest for self-driving systems is crucial for driving safety but presents significant challenges due to the labor-intensive nature of obtaining large-scale attention labels and the domain gap between self-driving scenarios and natural scenes.
no code implementations • 16 Jan 2025 • Wulian Yun, Mengshi Qi, Fei Peng, Huadong Ma
Secondly, we introduce a Multi-level Feature Learning strategy, which utilizes the outputs from different stages of the backbone to estimate the heatmap to guide network training, enriching the supervisory information while effectively capturing keypoint relationships.
no code implementations • 9 Jan 2025 • Jiaxuan Peng, Mengshi Qi, Dong Zhao, Huadong Ma
In this work, we introduce a novel balanced continual multi-modal learning method for 3D HPE, which harnesses the power of RGB, LiDAR, mmWave, and WiFi.
1 code implementation • 7 Jan 2025 • Mengshi Qi, Hao Ye, Jiaxuan Peng, Huadong Ma
Firstly, we introduce a multi-scale dynamic visual-skeleton encoder to capture fine-grained spatio-temporal visual and skeletal features.
1 code implementation • 20 Dec 2024 • Mengshi Qi, Yuxin Yang, Huadong Ma
Effective modeling of group interactions and dynamic semantic intentions is crucial for forecasting behaviors like trajectories or movements.
no code implementations • 28 Nov 2024 • Dacheng Liao, Mengshi Qi, Liang Liu, Huadong Ma
In current open real-world autonomous driving scenarios, challenges such as sensor failure and extreme weather conditions hinder the generalization of most autonomous driving perception models to these unseen domains due to the domain shifts between the test and training data.
no code implementations • 28 Nov 2024 • Changsheng Lv, Mengshi Qi, Liang Liu, Huadong Ma
Understanding the traffic scenes and then generating high-definition (HD) maps present significant challenges in autonomous driving.
1 code implementation • 16 Aug 2024 • Rui Wang, Mengshi Qi, Yingxia Shao, Anfu Zhou, Huadong Ma
To tackle this challenge, we introduce a novel physics-informed temporal network (PITN) with adversarial contrastive learning to enable precise BP estimation with very limited data.
no code implementations • 29 Jul 2024 • Wulian Yun, Mengshi Qi, Fei Peng, Huadong Ma
Differing from the traditional teacher-student network, we propose a teacher-reference-student architecture to learn both unlabeled and labeled data, where the teacher network and the reference network are used to generate pseudo-labels for unlabeled data to supervise the student network.
1 code implementation • 19 Jul 2024 • Zhe Zhao, Mengshi Qi, Huadong Ma
Generating realistic human grasps is a crucial yet challenging task for applications involving object manipulation in computer graphics and robotics.
1 code implementation • 6 Jun 2024 • Shan Li, Lu Yang, Pu Cao, Liulei Li, Huadong Ma
The successful application of semantic segmentation technology in the real world has been among the most exciting achievements in the computer vision community over the past decade.
no code implementations • 25 Apr 2024 • Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, HaoNing Wu, Yixuan Gao, Yuqin Cao, ZiCheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng, Jianquan Yang, Weigang Wang, Xi Fang, Xiaoxin Lv, Jun Yan, Tianwu Zhi, Yabin Zhang, Yaohui Li, Yang Li, Jingwen Xu, Jianzhao Liu, Yiting Liao, Junlin Li, Zihao Yu, Yiting Lu, Xin Li, Hossein Motamednia, S. Farhad Hosseini-Benvidi, Fengbin Guan, Ahmad Mahmoudi-Aznaveh, Azadeh Mansouri, Ganzorig Gankhuyag, Kihwan Yoon, Yifang Xu, Haotian Fan, Fangyuan Kong, Shiling Zhao, Weifeng Dong, Haibing Yin, Li Zhu, Zhiling Wang, Bingchen Huang, Avinab Saha, Sandeep Mishra, Shashank Gupta, Rajesh Sureddi, Oindrila Saha, Luigi Celona, Simone Bianco, Paolo Napoletano, Raimondo Schettini, Junfeng Yang, Jing Fu, Wei zhang, Wenzhi Cao, Limei Liu, Han Peng, Weijun Yuan, Zhan Li, Yihang Cheng, Yifan Deng, Haohui Li, Bowen Qu, Yao Li, Shuqing Luo, Shunzhou Wang, Wei Gao, Zihao Lu, Marcos V. Conde, Xinrui Wang, Zhibo Chen, Ruling Liao, Yan Ye, Qiulin Wang, Bing Li, Zhaokun Zhou, Miao Geng, Rui Chen, Xin Tao, Xiaoyu Liang, Shangkun Sun, Xingyuan Ma, Jiaze Li, Mengduo Yang, Haoran Xu, Jie zhou, Shiding Zhu, Bohan Yu, Pengfei Chen, Xinrui Xu, Jiabin Shen, Zhichao Duan, Erfan Asadi, Jiahe Liu, Qi Yan, Youran Qu, Xiaohui Zeng, Lele Wang, Renjie Liao
A total of 196 participants have registered in the video track.
no code implementations • 23 Apr 2024 • Kaikai Deng, Dong Zhao, Wenxin Zheng, Yue Ling, Kangwen Yin, Huadong Ma
Millimeter wave radar is gaining traction recently as a promising modality for enabling pervasive and privacy-preserving gesture recognition.
1 code implementation • 17 Apr 2024 • Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei LI, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Fangyuan Kong, Haotian Fan, Yifang Xu, Haoran Xu, Mengduo Yang, Jie zhou, Jiaze Li, Shijie Wen, Mai Xu, Da Li, Shunyu Yao, Jiazhi Du, WangMeng Zuo, Zhibo Li, Shuai He, Anlong Ming, Huiyuan Fu, Huadong Ma, Yong Wu, Fie Xue, Guozhi Zhao, Lina Du, Jie Guo, Yu Zhang, huimin zheng, JunHao Chen, Yue Liu, Dulan Zhou, Kele Xu, Qisheng Xu, Tao Sun, Zhixiang Ding, Yuhang Hu
This paper reviews the NTIRE 2024 Challenge on Shortform UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions were submitted and evaluated on the collected dataset KVQ from a popular short-form video platform, i.e., the Kuaishou/Kwai Platform.
2 code implementations • 13 Mar 2024 • Yihao Liu, Feng Xue, Anlong Ming, Mingshuai Zhao, Huadong Ma, Nicu Sebe
Firstly, to obtain consistent depth across diverse scenes, we propose a novel metric scale modeling, i.e., variation-based unnormalized depth bins.
1 code implementation • 28 Feb 2024 • Jin Liu, Huiyuan Fu, Chuanming Wang, Huadong Ma
Exposure correction aims to enhance images suffering from improper exposure to achieve satisfactory visual effects.
1 code implementation • 27 Feb 2024 • Jin Liu, Bo Wang, Chuanming Wang, Huiyuan Fu, Huadong Ma
Exposure correction aims to enhance visual data suffering from improper exposures, which can greatly improve visual effects.
1 code implementation • 12 Jan 2024 • Huiyuan Fu, Kuilong Cui, Chuanming Wang, Mengshi Qi, Huadong Ma
With the rapid advancements in deep learning technologies, person re-identification (ReID) has witnessed remarkable performance improvements.
no code implementations • 5 Jan 2024 • Yuxin Yang, Pengfei Zhu, Mengshi Qi, Huadong Ma
To uncover latent motion patterns in human behavior, we introduce a novel memory-based method, named Motion Pattern Priors Memory Network.
1 code implementation • 5 Jan 2024 • Qi An, Mengshi Qi, Huadong Ma
In recent years, there has been growing interest in video-based action quality assessment (AQA).
no code implementations • 4 Jan 2024 • Chuanming Wang, Yuxin Yang, Mengshi Qi, Huadong Ma
Object re-identification (ReID) is committed to searching for objects of the same identity across cameras, and its real-world deployment is gradually increasing.
1 code implementation • CVPR 2024 • Huiyuan Fu, Fei Peng, Xianwei Li, Yejun Li, Xin Wang, Huadong Ma
The extensive experiments demonstrate the superior performance of the arbitrary-scale SR models trained on the COZ dataset compared to models trained on simulated data.
1 code implementation • 1 Dec 2023 • Yaoyao Zhong, Mengshi Qi, Rui Wang, Yuhan Qiu, Yang Zhang, Huadong Ma
Video Internet of Things (VIoT) has shown full potential in collecting an unprecedented volume of video data.
1 code implementation • 22 Mar 2023 • Wulian Yun, Mengshi Qi, Chuanming Wang, Huadong Ma
Weakly-supervised temporal action localization aims to locate action regions and identify action categories in untrimmed videos simultaneously by taking only video-level labels as the supervision.
Pseudo Label
Weakly-supervised Temporal Action Localization
1 code implementation • 20 Mar 2023 • Changsheng Lv, Mengshi Qi, Xia Li, Zhengyuan Yang, Huadong Ma
In this paper, we propose a novel model called SGFormer, Semantic Graph TransFormer for point cloud-based 3D scene graph generation.
no code implementations • ICCV 2023 • Pengfei Zhu, Mengshi Qi, Xia Li, Weijian Li, Huadong Ma
Predicting attention regions of interest is an important yet challenging task for self-driving systems.
1 code implementation • ICCV 2023 • Shuai He, Anlong Ming, Yaqi Li, Jinyuan Sun, Shuntian Zheng, Huadong Ma
We present a comprehensive study on a new task named image color aesthetics assessment (ICAA), which aims to assess color aesthetics based on human perception.
no code implementations • CVPR 2023 • Huiyuan Fu, Wenkai Zheng, Xiangyu Meng, Xin Wang, Chuanming Wang, Huadong Ma
Retinex-based methods require decomposing the image into reflectance and illumination components, which is a highly ill-posed problem for which no ground truth is available.
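For reference, the decomposition mentioned here is usually written in the standard Retinex form below. The symbols $I$, $R$, $L$ and the pixel index $x$ are notation chosen for this note, not taken from the paper.

```latex
% Standard Retinex image model (notation assumed for illustration):
%   I(x): observed image, R(x): reflectance, L(x): illumination
I(x) = R(x) \odot L(x)
```

Each pixel contributes one observed value $I(x)$ but two unknowns, $R(x)$ and $L(x)$, so infinitely many pairs reproduce the same image. That is why the problem is ill-posed when no ground-truth decomposition exists.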
1 code implementation • ICCV 2023 • Huiyuan Fu, Wenkai Zheng, Xicong Wang, Jiaxuan Wang, Heng Zhang, Huadong Ma
To address this issue, we design a camera system and collect a high-quality low-light video dataset with multiple exposures and cameras.
no code implementations • 30 Apr 2022 • Wulian Yun, Mengshi Qi, Chuanming Wang, Huiyuan Fu, Huadong Ma
Meanwhile, we design a Multi-Scale Residual Structure to preserve multiple aspects of information at different stages, which contains a Temporal Features Aggregation Module to summarize the dynamic representation.
no code implementations • 20 Feb 2022 • Lige Ding, Dong Zhao, Zhaofeng Wang, Guang Wang, Chang Tan, Lei Fan, Huadong Ma
The ever-increasing heavy traffic congestion potentially impedes the accessibility of emergency vehicles (EVs), resulting in detrimental impacts on critical services and even the safety of people's lives.
1 code implementation • 18 Oct 2021 • Yizong Wang, Dong Zhao, Yajie Ren, Desheng Zhang, Huadong Ma
A direct idea is to leverage the urban transfer learning paradigm to learn the knowledge from a source city, then exploit it to predict charging demands, and meanwhile determine locations and amounts of slow/fast chargers for charging stations in the target city.
no code implementations • 18 Jun 2020 • Kun Liu, Huadong Ma, Chuang Gan
In this paper, we present Language Guided Networks (LGN), a new framework that leverages the sentence embedding to guide the whole process of moment retrieval.
no code implementations • 17 Jun 2020 • Kun Liu, Wu Liu, Huadong Ma, Mingkui Tan, Chuang Gan
Our method achieves clear improvements on the UCF101 action recognition benchmark over state-of-the-art real-time methods: 5.4% higher accuracy and 2 times faster inference, with a model that requires less than 5 MB of storage.
no code implementations • 22 Jul 2019 • Peiye Liu, Bo Wu, Huadong Ma, Mingoo Seok
Recent studies on automatic neural architecture search have demonstrated significant performance, competitive to or even better than hand-crafted neural architectures.
no code implementations • 10 Jan 2019 • Xinchen Liu, Wu Liu, Huadong Ma, Shuangqun Li
In this paper, a Progressive Vehicle Search System, named PVSS, is designed to solve the above problems.
no code implementations • 10 Jan 2019 • Meng Zhang, Xinchen Liu, Wu Liu, Anfu Zhou, Huadong Ma, Tao Mei
To bridge the domain gap, we propose a Multi-Granularity Reasoning framework for social relation recognition from images.
Ranked #3 on Visual Social Relationship Recognition on PISC
no code implementations • 18 Oct 2018 • Peiye Liu, Wu Liu, Huadong Ma, Tao Mei, Mingoo Seok
To transfer the knowledge of intermediate representations, we set high-level teacher feature maps as a target, toward which the student feature maps are trained.
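The idea of training student feature maps toward teacher feature maps can be illustrated with a small sketch. It assumes a plain mean-squared-error objective and treats the student feature values themselves as the trainable quantities; the excerpt does not state the actual loss or training procedure, and the names `feature_mse` and `gradient_step` are hypothetical.

```python
# Illustrative sketch only: the MSE objective and the direct update of
# feature values are assumptions, not the method described in the paper.

def feature_mse(student, teacher):
    """Mean squared error between two equally shaped 2D feature maps."""
    if len(student) != len(teacher) or any(
        len(s_row) != len(t_row) for s_row, t_row in zip(student, teacher)
    ):
        raise ValueError("feature maps must have the same shape")
    total, count = 0.0, 0
    for s_row, t_row in zip(student, teacher):
        for s, t in zip(s_row, t_row):
            total += (s - t) ** 2
            count += 1
    return total / count


def gradient_step(student, teacher, lr=0.1):
    """One gradient-descent step that moves the student map toward the teacher.

    The gradient of the MSE with respect to each student value s is
    2 * (s - t) / count, where count is the number of elements.
    """
    count = sum(len(row) for row in student)
    return [
        [s - lr * 2.0 * (s - t) / count for s, t in zip(s_row, t_row)]
        for s_row, t_row in zip(student, teacher)
    ]


teacher_map = [[1.0, 2.0], [3.0, 6.0]]   # fixed target (teacher features)
student_map = [[1.0, 2.0], [3.0, 4.0]]   # values being trained (student features)

loss_before = feature_mse(student_map, teacher_map)
student_map = gradient_step(student_map, teacher_map)
loss_after = feature_mse(student_map, teacher_map)
```

In a real network the gradient would flow back into the student's weights instead of updating the feature values directly; the sketch only shows the teacher map acting as a fixed target.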
no code implementations • 20 Oct 2017 • Kun Liu, Wu Liu, Huadong Ma, Wenbing Huang, Xiongxiong Dong
Motivated by this, we study the task of action recognition in surveillance video under a more realistic generalized zero-shot setting, where testing data contains both seen and unseen classes.