Search Results for author: Xiping Hu

Found 42 papers, 20 papers with code

Temporal Action Detection Model Compression by Progressive Block Drop

no code implementations21 Mar 2025 Xiaoyong Chen, Yong Guo, Jiaming Liang, Sitong Zhuang, Runhao Zeng, Xiping Hu

While existing channel pruning methods can compress these models, reducing the number of channels often hinders the parallelization efficiency of GPU, due to the inefficient multiplication between small matrices.

Action Detection Autonomous Driving +2

A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems

no code implementations10 Feb 2025 Linxiao Gong, Hao Yang, Gaoyun Fang, Bobo Ju, Juncen Guo, Xiaoguang Zhu, Xiping Hu, Yan Wang, Peng Sun, Azzedine Boukerche

The explosive growth of video data has driven the development of distributed video analytics in cloud-edge-terminal collaborative (CETC) systems, enabling efficient video processing, real-time inference, and privacy-preserving analysis.

Autonomous Driving Edge-computing +3

A Survey on Multimodal Recommender Systems: Recent Advances and Future Directions

1 code implementation22 Jan 2025 Jinfeng Xu, Zheyu Chen, Shuo Yang, Jinze Li, Wei Wang, Xiping Hu, Steven Hoi, Edith Ngai

The primary objective of this survey is to comprehensively review recent research advancements in MRS and to analyze the models from a technical perspective.

Recommendation Systems

AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling

no code implementations16 Jan 2025 Ancheng Xu, Di Yang, Renhao Li, Jingwei Zhu, Minghuan Tan, Min Yang, Wanxin Qiu, Mingchen Ma, Haihong Wu, Bingyu Li, Feng Sha, Chengming Li, Xiping Hu, Qiang Qu, Derek F. Wong, Ruifeng Xu

Traditional in-person psychological counseling remains primarily niche, often chosen by individuals with psychological issues, while online automated counseling offers a potential solution for those hesitant to seek help due to feelings of shame.

Facial Expression Analysis and Its Potentials in IoT Systems: A Contemporary Survey

no code implementations23 Dec 2024 Zixuan Shanggua, Yanjie Dong, Song Guo, Victor C. M. Leung, M. Jamal Deen, Xiping Hu

The integration of facial expression analysis with Internet-of-Thing (IoT) systems has significant potential across diverse scenarios.

Learning to Generate Gradients for Test-Time Adaptation via Test-Time Training Layers

1 code implementation22 Dec 2024 Qi Deng, Shuaicheng Niu, Ronghao Zhang, Yaofo Chen, Runhao Zeng, Jian Chen, Xiping Hu

Specifically, we aim for MGG to effectively utilize historical gradient information during the online optimization process to optimize the current model.

Memorization Test-time Adaptation

Video2Reward: Generating Reward Function from Videos for Legged Robot Behavior Learning

1 code implementation7 Dec 2024 Runhao Zeng, Dingjie Zhou, Qiwei Liang, Junlin Liu, Hui Li, Changxin Huang, Jianqiang Li, Xiping Hu, Fuchun Sun

In this paper, we introduce a new video2reward method, which directly generates reward functions from videos depicting the behaviors to be mimicked and learned.

Large Language Model

TGCA-PVT: Topic-Guided Context-Aware Pyramid Vision Transformer for Sticker Emotion Recognition

1 code implementation MM '24: Proceedings of the 32nd ACM International Conference on Multimedia 2024 Jian Chen, Wei Wang, Yuzhu Hu, Junxin Chen, Han Liu, Xiping Hu

Our approach encompasses a novel topic-guided context-aware module and a topic-guided attention mechanism, enabling the extraction of comprehensive topic context features from stickers sharing the same topic ID, significantly enhancing emotion recognition accuracy.

Emotion Recognition

MoChat: Joints-Grouped Spatio-Temporal Grounding LLM for Multi-Turn Motion Comprehension and Description

no code implementations15 Oct 2024 Jiawei Mo, Yixuan Chen, Rifen Lin, Yongkang Ni, Min Zeng, Xiping Hu, Min Li

Despite continuous advancements in deep learning for understanding human motion, existing models often struggle to accurately identify action timing and specific body parts, typically supporting only single-round interaction.

Language Modeling Language Modelling +2

Learning to Generalize Unseen Domains via Multi-Source Meta Learning for Text Classification

no code implementations20 Sep 2024 Yuxuan Hu, Chenwei Zhang, Min Yang, Xiaodan Liang, Chengming Li, Xiping Hu

In this paper, we study the multi-source Domain Generalization of text classification and propose a framework to use multiple seen domains to train a model that can achieve high accuracy in an unseen domain.

Domain Generalization Meta-Learning +2

Training on the Benchmark Is Not All You Need

1 code implementation3 Sep 2024 Shiwen Ni, Xiangtao Kong, Chengming Li, Xiping Hu, Ruifeng Xu, Jia Zhu, Min Yang

The success of Large Language Models (LLMs) relies heavily on the huge amount of pre-training data learned in the pre-training phase.

All Multiple-choice

A Survey on Facial Expression Recognition of Static and Dynamic Emotions

1 code implementation28 Aug 2024 Yan Wang, Shaoqi Yan, Yang Liu, Wei Song, Jing Liu, Yang Chang, Xinji Mai, Xiping Hu, Wenqiang Zhang, Zhongxue Gan

Facial expression recognition (FER) aims to analyze emotional states from static images and dynamic sequences, which is pivotal in enhancing anthropomorphic communication among humans, robots, and digital avatars by leveraging AI technologies.

cross-modal alignment Facial Expression Recognition +1

Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches

no code implementations20 Aug 2024 Yanjie Dong, Haijun Zhang, Chengming Li, Song Guo, Victor C. M. Leung, Xiping Hu

Additionally, large-scale foundation models have expanded to create images, audio, videos, and multi-modal contents, further emphasizing the need for efficient deployment.

Model Compression

Heavy-Ball Momentum Accelerated Actor-Critic With Function Approximation

no code implementations13 Aug 2024 Yanjie Dong, Haijun Zhang, Gang Wang, Shisheng Cui, Xiping Hu

In this work, we first propose a heavy-ball momentum based advantage actor-critic (\mbox{HB-A2C}) algorithm by integrating the heavy-ball momentum into the critic recursion that is parameterized by a linear function.

APTNESS: Incorporating Appraisal Theory and Emotion Support Strategies for Empathetic Response Generation

1 code implementation23 Jul 2024 Yuxuan Hu, Minghuan Tan, Chenwei Zhang, Zixuan Li, Xiaodan Liang, Min Yang, Chengming Li, Xiping Hu

By incorporating emotional support strategies, we aim to enrich the model's capabilities in both cognitive and affective empathy, leading to a more nuanced and comprehensive empathetic response.

Empathetic Response Generation Response Generation +2

Resource Allocation and Workload Scheduling for Large-Scale Distributed Deep Learning: A Survey

no code implementations12 Jun 2024 Feng Liang, Zhen Zhang, Haifeng Lu, Chengming Li, Victor C. M. Leung, Yanyi Guo, Xiping Hu

The large-scale environment with large volumes of datasets, models, and computational and communication resources raises various unique challenges for resource allocation and workload scheduling in distributed deep learning, such as scheduling complexity, resource and workload heterogeneity, and fault tolerance.

Deep Learning Scheduling +1

FourierKAN-GCF: Fourier Kolmogorov-Arnold Network -- An Effective and Efficient Feature Transformation for Graph Collaborative Filtering

1 code implementation3 Jun 2024 Jinfeng Xu, Zheyu Chen, Jinze Li, Shuo Yang, Wei Wang, Xiping Hu, Edith C. -H. Ngai

We revisit these two components and discover that a part of feature transformation and nonlinear operation during message passing in GCN can improve the representation of GCF, but increase the difficulty of training.

Collaborative Filtering

CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling

2 code implementations26 May 2024 Chenhao Zhang, Renhao Li, Minghuan Tan, Min Yang, Jingwei Zhu, Di Yang, Jiahao Zhao, Guancheng Ye, Chengming Li, Xiping Hu

To bridge the gap, we propose CPsyCoun, a report-based multi-turn dialogue reconstruction and evaluation framework for Chinese psychological counseling.

CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations

1 code implementation16 May 2024 Jiahao Zhao, Jingwei Zhu, Minghuan Tan, Min Yang, Renhao Li, Di Yang, Chenhao Zhang, Guancheng Ye, Chengming Li, Xiping Hu, Derek F. Wong

In this paper, we introduce a novel psychological benchmark, CPsyExam, constructed from questions sourced from Chinese language examinations.

4k

Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive Survey

no code implementations9 Apr 2024 Feng Liang, Zhen Zhang, Haifeng Lu, Victor C. M. Leung, Yanyi Guo, Xiping Hu

Due to intensive synchronization of models and sharing of data across GPUs and computing nodes during distributed training and inference processes, communication efficiency becomes the bottleneck for achieving high performance at a large scale.

Data Compression Deep Learning +2

CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment

1 code implementation25 Mar 2024 Feiteng Fang, Liang Zhu, Min Yang, Xi Feng, Jinchang Hou, Qixuan Zhao, Chengming Li, Xiping Hu, Ruifeng Xu

Reinforcement learning from human feedback (RLHF) is a crucial technique in aligning large language models (LLMs) with human preferences, ensuring these LLMs behave in beneficial and comprehensible ways to users.

Contrastive Learning reinforcement-learning +1

Layer-wise Regularized Dropout for Neural Language Models

no code implementations26 Feb 2024 Shiwen Ni, Min Yang, Ruifeng Xu, Chengming Li, Xiping Hu

To solve the inconsistency between training and inference caused by the randomness of dropout, some studies use consistency training to regularize dropout at the output layer.

Abstractive Text Summarization Machine Translation +1

MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property

1 code implementation26 Feb 2024 Shiwen Ni, Minghuan Tan, Yuelin Bai, Fuqiang Niu, Min Yang, BoWen Zhang, Ruifeng Xu, Xiaojun Chen, Chengming Li, Xiping Hu, Ye Li, Jianping Fan

In this paper, we contribute a new benchmark, the first Multilingual-oriented quiZ on Intellectual Property (MoZIP), for the evaluation of LLMs in the IP domain.

Language Modeling Language Modelling +3

E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models

1 code implementation29 Jan 2024 Jinchang Hou, Chang Ao, Haihong Wu, Xiangtao Kong, Zhigang Zheng, Daijia Tang, Chengming Li, Xiping Hu, Ruifeng Xu, Shiwen Ni, Min Yang

The integration of LLMs and education is getting closer and closer, however, there is currently no benchmark for evaluating LLMs that focuses on the Chinese K-12 education domain.

Ethics Multiple-choice

Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models

no code implementations14 Nov 2023 Shiwen Ni, Dingwei Chen, Chengming Li, Xiping Hu, Ruifeng Xu, Min Yang

In this paper, we propose a new paradigm for fine-tuning called F-Learning (Forgetting before Learning), which employs parametric arithmetic to facilitate the forgetting of old knowledge and learning of new knowledge.

Expression Syntax Information Bottleneck for Math Word Problems

1 code implementation24 Oct 2023 Jing Xiong, Chengming Li, Min Yang, Xiping Hu, Bin Hu

To this end, we design an Expression Syntax Information Bottleneck method for MWP (called ESIB) based on variational information bottleneck, which extracts essential features of expression syntax tree while filtering latent-specific redundancy containing syntax-irrelevant features.

Math

Accelerating Wireless Federated Learning via Nesterov's Momentum and Distributed Principle Component Analysis

no code implementations31 Mar 2023 Yanjie Dong, Luya Wang, Yuanfang Chi, Jia Wang, Haijun Zhang, Fei Richard Yu, Victor C. M. Leung, Xiping Hu

A wireless federated learning system is investigated by allowing a server and workers to exchange uncoded information via orthogonal wireless channels.

Federated Learning

Self-consistent Reasoning For Solving Math Word Problems

no code implementations27 Oct 2022 Jing Xiong, Zhongwei Wan, Xiping Hu, Min Yang, Chengming Li

Specifically, we firstly obtain a sub-network by pruning a roberta2tree model, for the sake to use the gap on output distribution between the original roberta2tree model and the pruned sub-network to expose spurious correlative samples.

Math

SM-SGE: A Self-Supervised Multi-Scale Skeleton Graph Encoding Framework for Person Re-Identification

1 code implementation5 Jul 2021 Haocong Rao, Xiping Hu, Jun Cheng, Bin Hu

In this paper, we for the first time propose a Self-supervised Multi-scale Skeleton Graph Encoding (SM-SGE) framework that comprehensively models human body, component relations, and skeleton dynamics from unlabeled skeleton graphs of various scales to learn an effective skeleton representation for person Re-ID.

Person Re-Identification Relation Network

More than Encoder: Introducing Transformer Decoder to Upsample

no code implementations20 Jun 2021 Yijiang Li, Wentian Cai, Ying Gao, Chengming Li, Xiping Hu

The local and detailed feature from the shallower layer such as boundary and tissue texture is particularly more important in medical segmentation compared with natural image segmentation.

Decoder Image Segmentation +4

Multi-Level Graph Encoding with Structural-Collaborative Relation Learning for Skeleton-Based Person Re-Identification

1 code implementation6 Jun 2021 Haocong Rao, Shihao Xu, Xiping Hu, Jun Cheng, Bin Hu

To fully explore body relations, we construct graphs to model human skeletons from different levels, and for the first time propose a Multi-level Graph encoding approach with Structural-Collaborative Relation learning (MG-SCR) to encode discriminative graph features for person Re-ID.

Person Re-Identification Relation

Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton Based Action Recognition

1 code implementation14 Nov 2020 Shihao Xu, Haocong Rao, Xiping Hu, Bin Hu

Existing approaches usually learn action representations by sequential prediction but they suffer from the inability to fully learn semantic information.

Clustering Prediction +6

A Self-Supervised Gait Encoding Approach with Locality-Awareness for 3D Skeleton Based Person Re-Identification

1 code implementation5 Sep 2020 Haocong Rao, Siqi Wang, Xiping Hu, Mingkui Tan, Yi Guo, Jun Cheng, Xinwang Liu, Bin Hu

This paper proposes a self-supervised gait encoding approach that can leverage unlabeled skeleton data to learn gait representations for person Re-ID.

Contrastive Learning Person Re-Identification +2

Self-Supervised Gait Encoding with Locality-Aware Attention for Person Re-Identification

1 code implementation21 Aug 2020 Haocong Rao, Siqi Wang, Xiping Hu, Mingkui Tan, Huang Da, Jun Cheng, Bin Hu

Unlike previous methods, we for the first time propose a generic gait encoding approach that can utilize unlabeled skeleton data to learn gait representations in a self-supervised manner.

Person Re-Identification

Augmented Skeleton Based Contrastive Action Learning with Momentum LSTM for Unsupervised Action Recognition

2 code implementations1 Aug 2020 Haocong Rao, Shihao Xu, Xiping Hu, Jun Cheng, Bin Hu

In this paper, we for the first time propose a contrastive action learning paradigm named AS-CAL that can leverage different augmentations of unlabeled skeleton data to learn action representations in an unsupervised manner.

Contrastive Learning Self-Supervised Human Action Recognition +1

Emotion Recognition From Gait Analyses: Current Research and Future Directions

no code implementations13 Mar 2020 Shihao Xu, Jing Fang, Xiping Hu, Edith Ngai, Wei Wang, Yi Guo, Victor C. M. Leung

This article reviews current research on gait-based emotion detection, particularly on how gait parameters can be affected by different emotion states and how the emotion states can be recognized through distinct gait patterns.

Emotion Recognition

MODMA dataset: a Multi-modal Open Dataset for Mental-disorder Analysis

no code implementations20 Feb 2020 Hanshu Cai, Yiwen Gao, Shuting Sun, Na Li, Fuze Tian, Han Xiao, Jianxiu Li, Zhengwu Yang, Xiaowei Li, Qinglin Zhao, Zhenyu Liu, Zhijun Yao, Minqiang Yang, Hong Peng, Jing Zhu, Xiaowei Zhang, Guoping Gao, Fang Zheng, Rui Li, Zhihua Guo, Rong Ma, Jing Yang, Lan Zhang, Xiping Hu, Yumin Li, Bin Hu

The EEG dataset includes not only data collected using traditional 128-electrodes mounted elastic cap, but also a novel wearable 3-electrode EEG collector for pervasive applications.

EEG

Towards Interpreting Deep Neural Networks via Understanding Layer Behaviors

no code implementations25 Sep 2019 JieZhang Cao, Jincheng Li, Xiping Hu, Peilin Zhao, Mingkui Tan

ii) the $W$-distance of a specific layer to the target distribution tends to decrease along training iterations.

Anchor-based Nearest Class Mean Loss for Convolutional Neural Networks

no code implementations22 Apr 2018 Fusheng Hao, Jun Cheng, Lei Wang, Xinchao Wang, Jianzhong Cao, Xiping Hu, Dapeng Tao

Discriminative features are obtained by constraining the deep CNNs to map training samples to the corresponding anchors as close as possible.

Image Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.