Search Results for author: Mengyuan Liu

Found 48 papers, 31 papers with code

SFMViT: SlowFast Meet ViT in Chaotic World

1 code implementation25 Apr 2024 Jiaying Lin, Jiajun Wen, Mengyuan Liu, Jinfu Liu, Baiqiao Yin, Yue Li

The task of spatiotemporal action localization in chaotic scenes is a challenging task toward advanced video understanding.

HDBN: A Novel Hybrid Dual-branch Network for Robust Skeleton-based Action Recognition

1 code implementation24 Apr 2024 Jinfu Liu, Baiqiao Yin, Jiaying Lin, Jiajun Wen, Yue Li, Mengyuan Liu

Skeleton-based action recognition has gained considerable traction thanks to its utilization of succinct and robust skeletal representations.

MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions

1 code implementation21 Apr 2024 Sheng Yan, Mengyuan Liu, Yong Wang, Yang Liu, Chen Chen, Hong Liu

In this paper, we address the unexplored question of temporal sentence localization in human motions (TSLM), aiming to locate a target moment from a 3D human motion that semantically corresponds to a text query.

Moment Retrieval Sentence

Point-In-Context: Understanding Point Cloud via In-Context Learning

1 code implementation18 Apr 2024 Mengyuan Liu, Zhongbin Fang, Xia Li, Joachim M. Buhmann, Xiangtai Li, Chen Change Loy

With the emergence of large-scale models trained on diverse datasets, in-context learning has emerged as a promising paradigm for multitasking, notably in natural language processing and image processing.

In-Context Learning

VG4D: Vision-Language Model Goes 4D Video Recognition

1 code implementation17 Apr 2024 Zhichao Deng, Xiangtai Li, Xia Li, Yunhai Tong, Shen Zhao, Mengyuan Liu

By transferring the knowledge of the VLM to the 4D encoder and combining the VLM, our VG4D achieves improved recognition performance.

Action Recognition Autonomous Driving +2

Identity-aware Dual-constraint Network for Cloth-Changing Person Re-identification

no code implementations13 Mar 2024 Peini Guo, Mengyuan Liu, Hong Liu, Ruijia Fan, Guoquan Wang, Bin He

In addition, a Multi-scale Constraint Block (MCB) is designed, which extracts fine-grained identity-related features and effectively transfers cloth-irrelevant knowledge.

Cloth-Changing Person Re-Identification counterfactual

Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation

no code implementations4 Feb 2024 Ti Wang, Mengyuan Liu, Hong Liu, Bin Ren, Yingxuan You, Wenhao Li, Nicu Sebe, Xia Li

We observe that previous optimization-based methods commonly rely on projection constraint, which only ensures alignment in 2D space, potentially leading to the overfitting problem.

3D Human Pose Estimation

Key-Graph Transformer for Image Restoration

no code implementations4 Feb 2024 Bin Ren, Yawei Li, Jingyun Liang, Rakesh Ranjan, Mengyuan Liu, Rita Cucchiara, Luc van Gool, Nicu Sebe

While it is crucial to capture global information for effective image restoration (IR), integrating such cues into transformer-based methods becomes computationally expensive, especially with high input resolution.

Graph Attention Image Restoration

Learning Mutual Excitation for Hand-to-Hand and Human-to-Human Interaction Recognition

no code implementations4 Feb 2024 Mengyuan Liu, Chen Chen, Songtao Wu, Fanyang Meng, Hong Liu

Recognizing interactive actions, including hand-to-hand interaction and human-to-human interaction, has attracted increasing attention for various applications in the field of video analysis and human-robot interaction.

Action Recognition Human Interaction Recognition

Eye Motion Matters for 3D Face Reconstruction

1 code implementation18 Jan 2024 Xuan Wang, Mengyuan Liu

Recent advances in single-image 3D face reconstruction have shown remarkable progress in various applications.

3D Face Reconstruction

ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification

1 code implementation16 Jan 2024 Zhongbin Fang, Xia Li, Xiangtai Li, Shen Zhao, Mengyuan Liu

Through extensive experiments, we demonstrate that our PointMLS achieves state-of-the-art results on ModelNet-O and competitive results on regular datasets, and it is robust and effective.

3D Point Cloud Classification Point Cloud Classification

Diffusion-based Pose Refinement and Muti-hypothesis Generation for 3D Human Pose Estimaiton

1 code implementation10 Jan 2024 Hongbo Kang, Yong Wang, Mengyuan Liu, Doudou Wu, Peng Liu, Xinlin Yuan, Wenming Yang

To address these two challenges, we propose a diffusion-based refinement framework called DRPose, which refines the output of deterministic models by reverse diffusion and achieves more suitable multi-hypothesis prediction for the current pose benchmark by multi-step refinement with multiple noises.

3D Human Pose Estimation Denoising

Expressive Forecasting of 3D Whole-body Human Motions

1 code implementation19 Dec 2023 Pengxiang Ding, Qiongjie Cui, Min Zhang, Mengyuan Liu, Haofan Wang, Donglin Wang

Human motion forecasting, with the goal of estimating future human behavior over a period of time, is a fundamental task in many real-world applications.

Human Pose Forecasting Motion Forecasting

GCNext: Towards the Unity of Graph Convolutions for Human Motion Prediction

1 code implementation19 Dec 2023 Xinshun Wang, Qiongjie Cui, Chen Chen, Mengyuan Liu

The past few years has witnessed the dominance of Graph Convolutional Networks (GCNs) over human motion prediction. Various styles of graph convolutions have been proposed, with each one meticulously designed and incorporated into a carefully-crafted network architecture.

Human motion prediction motion prediction +1

Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning

1 code implementation6 Dec 2023 Xinshun Wang, Zhongbin Fang, Xia Li, Xiangtai Li, Chen Chen, Mengyuan Liu

Under this setting, the model can perceive tasks from prompts and accomplish them without any extra task-specific head predictions or model fine-tuning.

In-Context Learning motion prediction +1

Dynamic Dense Graph Convolutional Network for Skeleton-based Human Motion Prediction

no code implementations29 Nov 2023 Xinshun Wang, Wanying Zhang, Can Wang, Yuan Gao, Mengyuan Liu

Graph Convolutional Networks (GCN) which typically follows a neural message passing framework to model dependencies among skeletal joints has achieved high success in skeleton-based human motion prediction task.

Human motion prediction motion prediction

Dynamic Compositional Graph Convolutional Network for Efficient Composite Human Motion Prediction

1 code implementation23 Nov 2023 Wanying Zhang, Shen Zhao, Fanyang Meng, Songtao Wu, Mengyuan Liu

With potential applications in fields including intelligent surveillance and human-robot interaction, the human motion prediction task has become a hot research topic and also has achieved high success, especially using the recent Graph Convolutional Network (GCN).

Action Generation Human motion prediction +1

Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation

1 code implementation20 Nov 2023 Wenhao Li, Mengyuan Liu, Hong Liu, Pichao Wang, Jialun Cai, Nicu Sebe

Transformers have been successfully applied in the field of video-based 3D human pose estimation.

3D Human Pose Estimation

Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning

1 code implementation25 Sep 2023 Yang Liu, Chen Chen, Can Wang, Xulin King, Mengyuan Liu

The proposed method decouples functions between the decoder and the encoder by introducing a mask regressor, which predicts the masked patch representation from the visible patch representation encoded by the encoder and the decoder reconstructs the target from the predicted masked patch representation.

Few-Shot 3D Point Cloud Classification Representation Learning +1

RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation

1 code implementation ICCV 2023 Lijun Li, Linrui Tian, Xindi Zhang, Qi Wang, Bang Zhang, Mengyuan Liu, Chen Chen

The current interacting hand (IH) datasets are relatively simplistic in terms of background and texture, with hand joints being annotated by a machine annotator, which may result in inaccuracies, and the diversity of pose distribution is limited.

3D Interacting Hand Pose Estimation Hand Pose Estimation

Double-chain Constraints for 3D Human Pose Estimation in Images and Videos

1 code implementation10 Aug 2023 Hongbo Kang, Yong Wang, Mengyuan Liu, Doudou Wu, Peng Liu, Wenming Yang

Notably, our model achieves state-of-the-art performance on all action categories in the Human3. 6M dataset using detected 2D poses from CPN, and our code is available at: https://github. com/KHB1698/DC-GCT.

Monocular 3D Human Pose Estimation

Facial Prior Based First Order Motion Model for Micro-expression Generation

1 code implementation8 Aug 2023 Yi Zhang, Youjun Zhao, Yuhang Wen, Zixuan Tang, Xinhua Xu, Mengyuan Liu

To solve this problem, this paper tries to formulate a new task called micro-expression generation and then presents a strong baseline which combines the first order motion model with facial prior knowledge.

Micro-expression Generation (MEGC2021) motion prediction

Learning Snippet-to-Motion Progression for Skeleton-based Human Motion Prediction

no code implementations26 Jul 2023 Xinshun Wang, Qiongjie Cui, Chen Chen, Shen Zhao, Mengyuan Liu

Existing Graph Convolutional Networks to achieve human motion prediction largely adopt a one-step scheme, which output the prediction straight from history input, failing to exploit human motion patterns.

Human motion prediction motion prediction +1

Integrating Human Parsing and Pose Network for Human Action Recognition

1 code implementation16 Jul 2023 Runwei Ding, Yuhang Wen, Jinfu Liu, Nan Dai, Fanyang Meng, Mengyuan Liu

We propose an Integrating Human Parsing and Pose Network (IPP-Net) for action recognition, which is the first to leverage both skeletons and human parsing feature maps in dual-branch approach.

Action Recognition Human Parsing

Joint Adversarial and Collaborative Learning for Self-Supervised Action Recognition

1 code implementation15 Jul 2023 Tianyu Guo, Mengyuan Liu, Hong Liu, Wenhao Li, Jingwen Guo, Tao Wang, Yidi Li

Considering the instance-level discriminative ability, contrastive learning methods, including MoCo and SimCLR, have been adapted from the original image representation learning task to solve the self-supervised skeleton-based action recognition task.

Contrastive Learning Ensemble Learning +4

A Gated Cross-domain Collaborative Network for Underwater Object Detection

1 code implementation25 Jun 2023 Linhui Dai, Hong Liu, Pinhao Song, Mengyuan Liu

Firstly, a real-time UIE method is employed to generate enhanced images, which can improve the visibility of objects in low-contrast areas.

object-detection Object Detection +1

Explore In-Context Learning for 3D Point Cloud Understanding

2 code implementations NeurIPS 2023 Zhongbin Fang, Xiangtai Li, Xia Li, Joachim M. Buhmann, Chen Change Loy, Mengyuan Liu

With the rise of large-scale models trained on broad data, in-context learning has become a new learning paradigm that has demonstrated significant potential in natural language processing and computer vision tasks.

In-Context Learning

Cross-Modal Retrieval for Motion and Text via DopTriple Loss

1 code implementation7 May 2023 Sheng Yan, Yang Liu, Haoqiang Wang, Xin Du, Mengyuan Liu, Hong Liu

On the latest HumanML3D dataset, we achieve a recall of 62. 9% for motion retrieval and 71. 5% for text retrieval (both based on R@10).

Cross-Modal Retrieval Retrieval +1

Part Aware Contrastive Learning for Self-Supervised Action Recognition

1 code implementation1 May 2023 Yilei Hua, Wenhan Wu, Ce Zheng, Aidong Lu, Mengyuan Liu, Chen Chen, Shiqian Wu

This paper proposes an attention-based contrastive learning framework for skeleton representation learning, called SkeAttnCLR, which integrates local similarity and global features for skeleton-based action representations.

Contrastive Learning Data Augmentation +3

Graph-Guided MLP-Mixer for Skeleton-Based Human Motion Prediction

no code implementations7 Apr 2023 Xinshun Wang, Qiongjie Cui, Chen Chen, Shen Zhao, Mengyuan Liu

In recent years, Graph Convolutional Networks (GCNs) have been widely used in human motion prediction, but their performance remains unsatisfactory.

Human motion prediction Human Pose Forecasting +1

PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation

2 code implementations CVPR 2023 Qitao Zhao, Ce Zheng, Mengyuan Liu, Pichao Wang, Chen Chen

However, in real scenarios, the performance of PoseFormer and its follow-ups is limited by two factors: (a) The length of the input joint sequence; (b) The quality of 2D joint detection.

3D Human Pose Estimation Human Dynamics

DiffMesh: A Motion-aware Diffusion-like Framework for Human Mesh Recovery from Videos

no code implementations23 Mar 2023 Ce Zheng, Xianpeng Liu, Mengyuan Liu, Tianfu Wu, Guo-Jun Qi, Chen Chen

While image-based HMR methods have achieved impressive results, they often struggle to recover humans in dynamic scenarios, leading to temporal inconsistencies and non-smooth 3D motion predictions due to the absence of human motion.

3D Human Pose Estimation Human Mesh Recovery

Feature Completion Transformer for Occluded Person Re-identification

no code implementations3 Mar 2023 Tao Wang, Mengyuan Liu, Hong Liu, Wenhao Li, Miaoju Ban, Tuanyu Guo, Yidi Li

In this paper, different from most previous works that discard the occluded region, we propose a Feature Completion Transformer (FCFormer) to implicitly complement the semantic information of occluded parts in the feature space.

Person Re-Identification

Modiff: Action-Conditioned 3D Motion Generation with Denoising Diffusion Probabilistic Models

no code implementations10 Jan 2023 Mengyi Zhao, Mengyuan Liu, Bin Ren, Shuling Dai, Nicu Sebe

Diffusion-based generative models have recently emerged as powerful solutions for high-quality synthesis in multiple domains.

Denoising

Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition

1 code implementation7 Dec 2021 Tianyu Guo, Hong Liu, Zhan Chen, Mengyuan Liu, Tao Wang, Runwei Ding

In this paper, to make better use of the movement patterns introduced by extreme augmentations, a Contrastive Learning framework utilizing Abundant Information Mining for self-supervised action Representation (AimCLR) is proposed.

Contrastive Learning Representation Learning +2

A Survey on 3D Skeleton-Based Action Recognition Using Learning Method

no code implementations14 Feb 2020 Bin Ren, Mengyuan Liu, Runwei Ding, Hong Liu

To the best of our knowledge, this research represents the first comprehensive discussion of deep learning-based action recognition using 3D skeleton data.

Action Recognition Skeleton Based Action Recognition

Deformable Pose Traversal Convolution for 3D Action and Gesture Recognition

no code implementations ECCV 2018 Junwu Weng, Mengyuan Liu, Xudong Jiang, Junsong Yuan

This deformable convolution can better utilize contextual joints for action and gesture recognition and is more robust to noisy joints.

Hand Gesture Recognition Hand-Gesture Recognition

Recognizing Human Actions as the Evolution of Pose Estimation Maps

no code implementations CVPR 2018 Mengyuan Liu, Junsong Yuan

Specifically, the evolution of pose estimation maps can be decomposed as an evolution of heatmaps, e. g., probabilistic maps, and an evolution of estimated 2D human poses, which denote the changes of body shape and body pose, respectively.

Action Recognition Multimodal Activity Recognition +3

Robust 3D Action Recognition through Sampling Local Appearances and Global Distributions

no code implementations4 Dec 2017 Mengyuan Liu, Hong Liu, Chen Chen

Then, motion and shape cues are jointly used to generate robust and distinctive spatial-temporal interest points (STIPs): motion-based STIPs and shape-based STIPs.

3D Action Recognition

Two-Stream 3D Convolutional Neural Network for Skeleton-Based Action Recognition

no code implementations23 May 2017 Hong Liu, Juanhui Tu, Mengyuan Liu

Extensive experiments on the SmartHome dataset and the large-scale NTU RGB-D dataset demonstrate that our method outperforms most of RNN-based methods, which verify the complementary property between spatial and temporal information and the robustness to noise.

Skeleton Based Action Recognition Vocal Bursts Valence Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.