Search Results for author: Jianqin Yin

Found 38 papers, 12 papers with code

DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation

no code implementations22 Apr 2024 Yonghao Dang, Jianqin Yin, Liyuan Liu, Yuan Sun, Yanzhu Hu, Pengxiang Ding

Multi-person pose estimation (MPPE) presents a formidable yet crucial challenge in computer vision.

Towards more realistic human motion prediction with attention to motion coordination

no code implementations4 Apr 2024 Pengxiang Ding, Jianqin Yin

However, the motion coordination, a global joint relation reflecting the simultaneous cooperation of all joints, is usually weakened because it is learned from part to whole progressively and asynchronously.

Human motion prediction motion prediction +1

A Two-stream Hybrid CNN-Transformer Network for Skeleton-based Human Interaction Recognition

no code implementations31 Dec 2023 Ruoqi Yin, Jianqin Yin

Specifically, Transformer-based stream integrates 3D convolutions with multi-head self-attention to learn inter-token correlations; We propose a new multi-branch CNN framework for CNN-based streams that automatically learns joint spatio-temporal features from skeleton sequences.

Human Interaction Recognition Specificity

Spatial-Temporal Decoupling Contrastive Learning for Skeleton-based Human Action Recognition

2 code implementations23 Dec 2023 Shaojie Zhang, Jianqin Yin, Yonghao Dang

Furthermore, to explicitly exploit the latent data distributions, we employ the attentive features to contrastive learning, which models the cross-sequence semantic relations by pulling together the features from the positive pairs and pushing away the negative pairs.

Action Recognition Contrastive Learning +2

BiHRNet: A Binary high-resolution network for Human Pose Estimation

no code implementations17 Nov 2023 Zhicheng Zhang, Xueyao Sun, Yonghao Dang, Jianqin Yin

On the challenging of COCO dataset, the proposed method enables the binary neural network to achieve 70. 8 mAP, which is better than most tested lightweight full-precision networks.

Binarization Pose Estimation

SoccerNet 2023 Challenges Results

2 code implementations12 Sep 2023 Anthony Cioppa, Silvio Giancola, Vladimir Somers, Floriane Magera, Xin Zhou, Hassan Mkhallati, Adrien Deliège, Jan Held, Carlos Hinojosa, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdullah Kamal, Adrien Maglo, Albert Clapés, Amr Abdelaziz, Artur Xarles, Astrid Orcesi, Atom Scott, Bin Liu, Byoungkwon Lim, Chen Chen, Fabian Deuser, Feng Yan, Fufu Yu, Gal Shitrit, Guanshuo Wang, Gyusik Choi, Hankyul Kim, Hao Guo, Hasby Fahrudin, Hidenari Koguchi, Håkan Ardö, Ibrahim Salah, Ido Yerushalmy, Iftikar Muhammad, Ikuma Uchida, Ishay Be'ery, Jaonary Rabarisoa, Jeongae Lee, Jiajun Fu, Jianqin Yin, Jinghang Xu, Jongho Nang, Julien Denize, Junjie Li, Junpei Zhang, Juntae Kim, Kamil Synowiec, Kenji Kobayashi, Kexin Zhang, Konrad Habel, Kota Nakajima, Licheng Jiao, Lin Ma, Lizhi Wang, Luping Wang, Menglong Li, Mengying Zhou, Mohamed Nasr, Mohamed Abdelwahed, Mykola Liashuha, Nikolay Falaleev, Norbert Oswald, Qiong Jia, Quoc-Cuong Pham, Ran Song, Romain Hérault, Rui Peng, Ruilong Chen, Ruixuan Liu, Ruslan Baikulov, Ryuto Fukushima, Sergio Escalera, Seungcheon Lee, Shimin Chen, Shouhong Ding, Taiga Someya, Thomas B. Moeslund, Tianjiao Li, Wei Shen, Wei zhang, Wei Li, Wei Dai, Weixin Luo, Wending Zhao, Wenjie Zhang, Xinquan Yang, Yanbiao Ma, Yeeun Joo, Yingsen Zeng, Yiyang Gan, Yongqiang Zhu, Yujie Zhong, Zheng Ruan, Zhiheng Li, Zhijian Huang, Ziyu Meng

More information on the tasks, challenges, and leaderboards are available on https://www. soccer-net. org.

Action Spotting Camera Calibration +3

Physics-constrained Attack against Convolution-based Human Motion Prediction

1 code implementation21 Jun 2023 Chengxu Duan, Zhicheng Zhang, Xiaoli Liu, Yonghao Dang, Jianqin Yin

Specifically, we introduce a novel adaptable scheme that facilitates the attack to suit the scale of the target pose and two physical constraints to enhance the naturalness of the adversarial example.

Adversarial Attack Adversarial Robustness +2

Target-Aware Spatio-Temporal Reasoning via Answering Questions in Dynamics Audio-Visual Scenarios

1 code implementation21 May 2023 Yuanyuan Jiang, Jianqin Yin

Recent works rely on elaborate target-agnostic parsing of audio-visual scenes for spatial grounding while mistreating audio and video as separate entities for temporal grounding.

Audio-visual Question Answering Audio-Visual Question Answering (AVQA) +3

SDVRF: Sparse-to-Dense Voxel Region Fusion for Multi-modal 3D Object Detection

no code implementations17 Apr 2023 Binglu Ren, Jianqin Yin

To solve these two problems, we present a new concept, Voxel Region (VR), which is obtained by projecting the sparse local point clouds in each voxel dynamically.

3D Object Detection Autonomous Driving +1

An Improved Baseline Framework for Pose Estimation Challenge at ECCV 2022 Visual Perception for Navigation in Human Environments Workshop

no code implementations13 Mar 2023 Jiajun Fu, Yonghao Dang, Ruoqi Yin, Shaojie Zhang, Feng Zhou, Wending Zhao, Jianqin Yin

This technical report describes our first-place solution to the pose estimation challenge at ECCV 2022 Visual Perception for Navigation in Human Environments Workshop.

Human Detection Pose Estimation

Instance-incremental Scene Graph Generation from Real-world Point Clouds via Normalizing Flows

no code implementations21 Feb 2023 Chao Qi, Jianqin Yin, Jinghang Xu, Pengxiang Ding

This work introduces a new task of instance-incremental scene graph generation: Given a scene of the point cloud, representing it as a graph and automatically increasing novel instances.

Graph Generation Scene Graph Generation

An end-to-end multi-scale network for action prediction in videos

no code implementations31 Dec 2022 Xiaofa Liu, Jianqin Yin, Yuan Sun, Zhicheng Zhang, Jin Tang

Unlike most existing methods with offline feature generation, our method directly takes frames as input and further models motion evolution on two different temporal scales. Therefore, we solve the complexity problems of the two stages of modeling and the problem of insufficient temporal and spatial information of a single scale.

Leveraging the Video-level Semantic Consistency of Event for Audio-visual Event Localization

1 code implementation11 Oct 2022 Yuanyuan Jiang, Jianqin Yin, Yonghao Dang

In contrast to existing methods, we propose a novel video-level semantic consistency guidance network for the AVE localization task.

audio-visual event localization

Kinematics Modeling Network for Video-based Human Pose Estimation

no code implementations22 Jul 2022 Yonghao Dang, Jianqin Yin, Shaojie Zhang, Jiping Liu, Yanzhu Hu

In this work, we propose a plug-and-play kinematics modeling module (KMM) to explicitly model temporal correlations between joints across different frames by calculating their temporal similarity.

Optical Flow Estimation Pose Estimation

Deeply Supervised Skin Lesions Diagnosis with Stage and Branch Attention

2 code implementations9 May 2022 Wei Dai, Rui Liu, Tianyi Wu, Min Wang, Jianqin Yin, Jun Liu

Visual features of skin lesions vary significantly because the images are collected from patients with different lesion colours and morphologies by using dissimilar imaging equipment.

Classification

Learning Constrained Dynamic Correlations in Spatiotemporal Graphs for Motion Prediction

1 code implementation4 Apr 2022 Jiajun Fu, Fuxing Yang, Yonghao Dang, Xiaoli Liu, Jianqin Yin

The key of DSTD-GC is constrained dynamic correlation modeling, which explicitly parameterizes the common static constraints as a spatial/temporal vanilla adjacency matrix shared by all frames/joints and dynamically extracts correspondence variances for each frame/joint with an adjustment modeling function.

Human motion prediction motion prediction

Rich Action-semantic Consistent Knowledge for Early Action Prediction

1 code implementation23 Jan 2022 Xiaoli Liu, Jianqin Yin, Di Guo, Huaping Liu

Next, we build a bi-directional semantic graph for the teacher network and a single-directional semantic graph for the student network to model rich ASCK among partial videos.

Early Action Prediction

Neighborhood Spatial Aggregation MC Dropout for Efficient Uncertainty-aware Semantic Segmentation in Point Clouds

no code implementations5 Dec 2021 Chao Qi, Jianqin Yin

Specifically, the NSA-MC dropout samples the model many times through a space-dependent way, outputting point-wise distribution by aggregating stochastic inference results of neighbors.

Model Optimization Semantic Segmentation

Real-World Semantic Grasp Detection Based on Attention Mechanism

no code implementations20 Nov 2021 Mingshuai Dong, Shimin Wei, Jianqin Yin, Xiuli Yu

And we also design a target feature attention mechanism to guide the model focus on the features of target object ontology for grasp prediction according to the semantic information.

Object

Amodal segmentation just like doing a jigsaw

no code implementations15 Jul 2021 Xunli Zeng, Jianqin Yin

This jigsaw method can better model the occlusion relationship and use the occlusion context information, which is important for amodal segmentation.

Instance Segmentation Segmentation +1

Uncertainty-aware Human Motion Prediction

no code implementations8 Jul 2021 Pengxiang Ding, Jianqin Yin

It is far more enough for current approaches in actual scenarios because people can't know how to interact with the machine without the evaluation of prediction, and unreliable predictions may mislead the machine to harm the human.

Human motion prediction motion prediction

Relation-Based Associative Joint Location for Human Pose Estimation in Videos

1 code implementation8 Jul 2021 Yonghao Dang, Jianqin Yin, Shaojie Zhang

Moreover, the JRE can infer invisible joints according to the relationship between joints, which is beneficial for the model to locate occluded joints.

Pose Estimation Relation

An Attractor-Guided Neural Networks for Skeleton-Based Human Motion Prediction

no code implementations20 May 2021 Pengxiang Ding, Junying Wang, Jianqin Yin

However, the global coordination of all joints, which reflects human motion's balance property, is usually weakened because it is learned from part to whole progressively and asynchronously.

Human motion prediction motion prediction

Temporal Consistency Two-Stream CNN for Human Motion Prediction

no code implementations11 Apr 2021 Jin Tang, Jin Zhang, Jianqin Yin

In this paper, we propose a novel temporal fusion (TF) module to fuse the two-stream joints' information to predict human motion, including a temporal concatenation and a reinforcement trajectory spatial-temporal (TST) block, specifically designed to keep prediction temporal consistency.

Human motion prediction motion prediction +2

Mask-GD Segmentation Based Robotic Grasp Detection

no code implementations20 Jan 2021 Mingshuai Dong, Shimin Wei, Xiuli Yu, Jianqin Yin

MASK is a segmented image that only contains the pixels of the target object.

Robotics

Multi-grained Trajectory Graph Convolutional Networks for Habit-unrelated Human Motion Prediction

no code implementations23 Dec 2020 Jin Liu, Jianqin Yin

A multi-grained trajectory graph convolutional networks based and lightweight framework is proposed for habit-unrelated human motion prediction.

Computational Efficiency Human motion prediction +1

SDMTL: Semi-Decoupled Multi-grained Trajectory Learning for 3D human motion prediction

no code implementations11 Oct 2020 Xiaoli Liu, Jianqin Yin

Predicting future human motion is critical for intelligent robots to interact with humans in the real world, and human motion has the nature of multi-granularity.

Human motion prediction motion prediction

DeepSSM: Deep State-Space Model for 3D Human Motion Prediction

1 code implementation25 May 2020 Xiaoli Liu, Jianqin Yin, Huaping Liu, Jun Liu

In contrast to prior works, we improve the multi-order modeling ability of human motion systems for more accurate predictions by building a deep state-space model (DeepSSM).

Human motion prediction motion prediction

Energy-based Periodicity Mining with Deep Features for Action Repetition Counting in Unconstrained Videos

no code implementations15 Mar 2020 Jianqin Yin, Yanchun Wu, Huaping Liu, Yonghao Dang, Zhiyi Liu, Jun Liu

Our work features two-fold: 1) An important insight that deep features extracted for action recognition can well model the self-similarity periodicity of the repetitive action is presented.

Action Recognition

TrajectoryNet: a new spatio-temporal feature learning network for human motion prediction

no code implementations15 Oct 2019 Xiaoli Liu, Jianqin Yin, Jin Liu, Pengxiang Ding, Jun Liu, Huaping Liu

And the global temporal co-occurrence features represent the co-occurrence relationship that different subsequences in a complex motion sequence are appeared simultaneously, which can be obtained automatically with our proposed TrajectoryNet by reorganizing the temporal information as the depth dimension of the input tensor.

Human motion prediction motion prediction +1

PISEP^2: Pseudo Image Sequence Evolution based 3D Pose Prediction

no code implementations arXiv:1909.01818 2019 Xiaoli Liu, Jianqin Yin, Huaping Liu, Yilong Yin

Specifically, a skeletal representation is proposed by transforming the joint coordinate sequence into an image sequence, which can model the different correlations of different joints.

Computational Efficiency Pose Prediction

DWnet: Deep-Wide Network for 3D Action Recognition

no code implementations29 Aug 2019 Yonghao Dang, Fuxing Yang, Jianqin Yin

We propose in this paper a deep-wide network (DWnet) which combines the deep structure with the broad learning system (BLS) to recognize actions.

3D Action Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.