no code implementations • 19 Oct 2023 • Lijuan Zhou, Xiang Meng, Zhihuan Liu, Mengqi Wu, Zhimin Gao, Pichao Wang
This paper presents a comprehensive survey of pose-based applications utilizing deep learning, encompassing pose estimation, pose tracking, and action recognition. Pose estimation involves the determination of human joint positions from images or image sequences.
no code implementations • IEEE Transactions on Cybernetics 2023 • Lisha Cui, Pei Lv, Xiaoheng Jiang, Zhimin Gao, Bing Zhou, Luming Zhang, Ling Shao, Mingliang Xu
State-of-the-art object detectors usually progressively downsample the input image until it is represented by small feature maps, which loses the spatial information and compromises the representation of small objects.
Ranked #1 on
Traffic Sign Detection
on TT100K
no code implementations • 6 Oct 2022 • Zhimin Gao, Peitao Wang, Pei Lv, Xiaoheng Jiang, Qidong Liu, Pichao Wang, Mingliang Xu, Wanqing Li
Besides, these methods directly calculate the pair-wise global self-attention equally for all the joints in both the spatial and temporal dimensions, undervaluing the effect of discriminative local joints and the short-range temporal dynamics.
1 code implementation • 21 Sep 2022 • Zihui Guo, Yonghong Hou, Pichao Wang, Zhimin Gao, Mingliang Xu, Wanqing Li
It has been studied either using first person vision (FPV) or third person vision (TPV).
1 code implementation • 13 Nov 2021 • Shuangyan Miao, Yonghong Hou, Zhimin Gao, Mingliang Xu, Wanqing Li
This paper proposes a new graph convolutional operator called central difference graph convolution (CDGC) for skeleton based action recognition.
no code implementations • 14 Jan 2021 • Mudabbir Kaleem, Keshav Kasichainula, Rabimba Karanjai, Lei Xu, Zhimin Gao, Lin Chen, Weidong Shi
This paper presents EDSC, a novel smart contract platform design based on the event-driven execution model as opposed to the traditionally employed transaction-driven execution model.
Distributed, Parallel, and Cluster Computing
no code implementations • 5 Jan 2021 • Xiangyu Li, Yonghong Hou, Pichao Wang, Zhimin Gao, Mingliang Xu, Wanqing Li
In this paper, we propose a \textbf{Tr}ansformer-based RGB-D \textbf{e}gocentric \textbf{a}ction \textbf{r}ecognition framework, called Trear.
no code implementations • 8 Dec 2020 • Xiangyu Li, Yonghong Hou, Pichao Wang, Zhimin Gao, Mingliang Xu, Wanqing Li
In this paper, we propose a method consisting of two camera pose estimators that deal with the information from pairwise images and a short sequence of images respectively.
no code implementations • 7 Nov 2018 • Lin Chen, Lei Xu, Shouhuai Xu, Zhimin Gao, Weidong Shi
In this paper, we introduce a novel variant of the bribery problem, "Election with Bribed Voter Uncertainty" or BVU for short, accommodating the uncertainty that the vote of a bribed voter may or may not be counted.
1 code implementation • 18 May 2018 • Lisha Cui, Rui Ma, Pei Lv, Xiaoheng Jiang, Zhimin Gao, Bing Zhou, Mingliang Xu
The performance of small object detection, however, is still less than satisfactory because of the deficiency of semantic information on shallow feature maps.
no code implementations • 17 Mar 2018 • Pichao Wang, Wanqing Li, Zhimin Gao, Chang Tang, Philip Ogunbona
This paper proposes three simple, compact yet effective representations of depth sequences, referred to respectively as Dynamic Depth Images (DDI), Dynamic Depth Normal Images (DDNI) and Dynamic Depth Motion Normal Images (DDMNI), for both isolated and continuous action recognition.
1 code implementation • 23 Dec 2017 • Han He, Lei Wu, Xiaokun Yang, Hua Yan, Zhimin Gao, Yi Feng, George Townsend
To build a concrete study and substantiate the efficiency of our neural architecture, we take Chinese Word Segmentation as a research case example.
1 code implementation • 7 Dec 2017 • Han He, Lei Wu, Hua Yan, Zhimin Gao, Yi Feng, George Townsend
We present a simple yet elegant solution to train a single joint model on multi-criteria corpora for Chinese Word Segmentation (CWS).
no code implementations • CVPR 2017 • Pichao Wang, Wanqing Li, Zhimin Gao, Yuyao Zhang, Chang Tang, Philip Ogunbona
Based on the scene flow vectors, we propose a new representation, namely, Scene Flow to Action Map (SFAM), that describes several long term spatio-temporal dynamics for action recognition.
Ranked #3 on
Hand Gesture Recognition
on ChaLearn val
no code implementations • 7 Jan 2017 • Pichao Wang, Wanqing Li, Song Liu, Zhimin Gao, Chang Tang, Philip Ogunbona
This paper proposes three simple, compact yet effective representations of depth sequences, referred to respectively as Dynamic Depth Images (DDI), Dynamic Depth Normal Images (DDNI) and Dynamic Depth Motion Normal Images (DDMNI).
Ranked #2 on
Hand Gesture Recognition
on ChaLearn val
no code implementations • 22 Aug 2016 • Pichao Wang, Wanqing Li, Song Liu, Yuyao Zhang, Zhimin Gao, Philip Ogunbona
This paper addresses the problem of continuous gesture recognition from sequences of depth maps using convolutional neutral networks (ConvNets).
no code implementations • IEEE Transactions on Human-Machine Systems 2016 2015 • Pichao Wang, Wanqing Li, Zhimin Gao, Jing Zhang, Chang Tang, Philip Ogunbona
In addition, the method was evaluated on the large dataset constructed from the above datasets.
Ranked #9 on
Multimodal Activity Recognition
on EV-Action
no code implementations • 10 Apr 2015 • Zhimin Gao, Lei Wang, Luping Zhou, Jianjia Zhang
Efficient Human Epithelial-2 (HEp-2) cell image classification can facilitate the diagnosis of many autoimmune diseases.
no code implementations • 20 Jan 2015 • Pichao Wang, Wanqing Li, Zhimin Gao, Jing Zhang, Chang Tang, Philip Ogunbona
The results show that our approach can achieve state-of-the-art results on the individual datasets and without dramatical performance degradation on the Combined Dataset.
no code implementations • 14 Sep 2014 • Pichao Wang, Wanqing Li, Philip Ogunbona, Zhimin Gao, Hanling Zhang
These parts are referred to as Frequent Local Parts or FLPs.