1 code implementation • 14 Oct 2024 • Yue Zhang, Zhizhou Zhong, Minhao Liu, Zhaokang Chen, Bin Wu, Yubin Zeng, Chao Zhan, Yingjie He, Junxin Huang, Wenjiang Zhou
Real-time video dubbing that preserves identity consistency while achieving accurate lip synchronization remains a critical challenge.
no code implementations • 21 Feb 2023 • Kun Wang, Zi Wang, Zhang Li, Ang Su, Xichao Teng, Erting Pan, Minhao Liu, Qifeng Yu
Given the rapid development of this field, this paper presents a comprehensive survey of recent advances in oriented object detection.
no code implementations • 12 Dec 2021 • Minhao Liu, Zhijian Xu, Qiang Xu
Due to the inherently unpredictable and highly varied nature of anomalies and the lack of anomaly labels in historical data, the AD problem is typically formulated as an unsupervised learning problem.
no code implementations • ICLR 2022 • Minhao Liu, Ailing Zeng, Qiuxia Lai, Ruiyuan Gao, Min Li, Jing Qin, Qiang Xu
In this work, we propose a novel tree-structured wavelet neural network for time series signal analysis, namely T-WaveNet, by taking advantage of an inherent property of various types of signals, known as the dominant frequency range.
no code implementations • ICCV 2021 • Ailing Zeng, Xiao Sun, Lei Yang, Nanxuan Zhao, Minhao Liu, Qiang Xu
While the average prediction accuracy has been improved significantly over the years, the performance on hard poses with depth ambiguity, self-occlusion, and complex or rare poses is still far from satisfactory.
Ranked #32 on
Skeleton Based Action Recognition
on NTU RGB+D 120
1 code implementation • 7 Aug 2021 • Qiuxia Lai, Yu Li, Ailing Zeng, Minhao Liu, Hanqiu Sun, Qiang Xu
Extensive experiments show that the proposed IB-inspired spatial attention mechanism can yield attention maps that neatly highlight the regions of interest while suppressing backgrounds, and bootstrap standard DNN structures for visual recognition tasks (e. g., image classification, fine-grained recognition, cross-domain classification).
6 code implementations • 17 Jun 2021 • Minhao Liu, Ailing Zeng, Muxi Chen, Zhijian Xu, Qiuxia Lai, Lingna Ma, Qiang Xu
One unique property of time series is that the temporal relations are largely preserved after downsampling into two sub-sequences.
Ranked #1 on
Time Series Forecasting
on ETTh1 (24) Multivariate
(using extra training data)
no code implementations • 30 May 2021 • Ailing Zeng, Minhao Liu, Zhiwei Liu, Ruiyuan Gao, Jing Qin, Qiang Xu
We propose a novel solution to addressing a long-standing dilemma in the representation learning of graph neural networks (GNNs): how to effectively capture and represent useful information embedded in long-distance nodes to improve the performance of nodes with low homophily without leading to performance degradation in nodes with high homophily.
no code implementations • 10 Dec 2020 • Minhao Liu, Ailing Zeng, Qiuxia Lai, Qiang Xu
Motivated by the fact that usually a small subset of the frequency components carries the primary information for sensor data, we propose a novel tree-structured wavelet neural network for sensor data analysis, namely \emph{T-WaveNet}.
1 code implementation • ECCV 2020 • Ailing Zeng, Xiao Sun, Fuyang Huang, Minhao Liu, Qiang Xu, Stephen Lin
With the reduced dimensionality of less relevant body areas, the training set distribution within network branches more closely reflects the statistics of local poses instead of global body poses, without sacrificing information important for joint inference.
Ranked #21 on
Monocular 3D Human Pose Estimation
on Human3.6M
no code implementations • 9 Dec 2019 • Fuyang Huang, Ailing Zeng, Minhao Liu, Qiuxia Lai, Qiang Xu
In this paper, we propose a two-stage fully 3D network, namely \textbf{DeepFuse}, to estimate human pose in 3D space by fusing body-worn Inertial Measurement Unit (IMU) data and multi-view images deeply.
Ranked #5 on
3D Human Pose Estimation
on Total Capture
no code implementations • 26 Dec 2018 • Fuyang Huang, Ailing Zeng, Minhao Liu, Jing Qin, Qiang Xu
Experimental results show that the proposed structure-aware 3D hourglass network is able to achieve a mean joint error of 7. 4 mm in MSRA and 8. 9 mm in NYU datasets, respectively.