1 code implementation • journal 2023 • Qin Cheng, Jun Cheng, Ziliang Ren, Qieshi Zhang, Jianming Liu
Unifying the MSST module, a multi-scale spatial–temporal convolutional neural network (MSSTNet) is proposed to capture high-level spatial–temporal semantic features for action recognition.
Ranked #2 on Skeleton Based Action Recognition on UAV-Human
no code implementations • 19 Apr 2023 • Yao Huang, Jianming Liu
We first enhance the correlation features between the support set image and the query image using a bidirectional cross-attention module.
1 code implementation • 8 May 2020 • Yong Xu, Meng Yu, Shi-Xiong Zhang, Lian-Wu Chen, Chao Weng, Jianming Liu, Dong Yu
Purely neural network (NN) based speech separation and enhancement methods, although they can achieve good objective scores, inevitably cause nonlinear speech distortions that are harmful to automatic speech recognition (ASR).
Audio and Speech Processing • Sound