1 code implementation • 20 Jul 2023 • Weidong Chen, Xiaofen Xing, Peihao Chen, Xiangmin Xu
Although PTMs shed new light on artificial general intelligence, they are constructed with general tasks in mind, and thus, their efficacy for specific tasks can be further improved.
no code implementations • 14 Mar 2023 • Tengjun Liu, Yansong Chua, Yiwei Zhang, Yuxiao Ning, Pengfu Liu, Guihua Wan, Zijun Wan, Shaomin Zhang, Weidong Chen
Despite its better bio-plausibility, goal-driven spiking neural network (SNN) has not achieved applicable performance for classifying biological spike trains, and showed little bio-functional similarities compared to traditional artificial neural networks.
1 code implementation • 3 Mar 2023 • Shuaiqi Chen, Xiaofen Xing, Weibin Zhang, Weidong Chen, Xiangmin Xu
Self-attention mechanism is applied within windows for capturing temporal important information locally in a fine-grained way.
1 code implementation • 27 Feb 2023 • Weidong Chen, Xiaofen Xing, Xiangmin Xu, Jianxin Pang, Lan Du
Paralinguistic speech processing is important in addressing many issues, such as sentiment and neurocognitive disorder analyses.
Ranked #1 on
Speech Emotion Recognition
on LSSED
1 code implementation • 26 Jul 2022 • Weidong Chen, Dexiang Hong, Yuankai Qi, Zhenjun Han, Shuhui Wang, Laiyun Qing, Qingming Huang, Guorong Li
To address this problem, we propose a multi-attention network which consists of dual-path dual-attention module and a query-based cross-modal Transformer module.
Ranked #5 on
Referring Expression Segmentation
on A2D Sentences
Referring Expression Segmentation
Referring Video Object Segmentation
+2
1 code implementation • 1 Apr 2021 • Guangming Wang, Hesheng Wang, Yiling Liu, Weidong Chen
A new unsupervised learning method of depth and ego-motion using multiple masks from monocular video is proposed in this paper.
1 code implementation • 30 Jan 2021 • Weiquan Fan, Xiangmin Xu, Xiaofen Xing, Weidong Chen, DongYan Huang
Speech emotion recognition is a vital contributor to the next generation of human-computer interaction (HCI).
Ranked #3 on
Speech Emotion Recognition
on LSSED
1 code implementation • 16 Sep 2020 • Hanjiang Hu, Hesheng Wang, Zhe Liu, Weidong Chen
Visual localization is a crucial component in the application of mobile robot and autonomous driving.
no code implementations • 30 Mar 2020 • Xiyi Wei, Yu-Tian Xiao, Jian Wang, Rui Chen, Wei zhang, Yue Yang, Daojun Lv, Chao Qin, Di Gu, Bo Zhang, Weidong Chen, Jianquan Hou, Ninghong Song, Guohua Zeng, Shancheng Ren
Objective: To conduct a meta-analysis of current studies that examined sex differences in severity and mortality in patients with COVID-19, and identify potential mechanisms underpinning these differences.
1 code implementation • 23 Sep 2019 • Hanjiang Hu, Hesheng Wang, Zhe Liu, Chenguang Yang, Weidong Chen, Le Xie
To retrieve a target image from the database, the query image is first encoded using the encoder belonging to the query domain to obtain a domain-invariant feature vector.
1 code implementation • 7 Jan 2019 • Baoyuan Wu, Weidong Chen, Yanbo Fan, Yong Zhang, Jinlong Hou, Jie Liu, Tong Zhang
In this work, we propose to train CNNs from images annotated with multiple tags, to enhance the quality of visual representation of the trained CNN model.
no code implementations • CVPR 2018 • Baoyuan Wu, Weidong Chen, Peng Sun, Wei Liu, Bernard Ghanem, Siwei Lyu
In D2IA, we generate a relevant and distinct tag subset, in which the tags are relevant to the image contents and semantically distinct to each other, using sequential sampling from a determinantal point process (DPP) model.
no code implementations • 1 Mar 2017 • Nevrez Imamoglu, Zhixuan Wei, Huangjun Shi, Yuki Yoshida, Myagmarbayar Nergui, Jose Gonzalez, Dongyun Gu, Weidong Chen, Kenzo Nonami, Wenwei Yu
Saliency computation has become a popular research field for many applications due to the useful information provided by saliency maps.