no code implementations • 21 Nov 2022 • Yingxue Xu, Guihua Wen, Yang Hu, Pei Yang
Compared with the ground distance of the conventional domain-level OT, the image-level OT captures structural associations among local regions of images that are beneficial to classification.
no code implementations • 27 Dec 2021 • Mengjian Zhang, Guihua Wen
A swarm intelligence-based optimization algorithm, named Duck Swarm Algorithm (DSA), is proposed in this study, which is inspired by the searching for food sources and foraging behaviors of the duck swarm.
no code implementations • NeurIPS 2021 • Jiawei Chen, Xu Tan, Yichong Leng, Jin Xu, Guihua Wen, Tao Qin, Tie-Yan Liu
Experiments on LJSpeech datasets demonstrate that Speech-T 1) is more robust than the attention based autoregressive TTS model due to its inherent monotonic alignments between text and speech; 2) naturally supports streaming TTS with good voice quality; and 3) enjoys the benefit of joint modeling TTS and ASR in a single network.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+3
no code implementations • 11 Jun 2021 • Yang Hu, Adriane Chapman, Guihua Wen, Dame Wendy Hall
Supervised machine learning has several drawbacks that make it difficult to use in many situations.
no code implementations • 8 Jun 2020 • Yang Hu, Guihua Wen, Adriane Chapman, Pei Yang, Mingnan Luo, Yingxue Xu, Dan Dai, Wendy Hall
Zero-shot learning uses semantic attributes to connect the search space of unseen objects.
no code implementations • 13 May 2020 • Yingxue Xu, Guihua Wen, Yang Hu, Mingnan Luo, Dan Dai, Yishan Zhuang, Wendy Hall
Finally, a new framework for Chinese herbal recognition is proposed as a new application of APN.
no code implementations • 22 Apr 2019 • Mingnan Luo, Guihua Wen, Yang Hu, Dan Dai, Yingxue Xu
Global Average Pooling (GAP) is used by default on the channel-wise attention mechanism to extract channel descriptors.
1 code implementation • 22 Apr 2019 • Yang Hu, Guihua Wen, Mingnan Luo, Dan Dai, Wenming Cao, Zhiwen Yu, Wendy Hall
To deal with these problems, a novel Inner-Imaging architecture is proposed in this paper, which allows relationships between channels to meet the above requirement.
no code implementations • 23 Dec 2018 • Yingxue Xu, Guihua Wen, Yang Hu, Mingnan Luo, Dan Dai, Yishan Zhuang
According to the characteristics of herbal images, we proposed the competitive attentional fusion pyramid networks to model the features of herbal image, which mdoels the relationship of feature maps from different levels, and re-weights multi-level channels with channel-wise attention mechanism.
no code implementations • 17 Dec 2018 • Huiqiang Liao, Guihua Wen, Yang Hu, Changjun Wang
In order to mine features from different granularities of faces, we design a multi-scale convolutional neural network based on three-grained face, which mines the patient's face information from the organs, local regions, and the entire face.
1 code implementation • 24 Jul 2018 • Yang Hu, Guihua Wen, Mingnan Luo, Dan Dai, Jiajiong Ma, Zhiwen Yu
In this work, we propose a competitive squeeze-excitation (SE) mechanism for the residual network.
no code implementations • 1 Mar 2018 • Jiajiong Ma, Guihua Wen, Yang Hu, Tianyuan Chang, Haibin Zeng, Lijun Jiang, Jianzeng Qin
To evaluate the performance of our proposed method, we conduct experiments on three sizes of tongue datasets, in which deep convolutional neural network method and traditional digital image analysis method are respectively applied to extract features for tongue images.
no code implementations • 1 Mar 2018 • Tianyuan Chang, Guihua Wen, Yang Hu, Jiajiong Ma
Facial expression recognition (FER) has always been a challenging issue in computer vision.
no code implementations • 23 Jan 2018 • Yang Hu, Guihua Wen, Huiqiang Liao, Changjun Wang, Dan Dai, Zhiwen Yu
In order to adapt to the tongue image in a variety of photographic environments and construct herbal prescriptions, a neural network framework for prescription construction is designed.
no code implementations • 7 Apr 2017 • Dan Wang, He-Yan Huang, Chi Lu, Bo-Si Feng, Liqiang Nie, Guihua Wen, Xian-Ling Mao
Specifically, we define a novel similarity formula for hierarchical labeled data by weighting each layer, and design a deep convolutional neural network to obtain a hash code for each data point.
no code implementations • 7 Apr 2017 • Yi-Kun Tang, Xian-Ling Mao, He-Yan Huang, Guihua Wen
Recently, topic modeling has been widely used to discover the abstract topics in text corpora.