no code implementations • 16 Feb 2023 • Jing Xu, Dandan song, Chong Liu, Siu Cheung Hui, Fei Li, Qiang Ju, Xiaonan He, Jian Xie
In this paper, we propose a Dialogue State Distillation Network (DSDN) to utilize relevant information of previous dialogue states and migrate the gap of utilization between training and testing.
1 code implementation • COLING 2022 • Changzhi Zhou, Dandan song, Jing Xu, Zhijing Wu
Our framework can model complicated relations between emotions and causes while avoiding generating the pairing matrix (the leading cause of the label sparsity problem).
no code implementations • 1 Nov 2021 • Long He, Dandan song, Liang Zheng
We define the classification task where classes have characteristics above and the flat classes and the base classes are organized hierarchically as hierarchical image classification.
1 code implementation • 2 Aug 2021 • Tianlong Kong, Shouyi Yin, Dawei Zhang, Wang Geng, Xin Wang, Dandan song, Jinwen Huang, Huiyu Shi, Xiaorui Wang
To address this issue, we propose a new architecture, named dynamic multi-scale convolution, which consists of dynamic kernel convolution, local multi-scale learning, and global multi-scale pooling.
no code implementations • ACL 2021 • Fei Li, Zheng Wang, Siu Cheung Hui, Lejian Liao, Dandan song, Jing Xu, Guoxiu He, Meihuizi Jia
Although the existing Named Entity Recognition (NER) models have achieved promising performance, they suffer from certain drawbacks.
no code implementations • 13 Dec 2020 • Dandan song, Siyi Ma, Zhanchen Sun, Sicheng Yang, Lejian Liao
To develop machine with cognition-level visual understanding and reasoning abilities, the visual commonsense reasoning (VCR) task has been introduced.
Ranked #4 on
Visual Question Answering (VQA)
on VCR (Q-AR) test
Visual Commonsense Reasoning
Visual Question Answering (VQA)
no code implementations • 11 Aug 2020 • Xi Chen, Songyang Zhang, Dandan song, Peng Ouyang, Shouyi Yin
To demonstrate our proposed speech transformer with a bidirectional decoder(STBD), we conduct extensive experiments on the AISHELL-1 dataset.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+3
no code implementations • 25 Dec 2019 • Yi Liu, Tianyu Liang, Can Xu, Xianwei Zhang, Xianhong Chen, Wei-Qiang Zhang, Liang He, Dandan song, Ruyun Li, Yangcheng Wu, Peng Ouyang, Shouyi Yin
This paper describes the systems submitted by the department of electronic engineering, institute of microelectronics of Tsinghua university and TsingMicro Co. Ltd. (THUEE) to the NIST 2019 speaker recognition evaluation CTS challenge.
no code implementations • 11 Dec 2019 • Xi Chen, Shouyi Yin, Dandan song, Peng Ouyang, Leibo Liu, Shaojun Wei
Despite the recent successes of deep neural networks, it remains challenging to achieve high precision keyword spotting task (KWS) on resource-constrained devices.
no code implementations • 19 May 2019 • Bowen Xing, Lejian Liao, Dandan song, Jingang Wang, Fuzheng Zhang, Zhongyuan Wang, He-Yan Huang
This paper proposes a novel variant of LSTM, termed as aspect-aware LSTM (AA-LSTM), which incorporates aspect information into LSTM cells in the context modeling stage before the attention mechanism.
Aspect-Based Sentiment Analysis
Aspect-Based Sentiment Analysis (ABSA)