Search Results for author: Kai Yu

Found 71 papers, 19 papers with code

Decoupled Dialogue Modeling and Semantic Parsing for Multi-Turn Text-to-SQL

no code implementations4 Jun 2021 Zhi Chen, Lu Chen, Hanqi Li, Ruisheng Cao, Da Ma, Mengyue Wu, Kai Yu

A dual learning approach is also proposed for the utterance rewrite model to address the data sparsity problem.

Semantic Parsing SQL Parsing +1

ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser

no code implementations NAACL 2021 Zhi Chen, Lu Chen, Yanbin Zhao, Ruisheng Cao, Zihan Xu, Su Zhu, Kai Yu

Given a database schema, Text-to-SQL aims to translate a natural language question into the corresponding SQL query.

Semantic Parsing Text-To-Sql

Quantum Dimensionality Reduction by Linear Discriminant Analysis

no code implementations4 Mar 2021 Kai Yu, Gong-De Guo, Song Lin

In this paper, we present a quantum algorithm and a quantum circuit to efficiently perform linear discriminant analysis (LDA) for dimensionality reduction.

Dimensionality Reduction Quantum Machine Learning Quantum Physics

LET: Linguistic Knowledge Enhanced Graph Transformer for Chinese Short Text Matching

1 code implementation25 Feb 2021 Boer Lyu, Lu Chen, Su Zhu, Kai Yu

Additionally, we adopt the word lattice graph as input to maintain multi-granularity information.

Text Matching

Rich Prosody Diversity Modelling with Phone-level Mixture Density Network

2 code implementations1 Feb 2021 Chenpeng Du, Kai Yu

Generating natural speech with diverse and smooth prosody pattern is a challenging task.

Speech Synthesis Text-To-Speech Synthesis Sound

WebSRC: A Dataset for Web-Based Structural Reading Comprehension

1 code implementation23 Jan 2021 Lu Chen, Xingyu Chen, Zihan Zhao, Danyang Zhang, Jiabao Ji, Ao Luo, Yuxuan Xiong, Kai Yu

This task requires a system not only to understand the semantics of texts but also the structure of the web page.

Reading Comprehension

Towards duration robust weakly supervised sound event detection

1 code implementation19 Jan 2021 Heinrich Dinkel, Mengyue Wu, Kai Yu

Our model outperforms other approaches on the DCASE2018 and URBAN-SED datasets without requiring prior duration knowledge.

Data Augmentation Sound Event Detection Sound Audio and Speech Processing

A relic sketch extraction framework based on detail-aware hierarchical deep network

no code implementations17 Jan 2021 Jinye Peng, Jiaxin Wang, Jun Wang, Erlei Zhang, Qunxi Zhang, Yongqin Zhang, Xianlin Peng, Kai Yu

For the fine extraction stage, we design a new multiscale U-Net (MSU-Net) to effectively remove disease noise and refine the sketch.

Edge Detection Transfer Learning

A 3D Non-stationary MmWave Channel Model for Vacuum Tube Ultra-High-Speed Train Channels

no code implementations17 Jan 2021 YingJie Xu, Kai Yu, Li Li, Xianfu Lei, Li Hao, Cheng-Xiang Wang

As a potential development direction of future transportation, the vacuum tube ultra-high-speed train (UHST) wireless communication systems have newly different channel characteristics from existing high-speed train (HST) scenarios.

An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models

no code implementations14 Oct 2020 Zihan Zhao, Yuncong Liu, Lu Chen, Qi Liu, Rao Ma, Kai Yu

Recently, pre-trained language models like BERT have shown promising performance on multiple natural language processing tasks.


Deep Reinforcement Learning for On-line Dialogue State Tracking

no code implementations22 Sep 2020 Zhi Chen, Lu Chen, Xiang Zhou, Kai Yu

To the best of our knowledge, this is the first effort to optimize the DST module within DRL framework for on-line task-oriented spoken dialogue systems.

Dialogue Management Dialogue State Tracking +1

CREDIT: Coarse-to-Fine Sequence Generation for Dialogue State Tracking

no code implementations22 Sep 2020 Zhi Chen, Lu Chen, Zihan Xu, Yanbin Zhao, Su Zhu, Kai Yu

In dialogue systems, a dialogue state tracker aims to accurately find a compact representation of the current dialogue status, based on the entire dialogue history.

Dialogue State Tracking

Dual Learning for Dialogue State Tracking

no code implementations22 Sep 2020 Zhi Chen, Lu Chen, Yanbin Zhao, Su Zhu, Kai Yu

In task-oriented multi-turn dialogue systems, dialogue state refers to a compact representation of the user goal in the context of dialogue history.

Dialogue State Tracking

Structured Hierarchical Dialogue Policy with Graph Neural Networks

no code implementations22 Sep 2020 Zhi Chen, Xiaoyuan Liu, Lu Chen, Kai Yu

A novel ComNet is proposed to model the structure of a hierarchical agent.

Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management

no code implementations22 Sep 2020 Zhi Chen, Lu Chen, Xiaoyuan Liu, Kai Yu

The task-oriented spoken dialogue system (SDS) aims to assist a human user in accomplishing a specific task (e. g., hotel booking).

Decision Making Dialogue Management

Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding

1 code implementation21 Sep 2020 Su Zhu, Ruisheng Cao, Lu Chen, Kai Yu

Few-shot slot tagging becomes appealing for rapid domain transfer and adaptation, motivated by the tremendous development of conversational dialogue systems.

Few-Shot Learning Natural Language Understanding +2

Future Vector Enhanced LSTM Language Model for LVCSR

no code implementations31 Jul 2020 Qi Liu, Yanmin Qian, Kai Yu

For the speech recognition rescoring, although the proposed LSTM LM obtains very slight gains, the new model seems obtain the great complementary with the conventional LSTM LM.

Language Modelling Large Vocabulary Continuous Speech Recognition +1

An Investigation on Deep Learning with Beta Stabilizer

no code implementations31 Jul 2020 Qi Liu, Tian Tan, Kai Yu

It is concluded that beta stabilizer parameters can reduce the sensitivity of learning rate with almost the same performance on DNN with relu activation function and LSTM.

Handwriting Recognition Speech Recognition

Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding

1 code implementation24 May 2020 Chen Liu, Su Zhu, Zijian Zhao, Ruisheng Cao, Lu Chen, Kai Yu

In this paper, a novel BERT based SLU model (WCN-BERT SLU) is proposed to encode WCNs and the dialogue context jointly.

Spoken Language Understanding

Semi-Supervised Text Simplification with Back-Translation and Asymmetric Denoising Autoencoders

no code implementations30 Apr 2020 Yanbin Zhao, Lu Chen, Zhi Chen, Kai Yu

When modeling simple and complex sentences with autoencoders, we introduce different types of noise into the training process.

Denoising Language Modelling +3

Dual Learning for Semi-Supervised Natural Language Understanding

2 code implementations26 Apr 2020 Su Zhu, Ruisheng Cao, Kai Yu

The framework is composed of dual pseudo-labeling and dual learning method, which enables an NLU model to make full use of data (labeled and unlabeled) through a closed-loop of the primal and dual tasks.

Natural Language Understanding

Voice activity detection in the wild via weakly supervised sound event detection

1 code implementation27 Mar 2020 Heinrich Dinkel, Yefei Chen, Mengyue Wu, Kai Yu

We proposed two GPVAD models, one full (GPV-F), trained on 527 Audioset sound events, and one binary (GPV-B), only distinguishing speech and noise.

Sound Audio and Speech Processing

Semantic Parsing with Dual Learning

1 code implementation ACL 2019 Ruisheng Cao, Su Zhu, Chen Liu, Jieyu Li, Kai Yu

Semantic parsing converts natural language queries into structured logical forms.

Semantic Parsing

Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

no code implementations18 Jun 2019 Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu

The proposed approach can achieve the state-of-the-art performance, with 25% ~ 30% equal error rate (EER) reduction on both tasks when compared to strong baselines using cross entropy loss with softmax, obtaining 2. 238% EER on VoxCeleb1 test set and 2. 761% EER on SITW core-core test set, respectively.

Speaker Recognition

Audio Caption in a Car Setting with a Sentence-Level Loss

1 code implementation31 May 2019 Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu

Captioning has attracted much attention in image and video understanding while a small amount of work examines audio captioning.

Audio captioning Semantic Similarity +4

AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning

no code implementations27 May 2019 Lu Chen, Zhi Chen, Bowen Tan, Sishan Long, Milica Gasic, Kai Yu

Experiments show that AgentGraph models significantly outperform traditional reinforcement learning approaches on most of the 18 tasks of the PyDial benchmark.

Dialogue Management Multi-agent Reinforcement Learning +1

A Hierarchical Decoding Model For Spoken Language Understanding From Unaligned Data

1 code implementation9 Apr 2019 Zijian Zhao, Su Zhu, Kai Yu

In the paper, we focus on spoken language understanding from unaligned data whose annotation is a set of act-slot-value triples.

Hierarchical structure Spoken Language Understanding

Duration robust sound event detection

1 code implementation8 Apr 2019 Heinrich Dinkel, Kai Yu

Task 4 of the Dcase2018 challenge demonstrated that substantially more research is needed for a real-world application of sound event detection.

Sound Audio and Speech Processing

Text-based depression detection on sparse data

1 code implementation8 Apr 2019 Heinrich Dinkel, Mengyue Wu, Kai Yu

Previous text-based depression detection is commonly based on large user-generated data.

Depression Detection Word Embeddings

Audio Caption: Listen and Tell

1 code implementation25 Feb 2019 Mengyue Wu, Heinrich Dinkel, Kai Yu

A baseline encoder-decoder model is provided for both English and Mandarin.

General Classification

End-to-End Monaural Multi-speaker ASR System without Pretraining

no code implementations5 Nov 2018 Xuankai Chang, Yanmin Qian, Kai Yu, Shinji Watanabe

The experiments demonstrate that the proposed methods can improve the performance of the end-to-end model in separating the overlapping speech and recognizing the separated streams.

automatic-speech-recognition Speech Recognition +1

Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting

no code implementations2 Aug 2018 Zhehuai Chen, Yanmin Qian, Kai Yu

The few studies on sequence discriminative training for KWS are limited for fixed vocabulary or LVCSR based methods and have not been compared to the state-of-the-art deep learning based KWS approaches.

Keyword Spotting Large Vocabulary Continuous Speech Recognition +1

Structured Dialogue Policy with Graph Neural Networks

no code implementations COLING 2018 Lu Chen, Bowen Tan, Sishan Long, Kai Yu

The proposed structured deep reinforcement learning is based on graph neural networks (GNN), which consists of some sub-networks, each one for a node on a directed graph.

Decision Making Dialogue Management +2

Binarized LSTM Language Model

no code implementations NAACL 2018 Xuan Liu, Di Cao, Kai Yu

Although excellent performance is obtained for large vocabulary tasks, tremendous memory consumption prohibits the use of LSTM LM in low-resource devices.

automatic-speech-recognition Language Modelling +1

On Modular Training of Neural Acoustics-to-Word Model for LVCSR

no code implementations3 Mar 2018 Zhehuai Chen, Qi Liu, Hao Li, Kai Yu

Finally, modules are integrated into an acousticsto-word model (A2W) and jointly optimized using acoustic data to retain the advantage of sequence modeling.

automatic-speech-recognition Language Modelling +2

Affordable On-line Dialogue Policy Learning

no code implementations EMNLP 2017 Cheng Chang, Runzhe Yang, Lu Chen, Xiang Zhou, Kai Yu

The key to building an evolvable dialogue system in real-world scenarios is to ensure an affordable on-line dialogue policy learning, which requires the on-line learning process to be safe, efficient and economical.

Dialogue Management

Concept Transfer Learning for Adaptive Language Understanding

no code implementations WS 2018 Su Zhu, Kai Yu

Concept definition is important in language understanding (LU) adaptation since literal definition difference can easily lead to data sparsity even if different data sets are actually semantically correlated.

Domain Adaptation Transfer Learning

On-line Dialogue Policy Learning with Companion Teaching

no code implementations EACL 2017 Lu Chen, Runzhe Yang, Cheng Chang, Zihao Ye, Xiang Zhou, Kai Yu

On-line dialogue policy learning is the key for building evolvable conversational agent in real world scenarios.

Dialogue Management

A Large-scale Distributed Video Parsing and Evaluation Platform

no code implementations29 Nov 2016 Kai Yu, Yang Zhou, Da Li, Zhang Zhang, Kaiqi Huang

Visual surveillance systems have become one of the largest data sources of Big Visual Data in real world.

Weakly-supervised Learning of Mid-level Features for Pedestrian Attribute Recognition and Localization

no code implementations17 Nov 2016 Kai Yu, Biao Leng, Zhang Zhang, Dangwei Li, Kaiqi Huang

Based on GoogLeNet, firstly, a set of mid-level attribute features are discovered by novelly designed detection layers, where a max-pooling based weakly-supervised object detection technique is used to train these layers with only image-level labels without the need of bounding box annotations of pedestrian attributes.

Multi-Label Image Classification Pedestrian Attribute Recognition +1

Encoder-decoder with Focus-mechanism for Sequence Labelling Based Spoken Language Understanding

no code implementations6 Aug 2016 Su Zhu, Kai Yu

This paper investigates the framework of encoder-decoder with attention for sequence labelling based spoken language understanding.

Speech Recognition Spoken Language Understanding

Text Flow: A Unified Text Detection System in Natural Scene Images

no code implementations ICCV 2015 Shangxuan Tian, Yifeng Pan, Chang Huang, Shijian Lu, Kai Yu, Chew Lim Tan

With character candidates detected by cascade boosting, the min-cost flow network model integrates the last three sequential steps into a single process which solves the error accumulation problem at both character level and text line level effectively.

Scene Text Scene Text Detection +1

On Training Bi-directional Neural Network Language Model with Noise Contrastive Estimation

no code implementations19 Feb 2016 Tianxing He, Yu Zhang, Jasha Droppo, Kai Yu

We propose to train bi-directional neural network language model(NNLM) with noise contrastive estimation(NCE).

Language Modelling

Bidirectional LSTM-CRF Models for Sequence Tagging

19 code implementations9 Aug 2015 Zhiheng Huang, Wei Xu, Kai Yu

It can also use sentence level tag information thanks to a CRF layer.

Chunking NER +1

Recurrent Polynomial Network for Dialogue State Tracking

no code implementations14 Jul 2015 Kai Sun, Qizhe Xie, Kai Yu

Dialogue state tracking (DST) is a process to estimate the distribution of the dialogue states as a dialogue progresses.

Dialogue State Tracking

Deep Multiple Instance Learning for Image Classification and Auto-Annotation

no code implementations CVPR 2015 Jiajun Wu, Yinan Yu, Chang Huang, Kai Yu

The recent development in learning deep representations has demonstrated its wide applications in traditional vision tasks like classification and detection.

Classification General Classification +2

High-dimensional Joint Sparsity Random Effects Model for Multi-task Learning

no code implementations26 Sep 2013 Krishnakumar Balasubramanian, Kai Yu, Tong Zhang

The traditional convex formulation employs the group Lasso relaxation to achieve joint sparsity across tasks.

Multi-Task Learning

Large Scale Strongly Supervised Ensemble Metric Learning, with Applications to Face Verification and Retrieval

1 code implementation25 Dec 2012 Chang Huang, Shenghuo Zhu, Kai Yu

Learning Mahanalobis distance metrics in a high- dimensional feature space is very difficult especially when structural sparsity and low rank are enforced to improve com- putational efficiency in testing phase.

Face Verification Metric Learning

Deep Coding Network

no code implementations NeurIPS 2010 Yuanqing Lin, Tong Zhang, Shenghuo Zhu, Kai Yu

This paper proposes a principled extension of the traditional single-layer flat sparse coding scheme, where a two-layer coding scheme is derived based on theoretical analysis of nonlinear functional approximation that extends recent results for local coordinate coding.

Nonlinear Learning using Local Coordinate Coding

no code implementations NeurIPS 2009 Kai Yu, Tong Zhang, Yihong Gong

This paper introduces a new method for semi-supervised learning on high dimensional nonlinear manifolds, which includes a phase of unsupervised basis learning and a phase of supervised function learning.

Stochastic Relational Models for Large-scale Dyadic Data using MCMC

no code implementations NeurIPS 2008 Shenghuo Zhu, Kai Yu, Yihong Gong

Stochastic relational models provide a rich family of choices for learning and predicting dyadic data between two sets of entities.

Bayesian Inference Collaborative Filtering

Deep Learning with Kernel Regularization for Visual Recognition

no code implementations NeurIPS 2008 Kai Yu, Wei Xu, Yihong Gong

In this paper we focus on training deep neural networks for visual recognition tasks.

Predictive Matrix-Variate t Models

no code implementations NeurIPS 2007 Shenghuo Zhu, Kai Yu, Yihong Gong

It is becoming increasingly important to learn from a partially-observed random matrix and predict its missing elements.

Missing Elements Model Selection

Gaussian Process Models for Link Analysis and Transfer Learning

no code implementations NeurIPS 2007 Kai Yu, Wei Chu

In this paper we develop a Gaussian process (GP) framework to model a collection of reciprocal random variables defined on the \emph{edges} of a network.

Link Prediction Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.