Search Results for author: Xiang-Yang Li

Found 16 papers, 2 papers with code

BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis

no code implementations30 May 2022 Yichong Leng, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo Mandic, Lei He, Xiang-Yang Li, Tao Qin, Sheng Zhao, Tie-Yan Liu

Specifically, in the first stage, the common information of the binaural audio is generated with a single-channel diffusion model conditioned on the mono audio, based on which the binaural audio is generated by a two-channel diffusion model in the second stage.

FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition

no code implementations Findings (EMNLP) 2021 Yichong Leng, Xu Tan, Rui Wang, Linchen Zhu, Jin Xu, Wenjie Liu, Linquan Liu, Tao Qin, Xiang-Yang Li, Edward Lin, Tie-Yan Liu

Although multiple candidates are generated by an ASR system through beam search, current error correction approaches can only correct one sentence at a time, failing to leverage the voting effect from multiple candidates to better detect and correct error tokens.

Automatic Speech Recognition speech-recognition

FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition

1 code implementation NeurIPS 2021 Yichong Leng, Xu Tan, Linchen Zhu, Jin Xu, Renqian Luo, Linquan Liu, Tao Qin, Xiang-Yang Li, Ed Lin, Tie-Yan Liu

A straightforward solution to reduce latency, inspired by non-autoregressive (NAR) neural machine translation, is to use an NAR sequence generation model for ASR error correction, which, however, comes at the cost of significantly increased ASR error rate.

Automatic Speech Recognition Machine Translation +2

Dataset Bias in Few-shot Image Recognition

no code implementations18 Aug 2020 Shuqiang Jiang, Yaohui Zhu, Chenlong Liu, Xinhang Song, Xiang-Yang Li, Weiqing Min

Second, we investigate performance differences on different datasets from dataset structures and different few-shot learning methods.

Few-Shot Learning

Learning to Reweight with Deep Interactions

no code implementations9 Jul 2020 Yang Fan, Yingce Xia, Lijun Wu, Shufang Xie, Weiqing Liu, Jiang Bian, Tao Qin, Xiang-Yang Li

Recently, the concept of teaching has been introduced into machine learning, in which a teacher model is used to guide the training of a student model (which will be used in real tasks) through data selection, loss function design, etc.

Image Classification Machine Translation +1

Multi-branch Attentive Transformer

1 code implementation18 Jun 2020 Yang Fan, Shufang Xie, Yingce Xia, Lijun Wu, Tao Qin, Xiang-Yang Li, Tie-Yan Liu

While the multi-branch architecture is one of the key ingredients to the success of computer vision tasks, it has not been well investigated in natural language processing, especially sequence learning tasks.

Code Generation Machine Translation +3

FenceMask: A Data Augmentation Approach for Pre-extracted Image Features

no code implementations14 Jun 2020 Pu Li, Xiang-Yang Li, Xiang Long

It is based on the 'simulation of object occlusion' strategy, which aim to achieve the balance between object occlusion and information retention of the input data.

Data Augmentation Fine-Grained Visual Categorization

Review of Text Style Transfer Based on Deep Learning

no code implementations6 May 2020 Xiang-Yang Li, Guo Pu, Keyu Ming, Pu Li, Jie Wang, Yuxuan Wang

In the traditional text style transfer model, the text style is generally relied on by experts knowledge and hand-designed rules, but with the application of deep learning in the field of natural language processing, the text style transfer method based on deep learning Started to be heavily researched.

Natural Language Processing Style Transfer +1

Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling

no code implementations8 Feb 2020 Mu Yuan, Lan Zhang, Xiang-Yang Li, Hui Xiong

With limited computing resources and stringent delay, given a data stream and a collection of applicable resource-hungry deep-learning models, we design a novel approach to adaptively schedule a subset of these models to execute on each data item, aiming to maximize the value of the model output (e. g., the number of high-confidence labels).

Image Retrieval Management

Weighted Laplacian and Its Theoretical Applications

no code implementations23 Nov 2019 Shijie Xu, Jiayan Fang, Xiang-Yang Li

In this paper, we develop a novel weighted Laplacian method, which is partially inspired by the theory of graph Laplacian, to study recent popular graph problems, such as multilevel graph partitioning and balanced minimum cut problem, in a more convenient manner.

graph partitioning

Multifaceted Analysis of Fine-Tuning in Deep Model for Visual Recognition

no code implementations11 Jul 2019 Xiang-Yang Li, Luis Herranz, Shuqiang Jiang

In this paper, we introduce and systematically investigate several factors that influence the performance of fine-tuning for visual recognition.

Unsupervised Pivot Translation for Distant Languages

no code implementations ACL 2019 Yichong Leng, Xu Tan, Tao Qin, Xiang-Yang Li, Tie-Yan Liu

In this work, we introduce unsupervised pivot translation for distant languages, which translates a language to a distant language through multiple hops, and the unsupervised translation on each hop is relatively easier than the original direct translation.

Machine Translation Translation

Learning to Teach

no code implementations ICLR 2018 Yang Fan, Fei Tian, Tao Qin, Xiang-Yang Li, Tie-Yan Liu

Teaching plays a very important role in our society, by spreading human knowledge and educating our next generations.

BIG-bench Machine Learning Image Classification

Scene recognition with CNNs: objects, scales and dataset bias

no code implementations CVPR 2016 Luis Herranz, Shuqiang Jiang, Xiang-Yang Li

Thus, adapting the feature extractor to each particular scale (i. e. scale-specific CNNs) is crucial to improve recognition, since the objects in the scenes have their specific range of scales.

Scene Recognition

Towards Distribution-Free Multi-Armed Bandits with Combinatorial Strategies

no code implementations20 Jul 2013 Xiang-Yang Li, Shaojie Tang, Yaqin Zhou

At each decision epoch, we select a strategy, i. e., a subset of RVs, subject to arbitrary constraints on constituent RVs.

Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.