Search Results for author: Xiaoyu Chen

Found 50 papers, 8 papers with code

HW-TSC’s Submissions to the WMT21 Biomedical Translation Task

no code implementations WMT (EMNLP) 2021 Hao Yang, Zhanglin Wu, Zhengzhe Yu, Xiaoyu Chen, Daimeng Wei, Zongyao Li, Hengchao Shang, Minghan Wang, Jiaxin Guo, Lizhi Lei, Chuanfei Xu, Min Zhang, Ying Qin

This paper describes the submission of Huawei Translation Service Center (HW-TSC) to WMT21 biomedical translation task in two language pairs: Chinese↔English and German↔English (Our registered team name is HuaweiTSC).


Ensemble Active Learning by Contextual Bandits for AI Incubation in Manufacturing

no code implementations10 Oct 2023 Yingyan Zeng, Xiaoyu Chen, Ran Jin

It is challenging but important to save annotation efforts in streaming data acquisition to maintain data quality for supervised learning base learners.

Active Learning Decision Making +1

Near-Optimal Quantum Coreset Construction Algorithms for Clustering

no code implementations5 Jun 2023 Yecheng Xue, Xiaoyu Chen, Tongyang Li, Shaofeng H. -C. Jiang

$k$-Clustering in $\mathbb{R}^d$ (e. g., $k$-median and $k$-means) is a fundamental machine learning problem.


Text Style Transfer Back-Translation

1 code implementation2 Jun 2023 Daimeng Wei, Zhanglin Wu, Hengchao Shang, Zongyao Li, Minghan Wang, Jiaxin Guo, Xiaoyu Chen, Zhengzhe Yu, Hao Yang

To address this issue, we propose Text Style Transfer Back Translation (TST BT), which uses a style transfer model to modify the source side of BT data.

Data Augmentation Domain Adaptation +4

Asking Before Action: Gather Information in Embodied Decision Making with Language Models

no code implementations25 May 2023 Xiaoyu Chen, Shenao Zhang, Pushi Zhang, Li Zhao, Jianyu Chen

With strong capabilities of reasoning and a generic understanding of the world, Large Language Models (LLMs) have shown great potential in building versatile embodied decision making agents capable of performing diverse tasks.

Imitation Learning

Towards Generalizable Reinforcement Learning for Trade Execution

no code implementations12 May 2023 Chuheng Zhang, Yitong Duan, Xiaoyu Chen, Jianyu Chen, Jian Li, Li Zhao

To evaluate our algorithms, we also implement a carefully designed simulator based on historical limit order book (LOB) data to provide a high-fidelity benchmark for different algorithms.

Offline RL reinforcement-learning +1

TBFormer: Two-Branch Transformer for Image Forgery Localization

1 code implementation25 Feb 2023 Yaqi Liu, Binbin Lv, Xin Jin, Xiaoyu Chen, Xiaokun Zhang

In this paper, we propose a Transformer-style network with two feature extraction branches for image forgery localization, and it is named as Two-Branch Transformer (TBFormer).

Vocal Bursts Valence Prediction

An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context

no code implementations24 Dec 2022 Xiaoyu Chen, Xiangming Zhu, Yufeng Zheng, Pushi Zhang, Li Zhao, Wenxue Cheng, Peng Cheng, Yongqiang Xiong, Tao Qin, Jianyu Chen, Tie-Yan Liu

One of the key challenges in deploying RL to real-world applications is to adapt to variations of unknown environment contexts, such as changing terrains in robotic tasks and fluctuated bandwidth in congestion control.

Energy Efficiency Optimization of Intelligent Reflective Surface-assisted Terahertz-RSMA System

no code implementations21 Nov 2022 Xiaoyu Chen, Feng Yan, Menghan Hu, Zihuai Lin

This paper examines the energy efficiency optimization problem of intelligent reflective surface (IRS)-assisted multi-user rate division multiple access (RSMA) downlink systems under terahertz propagation.

On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness

no code implementations19 Oct 2022 Haotian Ye, Xiaoyu Chen, LiWei Wang, Simon S. Du

Generalization in Reinforcement Learning (RL) aims to learn an agent during training that generalizes to the target environment.

Reinforcement Learning (RL)

Bayesian Sparse Regression for Mixed Multi-Responses with Application to Runtime Metrics Prediction in Fog Manufacturing

no code implementations10 Oct 2022 Xiaoyu Chen, Xiaoning Kang, Ran Jin, Xinwei Deng

In this work, we propose a Bayesian sparse regression for multivariate mixed responses to enhance the prediction of runtime performance metrics and to enable the statistical inferences.

Variable Selection

Flow-based Recurrent Belief State Learning for POMDPs

no code implementations23 May 2022 Xiaoyu Chen, Yao Mu, Ping Luo, Shengbo Li, Jianyu Chen

Furthermore, we show that the learned belief states can be plugged into downstream RL algorithms to improve performance.

Decision Making Variational Inference

Distributional Reinforcement Learning for Multi-Dimensional Reward Functions

no code implementations NeurIPS 2021 Pushi Zhang, Xiaoyu Chen, Li Zhao, Wei Xiong, Tao Qin, Tie-Yan Liu

To fully inherit the benefits of distributional RL and hybrid reward architectures, we introduce Multi-Dimensional Distributional DQN (MD3QN), which extends distributional RL to model the joint return distribution from multiple reward sources.

Distributional Reinforcement Learning reinforcement-learning +1

Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver

no code implementations ICLR 2022 Xiaoyu Chen, Jiachen Hu, Lin F. Yang, LiWei Wang

In particular, we take a plug-in solver approach, where we focus on learning a model in the exploration phase and demand that \emph{any planning algorithm} on the learned model can give a near-optimal policy.

Model-based Reinforcement Learning Reinforcement Learning (RL)

WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition

1 code implementation7 Oct 2021 BinBin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di wu, Zhendong Peng

In this paper, we present WenetSpeech, a multi-domain Mandarin corpus consisting of 10000+ hours high-quality labeled speech, 2400+ hours weakly labeled speech, and about 10000 hours unlabeled speech, with 22400+ hours in total.

Label Error Detection Optical Character Recognition +5

The $f$-Divergence Reinforcement Learning Framework

no code implementations24 Sep 2021 Chen Gong, Qiang He, Yunpeng Bai, Zhou Yang, Xiaoyu Chen, Xinwen Hou, Xianjie Zhang, Yu Liu, Guoliang Fan

In FRL, the policy evaluation and policy improvement phases are simultaneously performed by minimizing the $f$-divergence between the learning policy and sampling policy, which is distinct from conventional DRL algorithms that aim to maximize the expected cumulative rewards.

Decision Making Mathematical Proofs +2

LDC-VAE: A Latent Distribution Consistency Approach to Variational AutoEncoders

no code implementations22 Sep 2021 Xiaoyu Chen, Chen Gong, Qiang He, Xinwen Hou, Yu Liu

Variational autoencoders (VAEs), as an important aspect of generative models, have received a lot of research interests and reached many successful applications.

Image Generation

Adaptively Weighted Top-N Recommendation for Organ Matching

no code implementations23 Jul 2021 Parshin Shojaee, Xiaoyu Chen, Ran Jin

Because of the shortage, organ matching decision is the most critical decision to assign the limited viable organs to the most suitable patients.

Decision Making

Certifiably Robust Interpretation via Renyi Differential Privacy

no code implementations4 Jul 2021 Ao Liu, Xiaoyu Chen, Sijia Liu, Lirong Xia, Chuang Gan

The advantages of our Renyi-Robust-Smooth (RDP-based interpretation method) are three-folds.

U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition

no code implementations10 Jun 2021 Di wu, BinBin Zhang, Chao Yang, Zhendong Peng, Wenjing Xia, Xiaoyu Chen, Xin Lei

On the experiment of AISHELL-1, we achieve a 4. 63\% character error rate (CER) with a non-streaming setup and 5. 05\% with a streaming setup with 320ms latency by U2++.

Data Augmentation speech-recognition +1

Friedel Oscillations of Vortex Bound States Under Extreme Quantum Limit in KCa2Fe4As4F2

no code implementations24 Feb 2021 Xiaoyu Chen, Wen Duan, Xinwei Fan, Wenshan Hong, Kailun Chen, Huan Yang, Shiliang Li, Huiqian Luo, Hai-Hu Wen

We report the observation of discrete vortex bound states with the energy levels deviating from the widely believed ratio of 1:3:5 in the vortices of an iron based superconductor KCa2Fe4As4F2 through scanning tunneling microcopy (STM).

Superconductivity Strongly Correlated Electrons

Single particle tunneling spectroscopy and superconducting gaps in layered iron based superconductor KCa$_{2}$Fe$_{4}$As$_{4}$F$_{2}$

no code implementations17 Feb 2021 Wen Duan, Kailun Chen, Wenshan Hong, Xiaoyu Chen, Huan Yang, Shiliang Li, Huiqian Luo, Hai-Hu Wen

On the second type of surface which is rarely obtained, the fully gapped feature can still be observed on the tunneling spectra, although multiple gaps are obtained either from a single spectrum or separate ones, and the gap values determined from coherence peaks locate mainly in the range from 4 to 8 meV.


Near-optimal Representation Learning for Linear Bandits and Linear RL

no code implementations8 Feb 2021 Jiachen Hu, Xiaoyu Chen, Chi Jin, Lihong Li, LiWei Wang

This paper studies representation learning for multi-task linear bandits and multi-task episodic RL with linear value function approximation.

Representation Learning

WeNet: Production oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit

3 code implementations2 Feb 2021 Zhuoyuan Yao, Di wu, Xiong Wang, BinBin Zhang, Fan Yu, Chao Yang, Zhendong Peng, Xiaoyu Chen, Lei Xie, Xin Lei

In this paper, we propose an open source, production first, and production ready speech recognition toolkit called WeNet in which a new two-pass approach is implemented to unify streaming and non-streaming end-to-end (E2E) speech recognition in a single model.

speech-recognition Speech Recognition

Modeling Method for the Coupling Relations of Microgrid Cyber-Physical Systems Driven by Hybrid Spatiotemporal Events

no code implementations1 Feb 2021 Xiaoyong Bo, Xiaoyu Chen, Huashun Li, Yunchang Dong, Zhaoyang Qu, Lei Wang, Yang Li

Considering the constraints of the temporal conversion of information flow and energy flow, a microgrid CPS coupling model is established, the effectiveness of which is verified by simulating false data injection attack (FDIA) scenarios.

Decision Making

Compositional Prototype Network with Multi-view Comparision for Few-Shot Point Cloud Semantic Segmentation

no code implementations28 Dec 2020 Xiaoyu Chen, Chi Zhang, Guosheng Lin, Jing Han

Moreover, when we use our network to handle the long-tail problem in a fully supervised point cloud segmentation dataset, it can also effectively boost the performance of the few-shot classes.

Few-Shot Learning Point Cloud Segmentation +2

Relation Extraction with Contextualized Relation Embedding (CRE)

1 code implementation EMNLP (DeeLIO) 2020 Xiaoyu Chen, Rohan Badlani

This paper proposes an architecture for the relation extraction task that integrates semantic information with knowledge base modeling in a novel manner.

Entity Embeddings Relation Extraction

Efficient Reinforcement Learning in Factored MDPs with Application to Constrained RL

no code implementations ICLR 2021 Xiaoyu Chen, Jiachen Hu, Lihong Li, Li-Wei Wang

The regret of FMDP-BF is shown to be exponentially smaller than that of optimal algorithms designed for non-factored MDPs, and improves on the best previous result for FMDPs~\citep{osband2014near} by a factored of $\sqrt{H|\mathcal{S}_i|}$, where $|\mathcal{S}_i|$ is the cardinality of the factored state subspace and $H$ is the planning horizon.

reinforcement-learning Reinforcement Learning (RL)

Noisy Agents: Self-supervised Exploration by Predicting Auditory Events

no code implementations27 Jul 2020 Chuang Gan, Xiaoyu Chen, Phillip Isola, Antonio Torralba, Joshua B. Tenenbaum

Humans integrate multiple sensory modalities (e. g. visual and audio) to build a causal understanding of the physical world.

Atari Games Reinforcement Learning (RL)

(Locally) Differentially Private Combinatorial Semi-Bandits

no code implementations ICML 2020 Xiaoyu Chen, Kai Zheng, Zixin Zhou, Yunchang Yang, Wei Chen, Li-Wei Wang

In this paper, we study Combinatorial Semi-Bandits (CSB) that is an extension of classic Multi-Armed Bandits (MAB) under Differential Privacy (DP) and stronger Local Differential Privacy (LDP) setting.

Multi-Armed Bandits Privacy Preserving

High Sensitivity Snapshot Spectrometer Based on Deep Network Unmixing

no code implementations29 Jun 2019 XiaoYu Chen, Xu Wang, Lianfa Bai, Jing Han, Zhuang Zhao

In this paper, we present a convolution neural network based method to recover the light intensity distribution from the overlapped dispersive spectra instead of adding an extra light path to capture it directly for the first time.

Vocal Bursts Intensity Prediction

Non-destructive three-dimensional measurement of hand vein based on self-supervised network

no code implementations29 Jun 2019 Xiaoyu Chen, Qixin Wang, Jinzhou Ge, Yi Zhang, Jing Han

At present, supervised stereo methods based on deep neural network have achieved impressive results.

Distributed Bandit Learning: Near-Optimal Regret with Efficient Communication

no code implementations ICLR 2020 Yuanhao Wang, Jiachen Hu, Xiaoyu Chen, Li-Wei Wang

We study the problem of regret minimization for distributed bandits learning, in which $M$ agents work collaboratively to minimize their total regret under the coordination of a central server.

Multi-Armed Bandits

Passive TCP Identification for Wired and WirelessNetworks: A Long-Short Term Memory Approach

no code implementations9 Apr 2019 Xiaoyu Chen, Shugong Xu, Xudong Chen, Shan Cao, Shunqing Zhang, Yanzan Sun

TCP congestion control algorithm identification (TCP identification) can be used to significantly improve network efficiency.

BIG-bench Machine Learning

Residual Pyramid Learning for Single-Shot Semantic Segmentation

1 code implementation23 Mar 2019 Xiaoyu Chen, Xiaotian Lou, Lianfa Bai, Jing Han

In this paper, we put forward a method for single-shot segmentation in a feature residual pyramid network (RPNet), which learns the main and residuals of segmentation by decomposing the label at different levels of residual blocks.

Segmentation Semantic Segmentation

Discrete Potts Model for Generating Superpixels on Noisy Images

no code implementations20 Mar 2018 Ruobing Shen, Xiaoyu Chen, Xiangrui Zheng, Gerhard Reinelt

Many computer vision applications, such as object recognition and segmentation, increasingly build on superpixels.

Denoising Object Recognition +2

The Spaces of Data, Information, and Knowledge

no code implementations6 Nov 2014 Xiaoyu Chen, Dongming Wang

We study the data space $D$ of any given data set $X$ and explain how functions and relations are defined over $D$.


Automated Generation of Geometric Theorems from Images of Diagrams

no code implementations6 Jun 2014 Xiaoyu Chen, Dan Song, Dongming Wang

We propose an approach to generate geometric theorems from electronic images of diagrams automatically.


Cannot find the paper you are looking for? You can Submit a new open access paper.