Search Results for author: Yun Chen

Found 74 papers, 29 papers with code

Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation

1 code implementation ACL 2022 Guanhua Chen, Shuming Ma, Yun Chen, Dongdong Zhang, Jia Pan, Wenping Wang, Furu Wei

When applied to zero-shot cross-lingual abstractive summarization, it produces an average performance gain of 12.3 ROUGE-L over mBART-ft. We conduct detailed analyses to understand the key ingredients of SixT+, including multilinguality of the auxiliary parallel data, positional disentangled encoder, and the cross-lingual transferability of its encoder.

Abstractive Text Summarization Cross-Lingual Abstractive Summarization +6

Towards Bridging the Reward-Generation Gap in Direct Alignment Algorithms

no code implementations 11 Jun 2025 Zeguan Xiao, Yun Chen, Guanhua Chen

In this paper, we find a contributor to the reward-generation gap is the mismatch between the inherent importance of prefix tokens during the LLM generation process and how this importance is reflected in the implicit reward functions of DAAs.

Pi-SQL: Enhancing Text-to-SQL with Fine-Grained Guidance from Pivot Programming Languages

no code implementations 1 Jun 2025 Yongdong Chi, Hanqing Wang, Zonghan Yang, Jian Yang, Xiao Yan, Yun Chen, Guanhua Chen

In this work, we propose Pi-SQL, which incorporates the high-resource Python program as a pivot to bridge between the natural language query and SQL program.

Text to SQL Text-To-SQL +1

AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion

no code implementations 28 May 2025 Junqi Zhao, Jinzheng Zhao, Haohe Liu, Yun Chen, Lu Han, Xubo Liu, Mark Plumbley, Wenwu Wang

Diffusion models have significantly improved the quality and diversity of audio generation but are hindered by slow inference speed.

AudioCaps Audio Generation +1

TAG-INSTRUCT: Controlled Instruction Complexity Enhancement through Structure-based Augmentation

no code implementations 24 May 2025 He Zhu, Zhiwen Ruan, Junyou Su, Xingwei He, Wenjia Zhang, Yun Chen, Guanhua Chen

High-quality instruction data is crucial for developing large language models (LLMs), yet existing approaches struggle to effectively control instruction complexity.

Semantic Compression TAG

ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs

no code implementations 17 Apr 2025 Yan Yang, Yixia Li, Hongru Wang, Xuetao Wei, Jianqiao Yu, Yun Chen, Guanhua Chen

With the proliferation of task-specific large language models, delta compression has emerged as a method to mitigate the resource challenges of deploying numerous such models by effectively compressing the delta model parameters.

Model Compression Quantization

A Hybrid Model/Data-Driven Solution to Channel, Position and Orientation Tracking in mmWave Vehicular Systems

no code implementations 7 Mar 2025 Yun Chen, Nuria González-Prelcic, Takayuki Shimizu, Chinmay Mahabal

A second network named VP-ChAT (Vehicle Position-Channel Attention for position Tracking) refines the geometric position estimate.

Position

LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment Strategy

1 code implementation 17 Feb 2025 Zhiwen Ruan, Yixia Li, He Zhu, Longyue Wang, Weihua Luo, Kaifu Zhang, Yun Chen, Guanhua Chen

Despite being pretrained on multilingual corpora, large language models (LLMs) exhibit suboptimal performance on low-resource languages.

GenAssets: Generating in-the-wild 3D Assets in Latent Space

no code implementations CVPR 2025 Ze Yang, Jingkang Wang, Haowei Zhang, Sivabalan Manivasagam, Yun Chen, Raquel Urtasun

High-quality 3D assets for traffic participants are critical for multi-sensor simulation, which is essential for the safe end-to-end development of autonomy.

Neural Rendering

Revisiting Generative Policies: A Simpler Reinforcement Learning Algorithmic Perspective

1 code implementation 2 Dec 2024 Jinouwen Zhang, Rongkun Xue, Yazhe Niu, Yun Chen, Jing Yang, Hongsheng Li, Yu Liu

However, existing works exhibit significant variations in training schemes and RL optimization objectives, and some methods are only applicable to diffusion models.

Density Estimation Offline RL +3

Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions

no code implementations 15 Nov 2024 Yutao Hou, Yajing Luo, Zhiwen Ruan, Hongru Wang, Weifeng Ge, Yun Chen, Guanhua Chen

In this paper, we introduce Compound Question Synthesis (CQ-Syn) to create the Compound-QA benchmark, focusing on compound questions with multiple sub-questions.

TSI: A Multi-View Representation Learning Approach for Time Series Forecasting

1 code implementation 30 Sep 2024 Wentao Gao, Ziqi Xu, Jiuyong Li, Lin Liu, Jixue Liu, Thuc Duy Le, Debo Cheng, Yanchang Zhao, Yun Chen

With the growing demand for long-sequence time-series forecasting in real-world applications, such as electricity consumption planning, time series forecasting is becoming increasingly important across various domains.

Representation Learning Time Series +1

G3R: Gradient Guided Generalizable Reconstruction

no code implementations 28 Sep 2024 Yun Chen, Jingkang Wang, Ze Yang, Sivabalan Manivasagam, Raquel Urtasun

Large scale 3D scene reconstruction is important for applications such as virtual reality and simulation.

3DGS 3D Scene Reconstruction +2

Deconfounding Multi-Cause Latent Confounders: A Factor-Model Approach to Climate Model Bias Correction

no code implementations 22 Aug 2024 Wentao Gao, Jiuyong Li, Debo Cheng, Lin Liu, Jixue Liu, Thuc Duy Le, Xiaojing Du, Xiongren Chen, Yanchang Zhao, Yun Chen

This paper proposes a novel bias correction approach to utilize both GCM and observational data to learn a factor model that captures multi-cause latent confounders.

model Time Series +1

SeqAR: Jailbreak LLMs with Sequential Auto-Generated Characters

1 code implementation 2 Jul 2024 Yan Yang, Zeguan Xiao, Xin Lu, Hongru Wang, Xuetao Wei, Hailiang Huang, Guanhua Chen, Yun Chen

The widespread applications of large language models (LLMs) have brought about concerns regarding their potential misuse.

Red Teaming Safety Alignment

SeTAR: Out-of-Distribution Detection with Selective Low-Rank Approximation

1 code implementation 18 Jun 2024 Yixia Li, Boya Xiong, Guanhua Chen, Yun Chen

In this work, we propose SeTAR, a novel, training-free OOD detection method that leverages selective low-rank approximation of weight matrices in vision-language and vision-only models.

Out-of-Distribution Detection Out of Distribution (OOD) Detection

Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models

1 code implementation 13 Jun 2024 Bowen Ping, Shuo Wang, Hanqing Wang, Xu Han, Yuzhuang Xu, Yukun Yan, Yun Chen, Baobao Chang, Zhiyuan Liu, Maosong Sun

Motivated by the long-tail distribution of singular values in the delta weights, we propose a delta quantization approach using mixed-precision.

Math Quantization

MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning

1 code implementation 13 Jun 2024 Hanqing Wang, Yixia Li, Shuo Wang, Guanhua Chen, Yun Chen

It is observed that the minor matrix corresponds to the noisy or long-tail information, while the principal matrix contains important knowledge.

Math visual instruction following

Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data

no code implementations 10 May 2024 Rongyu Zhang, Yun Chen, Chenrui Wu, Fangxin Wang, Bo Li

Federated learning (FL) offers a privacy-centric distributed learning framework, enabling model training on individual clients and central aggregation without necessitating data exchange.

Autonomous Vehicles image-classification +3

Distract Large Language Models for Automatic Jailbreak Attack

1 code implementation 13 Mar 2024 Zeguan Xiao, Yan Yang, Guanhua Chen, Yun Chen

Extensive efforts have been made before the public release of large language models (LLMs) to align their behaviors with human values.

Red Teaming

OMGEval: An Open Multilingual Generative Evaluation Benchmark for Large Language Models

1 code implementation 21 Feb 2024 Meng Xu, Shuo Wang, Liner Yang, Haoyu Wang, Zhenghao Liu, Cunliang Kong, Yun Chen, Yang Liu, Maosong Sun, Erhong Yang

We evaluate several representative multilingual LLMs on the proposed OMGEval, which we believe will provide a valuable reference for the community to further understand and improve the multilingual capability of LLMs.

General Knowledge Logical Reasoning

LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks

no code implementations 18 Feb 2024 Hanqing Wang, Bowen Ping, Shuo Wang, Xu Han, Yun Chen, Zhiyuan Liu, Maosong Sun

Most prior works on LoRA combination primarily rely on task-level weights for each involved LoRA, making different examples and tokens share the same LoRA weights.

Math

Attention-based Interactive Disentangling Network for Instance-level Emotional Voice Conversion

no code implementations 29 Dec 2023 Yun Chen, Lingxiao Yang, Qi Chen, Jian-Huang Lai, Xiaohua Xie

We introduce a two-stage pipeline to effectively train our network: Stage I utilizes inter-speech contrastive learning to model fine-grained emotion and intra-speech disentanglement learning to better separate emotion and content.

Contrastive Learning Disentanglement +1

LightSim: Neural Lighting Simulation for Urban Scenes

no code implementations 11 Dec 2023 Ava Pun, Gary Sun, Jingkang Wang, Yun Chen, Ze Yang, Sivabalan Manivasagam, Wei-Chiu Ma, Raquel Urtasun

Different outdoor illumination conditions drastically alter the appearance of urban scenes, and they can harm the performance of image-based robot perception systems if not seen during training.

Reconstructing Objects in-the-wild for Realistic Sensor Simulation

no code implementations 9 Nov 2023 Ze Yang, Sivabalan Manivasagam, Yun Chen, Jingkang Wang, Rui Hu, Raquel Urtasun

In this work, we present NeuSim, a novel approach that estimates accurate geometry and realistic appearance from sparse in-the-wild data captured at distance and at limited viewpoints.

Diversity

CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation

no code implementations 2 Nov 2023 Jingkang Wang, Sivabalan Manivasagam, Yun Chen, Ze Yang, Ioan Andrei Bârsan, Anqi Joyce Yang, Wei-Chiu Ma, Raquel Urtasun

To tackle these issues, we present CADSim, which combines part-aware object-class priors via a small set of CAD models with differentiable rendering to automatically reconstruct vehicle geometry, including articulated wheels, with high-quality appearance.

3D Reconstruction

StyleBART: Decorate Pretrained Model with Style Adapters for Unsupervised Stylistic Headline Generation

1 code implementation 26 Oct 2023 Hanqing Wang, Yajing Luo, Boya Xiong, Guanhua Chen, Yun Chen

Stylistic headline generation is the task of generating a headline that not only summarizes the content of an article, but also reflects a desired style that attracts users.

Headline Generation

PACIT: Unlocking the Power of Examples for Better In-Context Instruction Tuning

1 code implementation 2 Oct 2023 Tianci Xue, Ziqi Wang, Yixia Li, Yun Chen, Guanhua Chen

Instruction tuning enhances the instruction following ability of large language models by finetuning with supervised instruction data.

Instruction Following Zero-shot Generalization

Sparse Recovery with Attention: A Hybrid Data/Model Driven Solution for High Accuracy Position and Channel Tracking at mmWave

no code implementations 26 Aug 2023 Yun Chen, Nuria González-Prelcic, Takayuki Shimizu, Hongshen Lu, Chinmay Mahabal

In this paper, we first propose a mmWave channel tracking algorithm based on the multidimensional orthogonal matching pursuit (MOMP) algorithm using reduced sparsifying dictionaries, which exploits information from channel estimates in previous frames.

Position

FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models

1 code implementation 19 Aug 2023 Xin Guo, Haotian Xia, Zhaowei Liu, Hanyang Cao, Zhi Yang, Zhiqiang Liu, Sizhe Wang, Jinyi Niu, Chuqi Wang, Yanhui Wang, Xiaolong Liang, Xiaoming Huang, Bing Zhu, Zhongyu Wei, Yun Chen, Weining Shen, Liwen Zhang

The dataset contains 8,351 questions categorized into four different key areas: Financial Academic Knowledge, Financial Industry Knowledge, Financial Security Knowledge, and Financial Agent.

Multiple-choice

UniSim: A Neural Closed-Loop Sensor Simulator

2 code implementations CVPR 2023 Ze Yang, Yun Chen, Jingkang Wang, Sivabalan Manivasagam, Wei-Chiu Ma, Anqi Joyce Yang, Raquel Urtasun

Previously recorded driving logs provide a rich resource to build these new scenarios from, but for closed loop evaluation, we need to modify the sensor data based on the new scene configuration and the SDV's decisions, as actors might be added or removed and the trajectories of existing actors and the SDV will differ from the original log.

Target Search and Navigation in Heterogeneous Robot Systems with Deep Reinforcement Learning

no code implementations 1 Aug 2023 Yun Chen, Jiaping Xiao

Collaborative heterogeneous robot systems can greatly improve the efficiency of target search and navigation tasks.

Deep Reinforcement Learning Navigate +1

mCLIP: Multilingual CLIP via Cross-lingual Transfer

1 code implementation ACL 2023 Guanhua Chen, Lu Hou, Yun Chen, Wenliang Dai, Lifeng Shang, Xin Jiang, Qun Liu, Jia Pan, Wenping Wang

Furthermore, to enhance the token- and sentence-level multilingual representation of the MTE, we propose to train it with machine translation and contrastive learning jointly before the TriKD to provide a better initialization.

Contrastive Learning Cross-Lingual Transfer +7

Learning to Localize with Attention: from sparse mmWave channel estimates from a single BS to high accuracy 3D location

no code implementations 30 Jun 2023 Yun Chen, Nuria González-Prelcic, Takayuki Shimizu, HongSheng Lu

One strategy to obtain user location information in a wireless network operating at millimeter wave (mmWave) is based on the exploitation of the geometric relationships between the channel parameters and the user position.

Position

Multilingual Sentence Transformer as A Multilingual Word Aligner

1 code implementation 28 Jan 2023 Weikang Wang, Guanhua Chen, Hanqing Wang, Yue Han, Yun Chen

In this paper, we investigate whether the multilingual sentence Transformer LaBSE is a strong multilingual word aligner.

Sentence Word Alignment +1

Lexical Complexity Controlled Sentence Generation

no code implementations 26 Nov 2022 Jinran Nie, Liner Yang, Yun Chen, Cunliang Kong, Junhui Zhu, Erhong Yang

Compared with potential solutions, our approach fuses the representations of the word complexity levels into the model to get better control of lexical complexity.

Sentence Text Generation

RIS-ADMM: A RIS and ADMM-Based Passive and Sparse Sensing Method With Interference Removal

1 code implementation 25 May 2022 Peng Chen, Zhimin Chen, Pu Miao, Yun Chen

This letter addresses the passive sensing issue utilizing wireless communication signals and RIS amidst interference from wireless access points (APs).

LitMind Dictionary: An Open-Source Online Dictionary

1 code implementation 23 Apr 2022 Cunliang Kong, Xuezhi Fang, Liner Yang, Yun Chen, Erhong Yang

Since traditional dictionaries present word senses as discrete items in predefined inventories, they fall short of flexibility, which is required in providing specific meanings of words in particular contexts.

PrivateRec: Differentially Private Training and Serving for Federated News Recommendation

no code implementations 18 Apr 2022 Ruixuan Liu, Yanlin Wang, Yang Cao, Lingjuan Lyu, Weike Pan, Yun Chen, Hong Chen

Collecting and training over sensitive personal data raise severe privacy concerns in personalized recommendation systems, and federated learning can potentially alleviate the problem by training models over decentralized user data. However, a theoretically private solution in both the training and serving stages of federated recommendation is essential but still lacking. Furthermore, naively applying differential privacy (DP) to the two stages in federated recommendation would fail to achieve a satisfactory trade-off between privacy and utility due to the high-dimensional characteristics of model gradients and hidden representations. In this work, we propose a federated news recommendation method for achieving a better utility in model training and online serving under a DP guarantee. We first clarify the DP definition over behavior data for each round in the life cycle of federated recommendation systems. Next, we propose a privacy-preserving online serving mechanism under this definition based on the idea of decomposing user embeddings with public basic vectors and perturbing the lower-dimensional combination coefficients.

Federated Learning News Recommendation +2

Joint Initial Access and Localization in Millimeter Wave Vehicular Networks: a Hybrid Model/Data Driven Approach

no code implementations 4 Apr 2022 Yun Chen, Joan Palacios, Nuria González-Prelcic, Takayuki Shimizu, HongSheng Lu

High resolution compressive channel estimation provides information for vehicle localization when a hybrid mmWave MIMO system is considered.

Multitasking Framework for Unsupervised Simple Definition Generation

2 code implementations ACL 2022 Cunliang Kong, Yun Chen, Hengyuan Zhang, Liner Yang, Erhong Yang

We demonstrate that the framework can generate relevant, simple definitions for the target words through automatic and manual evaluations on English and Chinese datasets.

SDOA-Net: An Efficient Deep Learning-Based DOA Estimation Network for Imperfect Array

2 code implementations 19 Mar 2022 Peng Chen, Zhimin Chen, Liang Liu, Yun Chen, Xianbin Wang

The estimation of direction of arrival (DOA) is a crucial issue in conventional radar, wireless communication, and integrated sensing and communication (ISAC) systems.

Integrated sensing and communication ISAC +1

Deep Learning-based Link Configuration for Radar-aided Multiuser mmWave Vehicle-to-Infrastructure Communication

no code implementations 12 Jan 2022 Andrew Graff, Yun Chen, Nuria González-Prelcic, Takayuki Shimizu

Then, a deep network is used to translate features of these radar spatial covariances into features of the communication spatial covariances, by learning the intricate mapping between radar and communication channels, in both line-of-sight and non-line-of-sight settings.

YACLC: A Chinese Learner Corpus with Multidimensional Annotation

1 code implementation 30 Dec 2021 Yingying Wang, Cunliang Kong, Liner Yang, Yijun Wang, Xiaorong Lu, Renfen Hu, Shan He, Zhenghao Liu, Yun Chen, Erhong Yang, Maosong Sun

This resource is of great relevance for second language acquisition research, foreign-language teaching, and automatic grammatical error correction.

Grammatical Error Correction Language Acquisition +1

Radar Aided mmWave Vehicle-to-Infrastructure Link Configuration Using Deep Learning

no code implementations 16 Nov 2021 Yun Chen, Andrew Graff, Nuria González-Prelcic, Takayuki Shimizu

In this paper, we obtain prior information to speed up the beam training process by implementing two deep neural networks (DNNs) that realize radar-to-communication (R2C) channel information translation in a vehicle-to-infrastructure (V2I) system.

Deep Learning Prediction

Towards Making the Most of Multilingual Pretraining for Zero-Shot Neural Machine Translation

1 code implementation 16 Oct 2021 Guanhua Chen, Shuming Ma, Yun Chen, Dongdong Zhang, Jia Pan, Wenping Wang, Furu Wei

When applied to zero-shot cross-lingual abstractive summarization, it produces an average performance gain of 12.3 ROUGE-L over mBART-ft. We conduct detailed analyses to understand the key ingredients of SixT+, including multilinguality of the auxiliary parallel data, positional disentangled encoder, and the cross-lingual transferability of its encoder.

Abstractive Text Summarization Cross-Lingual Abstractive Summarization +6

Few-Shot Domain Adaptation for Grammatical Error Correction via Meta-Learning

no code implementations 29 Jan 2021 Shengsheng Zhang, Yaping Huang, Yun Chen, Liner Yang, Chencheng Wang, Erhong Yang

We exploit a set of data-rich source domains to learn the initialization of model parameters that facilitates fast adaptation on new resource-poor target domains.

Domain Adaptation Grammatical Error Correction +2

Exploring Adversarial Robustness of Multi-Sensor Perception Systems in Self Driving

no code implementations 17 Jan 2021 James Tu, Huichen Li, Xinchen Yan, Mengye Ren, Yun Chen, Ming Liang, Eilyan Bitar, Ersin Yumer, Raquel Urtasun

Yet, there have been limited studies on the adversarial robustness of multi-modal models that fuse LiDAR features with image features.

Adversarial Robustness Denoising +1

DSDNet: Deep Structured self-Driving Network

no code implementations ECCV 2020 Wenyuan Zeng, Shenlong Wang, Renjie Liao, Yun Chen, Bin Yang, Raquel Urtasun

In this paper, we propose the Deep Structured self-Driving Network (DSDNet), which performs object detection, motion prediction, and motion planning with a single neural network.

Motion Planning motion prediction +2

Learning Lane Graph Representations for Motion Forecasting

2 code implementations ECCV 2020 Ming Liang, Bin Yang, Rui Hu, Yun Chen, Renjie Liao, Song Feng, Raquel Urtasun

We propose a motion forecasting model that exploits a novel structured map representation as well as actor-map interactions.

Motion Forecasting Trajectory Prediction

A Deep Reinforcement Learning Approach to Efficient Drone Mobility Support

no code implementations 11 May 2020 Yun Chen, Xingqin Lin, Talha Ahmed Khan, Mohammad Mozaffari

In this paper, we propose a novel handover framework for providing efficient mobility support and reliable wireless connectivity to drones served by a terrestrial cellular network.

Deep Reinforcement Learning Q-Learning +2

Accurate Word Alignment Induction from Neural Machine Translation

1 code implementation EMNLP 2020 Yun Chen, Yang Liu, Guanhua Chen, Xin Jiang, Qun Liu

Shift-Att is an interpretation method that induces alignments from the attention weights of Transformer and does not require parameter update or architecture change.

Decoder Machine Translation +3

Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT

1 code implementation ACL 2020 Zhiyong Wu, Yun Chen, Ben Kao, Qun Liu

However, this approach of evaluating a language model is undermined by the uncertainty of the amount of knowledge that is learned by the probe itself.

Dependency Parsing Language Modeling +3

Dictionary-based Data Augmentation for Cross-Domain Neural Machine Translation

no code implementations 6 Apr 2020 Wei Peng, Chongxuan Huang, Tian-Hao Li, Yun Chen, Qun Liu

Existing data augmentation approaches for neural machine translation (NMT) have predominantly relied on back-translating in-domain (IND) monolingual corpora.

Data Augmentation Machine Translation +2

Efficient Drone Mobility Support Using Reinforcement Learning

no code implementations 21 Nov 2019 Yun Chen, Xingqin Lin, Talha Khan, Mohammad Mozaffari

Flying drones can be used in a wide range of applications and services from surveillance to package delivery.

Q-Learning reinforcement-learning +2

A General Framework for Adaptation of Neural Machine Translation to Simultaneous Translation

no code implementations Asian Chapter of the Association for Computational Linguistics 2020 Yun Chen, Liangyou Li, Xin Jiang, Xiao Chen, Qun Liu

Despite the success of neural machine translation (NMT), simultaneous neural machine translation (SNMT), the task of translating in real time before a full sentence has been observed, remains challenging due to the syntactic structure difference and simultaneity requirements.

Machine Translation NMT +2

Fitness Done Right: a Real-time Intelligent Personal Trainer for Exercise Correction

no code implementations 30 Oct 2019 Yun Chen, Yiyue Chen, Zhengzhong Tu

Finally, key values for key features of the two poses are computed correspondingly in the pose error detection part, which helps give correction advice.

Controllable Data Synthesis Method for Grammatical Error Correction

no code implementations 29 Sep 2019 Liner Yang, Chencheng Wang, Yun Chen, Yongping Du, Erhong Yang

We propose two data synthesis methods which can control the error rate and the ratio of error types on synthetic data.

Grammatical Error Correction

Incorporating Sememes into Chinese Definition Modeling

1 code implementation 16 May 2019 Liner Yang, Cunliang Kong, Yun Chen, Yang Liu, Qinan Fan, Erhong Yang

To accomplish this task, we construct the Chinese Definition Modeling Corpus (CDM), which contains triples of word, sememes and the corresponding definition.

Meta-Learning for Low-Resource Neural Machine Translation

no code implementations EMNLP 2018 Jiatao Gu, Yong Wang, Yun Chen, Kyunghyun Cho, Victor O. K. Li

We frame low-resource translation as a meta-learning problem, and we learn to adapt to low-resource languages based on multilingual high-resource language tasks.

Low Resource Neural Machine Translation Low-Resource Neural Machine Translation +4

A Stable and Effective Learning Strategy for Trainable Greedy Decoding

1 code implementation EMNLP 2018 Yun Chen, Victor O. K. Li, Kyunghyun Cho, Samuel R. Bowman

Beam search is a widely used approximate search strategy for neural network decoders, and it generally outperforms simple greedy decoding on tasks like machine translation.

Decoder Machine Translation +2

Zero-Resource Neural Machine Translation with Multi-Agent Communication Game

no code implementations 9 Feb 2018 Yun Chen, Yang Liu, Victor O. K. Li

While end-to-end neural machine translation (NMT) has achieved notable success in the past years in translating a handful of resource-rich language pairs, it still suffers from the data scarcity problem for low-resource language pairs and domains.

Decoder Image Captioning +3

A Teacher-Student Framework for Zero-Resource Neural Machine Translation

no code implementations ACL 2017 Yun Chen, Yang Liu, Yong Cheng, Victor O. K. Li

While end-to-end neural machine translation (NMT) has made remarkable progress recently, it still suffers from the data scarcity problem for low-resource language pairs and domains.

Machine Translation NMT +2
