Search Results for author: Haoran Xu

Found 37 papers, 23 papers with code

Adaptive Bridge between Training and Inference for Dialogue Generation

no code implementations • EMNLP 2021 • Haoran Xu, Hainan Zhang, Yanyan Zou, Hongshen Chen, Zhuoye Ding, Yanyan Lan

Although exposure bias has been widely studied in some NLP tasks, it poses unique challenges in dialogue response generation, a representative one-to-various generation scenario. In real human dialogue, many responses are appropriate for the same context, differing not only in expression but also in topic.

Dialogue Generation NMT +1

Streaming Sequence Transduction through Dynamic Compression

1 code implementation • 2 Feb 2024 • Weiting Tan, Yunmo Chen, Tongfei Chen, Guanghui Qin, Haoran Xu, Heidi C. Zhang, Benjamin Van Durme, Philipp Koehn

We introduce STAR (Stream Transduction with Anchor Representations), a novel Transformer-based model designed for efficient sequence-to-sequence transduction over streams.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update

1 code implementation • 1 Feb 2024 • Liyuan Mao, Haoran Xu, Weinan Zhang, Xianyuan Zhan

To resolve this issue, we propose a simple yet effective modification that projects the backward gradient onto the normal plane of the forward gradient, resulting in an orthogonal-gradient update, a new learning rule for DICE-based methods.
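
As a hedged illustration of the orthogonal-gradient rule described above, the backward gradient can be stripped of its component along the forward gradient before the update. The NumPy sketch below is a minimal reconstruction; the variable names and any final combination of the two gradients are assumptions, not taken from the paper.

```python
import numpy as np

def orthogonal_gradient(g_forward: np.ndarray, g_backward: np.ndarray) -> np.ndarray:
    """Project g_backward onto the normal plane of g_forward."""
    denom = np.dot(g_forward, g_forward) + 1e-12        # guard against a zero forward gradient
    parallel = (np.dot(g_backward, g_forward) / denom) * g_forward
    return g_backward - parallel                        # keep only the orthogonal component
```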

Imitation Learning Offline RL +1

The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts

no code implementations • 23 Jan 2024 • Lingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi

As the influence of large language models (LLMs) spans across global communities, their safety challenges in multilingual settings become paramount for alignment research.

Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

1 code implementation • 16 Jan 2024 • Haoran Xu, Amr Sharaf, Yunmo Chen, Weiting Tan, Lingfeng Shen, Benjamin Van Durme, Kenton Murray, Young Jin Kim

However, even top-performing 13B LLM-based translation models, such as ALMA, do not match the performance of state-of-the-art conventional encoder-decoder translation models or larger-scale LLMs such as GPT-4.

Machine Translation Translation

Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles

no code implementations • 4 Nov 2023 • Weiting Tan, Haoran Xu, Lingfeng Shen, Shuyue Stella Li, Kenton Murray, Philipp Koehn, Benjamin Van Durme, Yunmo Chen

Large language models trained primarily in a monolingual setting have demonstrated their ability to generalize to machine translation using zero- and few-shot examples with in-context learning.
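
To make the zero- vs. few-shot distinction concrete, here is a minimal sketch of in-context translation prompting; the template and the English-German pair are illustrative assumptions, not the paper's exact prompts.

```python
def build_prompt(source: str, examples=()) -> str:
    """Zero-shot when `examples` is empty; few-shot otherwise."""
    blocks = [f"English: {src}\nGerman: {tgt}" for src, tgt in examples]
    blocks.append(f"English: {source}\nGerman:")        # the model completes this line
    return "\n\n".join(blocks)

zero_shot = build_prompt("The weather is nice today.")
few_shot = build_prompt("The weather is nice today.",
                        examples=[("Good morning.", "Guten Morgen.")])
```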

In-Context Learning Machine Translation +1

A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models

1 code implementation • 20 Sep 2023 • Haoran Xu, Young Jin Kim, Amr Sharaf, Hany Hassan Awadalla

In this study, we propose a novel fine-tuning approach for LLMs that is specifically designed for the translation task, eliminating the need for the abundant parallel data that traditional translation models usually depend on.

Language Modelling Machine Translation +1

Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization

1 code implementation • NeurIPS 2023 • Xiangsen Wang, Haoran Xu, Yinan Zheng, Xianyuan Zhan

Offline reinforcement learning (RL) has received considerable attention in recent years due to its attractive capability of learning policies from offline datasets without environmental interactions.

Management Multi-agent Reinforcement Learning +3

Offline Reinforcement Learning with Imbalanced Datasets

no code implementations • 6 Jul 2023 • Li Jiang, Sijie Chen, JieLin Qiu, Haoran Xu, Wai Kin Chan, Zhao Ding

The prevalent use of benchmarks in current offline reinforcement learning (RL) research has led model development to neglect the imbalanced distributions of real-world datasets.

D4RL Offline RL +4

PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning

1 code implementation • 25 May 2023 • Jianxiong Li, Xiao Hu, Haoran Xu, Jingjing Liu, Xianyuan Zhan, Ya-Qin Zhang

Offline-to-online reinforcement learning (RL), by combining the benefits of offline pretraining and online finetuning, promises enhanced sample efficiency and policy performance.

Computational Efficiency reinforcement-learning +1

Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity

1 code implementation • 3 May 2023 • Haoran Xu, Maha Elbayad, Kenton Murray, Jean Maillard, Vedanuj Goswami

Mixture-of-experts (MoE) models that employ sparse activation have demonstrated effectiveness in significantly increasing the number of parameters while maintaining low computational requirements per token.
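
For intuition, a generic top-k MoE layer in PyTorch is sketched below (not this paper's stratified architecture; all sizes are illustrative): each token is routed to only k of the n experts, so parameter count grows with n while per-token compute stays roughly constant.

```python
import torch
import torch.nn as nn

class TopKMoE(nn.Module):
    def __init__(self, d_model: int = 512, n_experts: int = 8, k: int = 2):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(nn.Linear(d_model, d_model) for _ in range(n_experts))
        self.k = k

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (tokens, d_model)
        weights, idx = self.router(x).softmax(dim=-1).topk(self.k, dim=-1)
        out = torch.zeros_like(x)
        for slot in range(self.k):                        # each token visits k experts
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e
                if mask.any():
                    out[mask] += weights[mask, slot, None] * expert(x[mask])
        return out
```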

Machine Translation Translation

Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization

3 code implementations • 28 Mar 2023 • Haoran Xu, Li Jiang, Jianxiong Li, Zhuoran Yang, Zhaoran Wang, Victor Wai Kin Chan, Xianyuan Zhan

This gives a deeper understanding of why the in-sample learning paradigm works, i.e., it applies implicit value regularization to the policy.
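
To make "in-sample learning" concrete: the value function is trained only on actions that actually occur in the dataset, so no out-of-distribution action is ever queried. The snippet below shows the expectile-style asymmetric loss of the closely related IQL method as a minimal example; it is not necessarily the exact regularizer derived in this paper.

```python
import torch

def expectile_loss(q_dataset: torch.Tensor, v: torch.Tensor, tau: float = 0.7) -> torch.Tensor:
    """Asymmetric L2 loss; Q is evaluated only at dataset actions (in-sample)."""
    diff = q_dataset - v
    weight = torch.abs(tau - (diff < 0).float())   # tau for positive errors, 1 - tau for negative
    return (weight * diff.pow(2)).mean()
```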

D4RL Offline RL +2

Language-Aware Multilingual Machine Translation with Self-Supervised Learning

1 code implementation • 10 Feb 2023 • Haoran Xu, Jean Maillard, Vedanuj Goswami

In this work, we first investigate how to utilize intra-distillation to learn more *language-specific* parameters and then show the importance of these language-specific parameters.

Cross-Lingual Transfer Denoising +3

Mind the Gap: Offline Policy Optimization for Imperfect Rewards

1 code implementation • 3 Feb 2023 • Jianxiong Li, Xiao Hu, Haoran Xu, Jingjing Liu, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang

RGM is formulated as a bi-level optimization problem: the upper layer optimizes a reward correction term that performs visitation distribution matching w.r.t. …

Reinforcement Learning (RL)

A Policy-Guided Imitation Approach for Offline Reinforcement Learning

1 code implementation • 15 Oct 2022 • Haoran Xu, Li Jiang, Jianxiong Li, Xianyuan Zhan

We decompose the conventional reward-maximizing policy in offline RL into a guide-policy and an execute-policy.
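
At inference time the decomposition can be read as below (a minimal sketch; the interfaces are assumed for illustration): the guide-policy proposes where to go, and the execute-policy decides how to act to get there.

```python
def act(state, guide_policy, execute_policy):
    target_state = guide_policy(state)           # high level: predict a desirable next state
    return execute_policy(state, target_state)   # low level: pick an action to reach it
```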

D4RL Offline RL +3

Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations

2 code implementations • 20 Jul 2022 • Haoran Xu, Xianyuan Zhan, Honglei Yin, Huiling Qin

We study the problem of offline Imitation Learning (IL) where an agent aims to learn an optimal expert behavior policy without additional online environment interactions.

Imitation Learning Offline RL +1

Discriminator-Guided Model-Based Offline Imitation Learning

no code implementations • 1 Jul 2022 • Wenjia Zhang, Haoran Xu, Haoyi Niu, Peng Cheng, Ming Li, Heming Zhang, Guyue Zhou, Xianyuan Zhan

In this paper, we propose the Discriminator-guided Model-based offline Imitation Learning (DMIL) framework, which introduces a discriminator to simultaneously distinguish the dynamics correctness and suboptimality of model rollout data against real expert demonstrations.
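
A hedged sketch of the discriminator's training signal, using a standard binary cross-entropy formulation (the paper's exact objective and inputs may differ): expert transitions are labeled real and model rollouts fake.

```python
import torch
import torch.nn.functional as F

def discriminator_loss(disc, expert_batch: torch.Tensor, rollout_batch: torch.Tensor) -> torch.Tensor:
    real = disc(expert_batch)    # logits for real expert transitions
    fake = disc(rollout_batch)   # logits for model-generated rollouts
    return (F.binary_cross_entropy_with_logits(real, torch.ones_like(real)) +
            F.binary_cross_entropy_with_logits(fake, torch.zeros_like(fake)))
```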

Imitation Learning

The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains

1 code implementation • 23 May 2022 • Haoran Xu, Philipp Koehn, Kenton Murray

We first highlight the large sensitivity (contribution) gap between high-sensitivity and low-sensitivity parameters and show that model generalization performance can be significantly improved by balancing the contribution of all parameters.
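
Here "sensitivity" can be read as the loss change caused by zeroing a parameter; a common first-order estimate is |gradient × parameter|. The helper below uses that generic estimate, which is an assumption rather than necessarily the paper's exact definition.

```python
import torch

def first_order_sensitivity(param: torch.Tensor) -> torch.Tensor:
    """Per-parameter contribution estimate |dL/dtheta * theta|; call after backward()."""
    return (param.grad * param).abs()
```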

Machine Translation Natural Language Understanding +2

When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning

2 code implementations • 23 May 2022 • Jianxiong Li, Xianyuan Zhan, Haoran Xu, Xiangyu Zhu, Jingjing Liu, Ya-Qin Zhang

In offline reinforcement learning (RL), one detrimental issue to policy learning is the error accumulation of deep Q function in out-of-distribution (OOD) areas.

D4RL Offline RL +2

Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer

1 code implementation • Findings (NAACL) 2022 • Haoran Xu, Kenton Murray

The current state-of-the-art for few-shot cross-lingual transfer learning first trains on abundant labeled data in the source language and then fine-tunes with a few examples on the target language, termed target-adapting.
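
The two-stage recipe reads as follows in a minimal sketch (the `train_steps` helper is hypothetical and stands in for an ordinary fine-tuning loop):

```python
def target_adapting(model, source_data, target_examples, train_steps):
    model = train_steps(model, source_data, lr=2e-5)      # stage 1: abundant source-language labels
    model = train_steps(model, target_examples, lr=1e-5)  # stage 2: a few target-language examples
    return model
```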

Cross-Lingual Transfer Model Selection +2

VAE based Text Style Transfer with Pivot Words Enhancement Learning

1 code implementation • ICON 2021 • Haoran Xu, Sixing Lu, Zhongkai Sun, Chengyuan Ma, Chenlei Guo

Text Style Transfer (TST) aims to alter the underlying style of the source text to another specific style while keeping the same content.

Style Transfer Text Style Transfer

Offline Reinforcement Learning with Soft Behavior Regularization

no code implementations • 14 Oct 2021 • Haoran Xu, Xianyuan Zhan, Jianxiong Li, Honglei Yin

In this work, we start from the performance difference between the learned policy and the behavior policy and derive a new policy learning objective that can be used in the offline setting; it corresponds to the advantage function of the behavior policy multiplied by a state-marginal density ratio.
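
In symbols, the objective described above can be read as a density-ratio-weighted behavior advantage; this reconstruction uses our own notation and is hedged, as the paper's exact form may differ:

```latex
\max_{\pi}\;
\mathbb{E}_{s \sim d^{\beta}}\!\left[
  \frac{d^{\pi}(s)}{d^{\beta}(s)}\,
  \mathbb{E}_{a \sim \pi(\cdot \mid s)}\big[A^{\beta}(s,a)\big]
\right]
```

where d^pi and d^beta are the state-marginal distributions of the learned and behavior policies, and A^beta is the behavior policy's advantage function.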

Continuous Control reinforcement-learning +1

Enhancing semi-supervised learning via self-interested coalitional learning

no code implementations • 29 Sep 2021 • Huiling Qin, Xianyuan Zhan, Yuanxun Li, Haoran Xu, Yu Zheng

Jointly solving these two tasks allows full utilization of information from both labeled and unlabeled data, thus alleviating the problem of over-reliance on labeled data.

BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation

2 code implementations • EMNLP 2021 • Haoran Xu, Benjamin Van Durme, Kenton Murray

The success of bidirectional encoders using masked language models, such as BERT, on numerous natural language processing tasks has prompted researchers to attempt to incorporate these pre-trained models into neural machine translation (NMT) systems.

Language Modelling Machine Translation +2

Constraints Penalized Q-learning for Safe Offline Reinforcement Learning

no code implementations • 19 Jul 2021 • Haoran Xu, Xianyuan Zhan, Xiangyu Zhu

We study the problem of safe offline reinforcement learning (RL), in which the goal is to learn a policy that maximizes long-term reward while satisfying safety constraints, given only offline data and no further interaction with the environment.
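
The problem statement corresponds to a standard constrained MDP objective, written here in our own notation (not quoted from the paper):

```latex
\max_{\pi}\;
\mathbb{E}_{\tau \sim \pi}\Big[\sum_{t}\gamma^{t}\,r(s_t,a_t)\Big]
\quad\text{s.t.}\quad
\mathbb{E}_{\tau \sim \pi}\Big[\sum_{t}\gamma^{t}\,c(s_t,a_t)\Big]\le \ell
```

with cost function c and safety budget ell, optimized from a fixed offline dataset only.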

Offline RL Q-Learning +2

Cross-Lingual BERT Contextual Embedding Space Mapping with Isotropic and Isometric Conditions

1 code implementation • 19 Jul 2021 • Haoran Xu, Philipp Koehn

Typically, a linear orthogonal transformation is learned by aligning static type-level embeddings to build a shared semantic space.
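
Such a linear orthogonal mapping is typically solved in closed form via orthogonal Procrustes; a minimal NumPy sketch, assuming X and Y are row-aligned source and target embedding matrices from a seed dictionary:

```python
import numpy as np

def orthogonal_procrustes(X: np.ndarray, Y: np.ndarray) -> np.ndarray:
    """Return the orthogonal W minimizing ||X @ W - Y||_F."""
    U, _, Vt = np.linalg.svd(X.T @ Y)
    return U @ Vt   # W.T @ W = I, so the mapping is an isometry
```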

Model-Based Offline Planning with Trajectory Pruning

1 code implementation • 16 May 2021 • Xianyuan Zhan, Xiangyu Zhu, Haoran Xu

Recent offline reinforcement learning (RL) studies have made considerable progress toward making RL usable in real-world systems by learning policies from pre-collected datasets without environment interaction.

Offline RL Reinforcement Learning (RL)

Zero-Shot Cross-Lingual Dependency Parsing through Contextual Embedding Transformation

1 code implementation • EACL (AdaptNLP) 2021 • Haoran Xu, Philipp Koehn

Linear embedding transformation has been shown to be effective for zero-shot cross-lingual transfer tasks and achieves surprisingly promising results.

Dependency Parsing Translation +1

Gradual Fine-Tuning for Low-Resource Domain Adaptation

2 code implementations • EACL (AdaptNLP) 2021 • Haoran Xu, Seth Ebner, Mahsa Yarmohammadi, Aaron Steven White, Benjamin Van Durme, Kenton Murray

Fine-tuning is known to improve NLP models by adapting an initial model trained on more plentiful but less domain-salient examples to data in a target domain.
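
One way to read "gradual fine-tuning" is a staged schedule that shrinks the out-of-domain share of the training mix over time; the sketch below is an assumption based on the title and abstract, with a hypothetical `train_steps` helper.

```python
def gradual_finetune(model, in_domain, out_of_domain, train_steps,
                     keep_fractions=(1.0, 0.5, 0.25, 0.0)):
    for frac in keep_fractions:                           # out-of-domain share shrinks each stage
        mix = in_domain + out_of_domain[:int(len(out_of_domain) * frac)]
        model = train_steps(model, mix)                   # fine-tune on the current mixture
    return model
```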

Domain Adaptation

DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning

no code implementations • 23 Feb 2021 • Xianyuan Zhan, Haoran Xu, Yue Zhang, Xiangyu Zhu, Honglei Yin, Yu Zheng

Optimizing the combustion efficiency of a thermal power generating unit (TPGU) is a highly challenging and critical task in the energy industry.

Continuous Control Offline RL +2

Meet Changes with Constancy: Learning Invariance in Multi-Source Translation

no code implementations • COLING 2020 • Jianfeng Liu, Ling Luo, Xiang Ao, Yan Song, Haoran Xu, Jian Ye

Multi-source neural machine translation aims to translate from parallel sources of information (e.g., languages, images, etc.)

Machine Translation NMT +1
