Search Results for author: Xiaoyu Shen

Found 48 papers, 17 papers with code

RoCBert: Robust Chinese Bert with Multimodal Contrastive Pretraining

no code implementations • ACL 2022 • Hui Su, Weiwei Shi, Xiaoyu Shen, Zhou Xiao, Tuo Ji, Jiarui Fang, Jie Zhou

Large-scale pretrained language models have achieved SOTA results on NLP tasks.

MovieChats: Chat like Humans in a Closed Domain

no code implementations • EMNLP 2020 • Hui Su, Xiaoyu Shen, Zhou Xiao, Zheng Zhang, Ernie Chang, Cheng Zhang, Cheng Niu, Jie Zhou

In this work, we take a close look at the movie domain and present a large-scale high-quality corpus with fine-grained annotations in hope of pushing the limit of movie-domain chatbots.

Chatbot Retrieval

semiPQA: A Study on Product Question Answering over Semi-structured Data

no code implementations • ECNLP (ACL) 2022 • Xiaoyu Shen, Gianni Barlacchi, Marco del Tredici, Weiwei Cheng, Adrià De Gispert

To fill in this blank, here we study how to effectively incorporate semi-structured answer sources for PQA and focus on presenting answers in a natural, fluent sentence.

Attribute Question Answering +1

Product Answer Generation from Heterogeneous Sources: A New Benchmark and Best Practices

no code implementations • ECNLP (ACL) 2022 • Xiaoyu Shen, Gianni Barlacchi, Marco del Tredici, Weiwei Cheng, Bill Byrne, Adrià De Gispert

In this paper, we build a benchmark with annotations for both evidence selection and answer generation covering 6 information sources.

Answer Generation Data Augmentation +1

Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice?

no code implementations • 22 Apr 2024 • Dawei Zhu, Pinzhen Chen, Miaoran Zhang, Barry Haddow, Xiaoyu Shen, Dietrich Klakow

Traditionally, success in multilingual machine translation can be attributed to three key factors in training data: large volume, diverse translation directions, and high quality.

A Preference-driven Paradigm for Enhanced Translation with Large Language Models

no code implementations • 17 Apr 2024 • Dawei Zhu, Sony Trenous, Xiaoyu Shen, Dietrich Klakow, Bill Byrne, Eva Hasler

Recent research has shown that large language models (LLMs) can achieve remarkable translation performance through supervised fine-tuning (SFT) using only a small amount of parallel data.

Sentence Translation

Unraveling the Mystery of Scaling Laws: Part I

no code implementations • 11 Mar 2024 • Hui Su, Zhi Tian, Xiaoyu Shen, Xunliang Cai

However, the original scaling law paper by OpenAI did not disclose the complete details necessary to derive the precise scaling law formulas, and their conclusions are only based on models containing up to 1.5 billion parameters.

The Impact of Demonstrations on Multilingual In-Context Learning: A Multidimensional Analysis

no code implementations • 20 Feb 2024 • Miaoran Zhang, Vagrant Gautam, Mingyang Wang, Jesujoba O. Alabi, Xiaoyu Shen, Dietrich Klakow, Marius Mosbach

Compared to work on monolingual (English) in-context learning, multilingual in-context learning is under-explored, and we lack an in-depth understanding of the role of demonstrations in this context.

In-Context Learning

StableMask: Refining Causal Masking in Decoder-only Transformer

no code implementations • 7 Feb 2024 • Qingyu Yin, Xuzheng He, Xiang Zhuang, Yu Zhao, Jianhua Yao, Xiaoyu Shen, Qiang Zhang

The decoder-only Transformer architecture with causal masking and relative position encoding (RPE) has become the de facto choice in language modeling.

Language Modelling Position

A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Software Engineering Tasks

1 code implementation • 25 Dec 2023 • Wentao Zou, Qi Li, Jidong Ge, Chuanyi Li, Xiaoyu Shen, LiGuo Huang, Bin Luo

We hope that our findings can provide a deeper understanding of PEFT methods on various PTMs and SE downstream tasks.

Fast calculation of Counterparty Credit exposures and associated sensitivities using fourier series expansion

no code implementations • 21 Nov 2023 • Gijs Mast, Xiaoyu Shen, Fang Fang

This paper introduces a novel approach for computing netting-set level and counterparty level exposures, such as Potential Future Exposure (PFE) and Expected Exposure (EE), along with associated sensitivities.

LawBench: Benchmarking Legal Knowledge of Large Language Models

1 code implementation • 28 Sep 2023 • Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Songyang Zhang, Kai Chen, Zongwen Shen, Jidong Ge

We hope this benchmark provides an in-depth understanding of the LLMs' domain-specific capabilities and speeds up the development of LLMs in the legal domain.

Benchmarking Memorization +1

SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects

2 code implementations • 14 Sep 2023 • David Ifeoluwa Adelani, Hannah Liu, Xiaoyu Shen, Nikita Vassilyev, Jesujoba O. Alabi, Yanke Mao, Haonan Gao, Annie En-Shiun Lee

Despite the progress we have recorded in the last few years in multilingual natural language processing, evaluation is typically limited to a small set of languages with available datasets which excludes a large number of low-resource languages.

Cross-Lingual Transfer Language Modelling +5

Weaker Than You Think: A Critical Look at Weakly Supervised Learning

1 code implementation • 27 May 2023 • Dawei Zhu, Xiaoyu Shen, Marius Mosbach, Andreas Stephan, Dietrich Klakow

In this paper, we revisit the setup of these approaches and find that the benefits brought by these approaches are significantly overestimated.

Weakly-supervised Learning

Is Translation Helpful? An Empirical Analysis of Cross-Lingual Transfer in Low-Resource Dialog Generation

no code implementations • 21 May 2023 • Lei Shen, Shuai Yu, Xiaoyu Shen

Cross-lingual transfer is important for developing high-quality chatbots in multiple languages due to the strongly imbalanced distribution of language resources.

Cross-Lingual Transfer Machine Translation +1

xPQA: Cross-Lingual Product Question Answering across 12 Languages

1 code implementation • 16 May 2023 • Xiaoyu Shen, Akari Asai, Bill Byrne, Adrià De Gispert

To study this practical industrial task, we present xPQA, a large-scale annotated cross-lingual PQA dataset in 12 languages across 9 branches, and report results in (1) candidate ranking, to select the best English candidate containing the information to answer a non-English question; and (2) answer generation, to generate a natural-sounding non-English answer based on the selected English candidate.

Answer Generation Machine Translation +3

WeLM: A Well-Read Pre-trained Language Model for Chinese

no code implementations • 21 Sep 2022 • Hui Su, Xiao Zhou, Houjin Yu, Xiaoyu Shen, YuWen Chen, Zilin Zhu, Yang Yu, Jie Zhou

Large Language Models pre-trained with self-supervised learning have demonstrated impressive zero-shot generalization capabilities on a wide spectrum of tasks.

Language Modelling Self-Supervised Learning +2

mDIA: A Benchmark for Multilingual Dialogue Generation in 46 Languages

1 code implementation • 27 Aug 2022 • Qingyu Zhang, Xiaoyu Shen, Ernie Chang, Jidong Ge, Pengke Chen

In this paper, we present mDIA, the first large-scale multilingual benchmark for dialogue generation across low- to high-resource languages.

Dialogue Generation

Low-Resource Dense Retrieval for Open-Domain Question Answering: A Comprehensive Survey

no code implementations • 5 Aug 2022 • Xiaoyu Shen, Svitlana Vakulenko, Marco del Tredici, Gianni Barlacchi, Bill Byrne, Adrià De Gispert

Dense retrieval (DR) approaches based on powerful pre-trained language models (PLMs) achieved significant advances and have become a key component for modern open-domain question-answering systems.

Open-Domain Question Answering Retrieval

Meta Self-Refinement for Robust Learning with Weak Supervision

1 code implementation • 15 May 2022 • Dawei Zhu, Xiaoyu Shen, Michael A. Hedderich, Dietrich Klakow

Training deep neural networks (DNNs) under weak supervision has attracted increasing research attention as it can significantly reduce the annotation cost.

A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation

1 code implementation • NAACL 2022 • David Ifeoluwa Adelani, Jesujoba Oluwadara Alabi, Angela Fan, Julia Kreutzer, Xiaoyu Shen, Machel Reid, Dana Ruiter, Dietrich Klakow, Peter Nabende, Ernie Chang, Tajuddeen Gwadabe, Freshia Sackey, Bonaventure F. P. Dossou, Chris Chinenye Emezue, Colin Leong, Michael Beukman, Shamsuddeen Hassan Muhammad, Guyo Dub Jarso, Oreen Yousuf, Andre Niyongabo Rubungo, Gilles Hacheme, Eric Peter Wairagala, Muhammad Umair Nasir, Benjamin Ayoade Ajibade, Tunde Oluwaseyi Ajayi, Yvonne Wambui Gitau, Jade Abbott, Mohamed Ahmed, Millicent Ochieng, Anuoluwapo Aremu, Perez Ogayo, Jonathan Mukiibi, Fatoumata Ouoba Kabore, Godson Koffi Kalipe, Derguene Mbaye, Allahsera Auguste Tapo, Victoire Memdjokam Koagne, Edwin Munkoh-Buabeng, Valencia Wagner, Idris Abdulmumin, Ayodele Awokoya, Happy Buzaaba, Blessing Sibanda, Andiswa Bukula, Sam Manthalu

We focus on two questions: 1) How can pre-trained models be used for languages not included in the initial pre-training?

Translation

A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and Challenges

no code implementations • 11 Apr 2022 • Junyun Cui, Xiaoyu Shen, Feiping Nie, Zheng Wang, Jinglong Wang, Yulong Chen

In this paper, to address the current lack of a comprehensive survey of existing LJP tasks, datasets, models and evaluations, (1) we analyze 31 LJP datasets in 6 languages, present their construction process and define a classification method of LJP with 3 different attributes; (2) we summarize 14 evaluation metrics under four categories for different outputs of LJP tasks; (3) we review 12 legal-domain pretrained models in 3 languages and highlight 3 major research directions for LJP; (4) we show the state-of-the-art results for 8 representative datasets from different court cases and discuss the open challenges.

From Rewriting to Remembering: Common Ground for Conversational QA Models

no code implementations • NLP4ConvAI (ACL) 2022 • Marco del Tredici, Xiaoyu Shen, Gianni Barlacchi, Bill Byrne, Adrià De Gispert

In conversational QA, models have to leverage information in previous turns to answer upcoming questions.

Question Rewriting

Deep Latent-Variable Models for Text Generation

no code implementations • 3 Mar 2022 • Xiaoyu Shen

Text generation aims to produce human-like natural language output for down-stream tasks.

Dialogue Generation Document Summarization +1

Logical Fallacy Detection

2 code implementations • 28 Feb 2022 • Zhijing Jin, Abhinav Lalwani, Tejas Vaidhya, Xiaoyu Shen, Yiwen Ding, Zhiheng Lyu, Mrinmaya Sachan, Rada Mihalcea, Bernhard Schölkopf

In this paper, we propose the task of logical fallacy detection, and provide a new dataset (Logic) of logical fallacies generally found in text, together with an additional challenge set for detecting logical fallacies in climate change claims (LogicClimate).

Language Modelling Logical Fallacies +2

Knowledge-enhanced Session-based Recommendation with Temporal Transformer

no code implementations • 16 Dec 2021 • Rongzhi Zhang, Yulong Gu, Xiaoyu Shen, Hui Su

We introduce time interval embedding to represent the time pattern between the item that needs to be predicted and historical click, and use it to replace the position embedding in the original transformer (called temporal transformer).

Graph Representation Learning Session-Based Recommendations

Dependency Learning for Legal Judgment Prediction with a Unified Text-to-Text Transformer

1 code implementation • 13 Dec 2021 • Yunyun Huang, Xiaoyu Shen, Chuanyi Li, Jidong Ge, Bin Luo

Given the fact of a case, Legal Judgment Prediction (LJP) involves a series of sub-tasks such as predicting violated law articles, charges and term of penalty.

AST-Transformer: Encoding Abstract Syntax Trees Efficiently for Code Summarization

no code implementations • 2 Dec 2021 • Ze Tang, Chuanyi Li, Jidong Ge, Xiaoyu Shen, Zheling Zhu, Bin Luo

Code summarization aims to generate brief natural language descriptions for source code.

Code Summarization

Preventing Author Profiling through Zero-Shot Multilingual Back-Translation

1 code implementation • EMNLP 2021 • David Ifeoluwa Adelani, Miaoran Zhang, Xiaoyu Shen, Ali Davody, Thomas Kleinbauer, Dietrich Klakow

Documents as short as a single sentence may inadvertently reveal sensitive information about their authors, including e.g. their gender or ethnicity.

Sentence Style Transfer +2

The SelectGen Challenge: Finding the Best Training Samples for Few-Shot Neural Text Generation

no code implementations • INLG (ACL) 2021 • Ernie Chang, Xiaoyu Shen, Alex Marin, Vera Demberg

We propose a shared task on training instance selection for few-shot neural text generation.

Text Generation

On Training Instance Selection for Few-Shot Neural Text Generation

no code implementations • ACL 2021 • Ernie Chang, Xiaoyu Shen, Hui-Syuan Yeh, Vera Demberg

In this work, we present a study on training instance selection in few-shot neural text generation.

Clustering Data-to-Text Generation +3

Learning Fine-grained Fact-Article Correspondence in Legal Cases

1 code implementation • 21 Apr 2021 • Jidong Ge, Yunyun Huang, Xiaoyu Shen, Chuanyi Li, Wei Hu

We believe that learning fine-grained correspondence between each single fact and law articles is crucial for an accurate and trustworthy AI system.

Text Matching

Neural Data-to-Text Generation with LM-based Text Augmentation

no code implementations • EACL 2021 • Ernie Chang, Xiaoyu Shen, Dawei Zhu, Vera Demberg, Hui Su

Our approach automatically augments the data available for training by (i) generating new text samples based on replacing specific values by alternative ones from the same category, (ii) generating new text samples based on GPT-2, and (iii) proposing an automatic method for pairing the new text samples with data samples.

Data-to-Text Generation Text Augmentation

Data Augmentation for Multiclass Utterance Classification -- A Systematic Study

no code implementations • COLING 2020 • Binxia Xu, Siyuan Qiu, Jie Zhang, Yafang Wang, Xiaoyu Shen, Gerard de Melo

Utterance classification is a key component in many conversational systems.

Data Augmentation text-classification +2

Cross-Domain Learning for Classifying Propaganda in Online Contents

2 code implementations • 13 Nov 2020 • Liqiang Wang, Xiaoyu Shen, Gerard de Melo, Gerhard Weikum

Prior work has focused on supervised learning with training data from the same domain.

DART: A Lightweight Quality-Suggestive Data-to-Text Annotation Tool

no code implementations • COLING 2020 • Ernie Chang, Jeriah Caplinger, Alex Marin, Xiaoyu Shen, Vera Demberg

We present a lightweight annotation tool, the Data AnnotatoR Tool (DART), for the general task of labeling structured data with textual descriptions.

Active Learning text annotation

Integrating Image Captioning with Rule-based Entity Masking

no code implementations • 22 Jul 2020 • Aditya Mogadala, Xiaoyu Shen, Dietrich Klakow

Particularly, these image features are subdivided into global and local features, where global features are extracted from the global representation of the image, while local features are extracted from the objects detected locally in an image.

Image Captioning

Diversifying Dialogue Generation with Non-Conversational Text

1 code implementation • ACL 2020 • Hui Su, Xiaoyu Shen, Sanqiang Zhao, Xiao Zhou, Pengwei Hu, Randy Zhong, Cheng Niu, Jie Zhou

Neural network-based sequence-to-sequence (seq2seq) models strongly suffer from the low-diversity problem when it comes to open-domain dialogue generation.

Dialogue Generation Translation

Neural Data-to-Text Generation via Jointly Learning the Segmentation and Correspondence

no code implementations • ACL 2020 • Xiaoyu Shen, Ernie Chang, Hui Su, Jie Zhou, Dietrich Klakow

The neural attention model has achieved great success in data-to-text generation tasks.

Data-to-Text Generation Hallucination

Unsupervised Pidgin Text Generation By Pivoting English Data and Self-Training

no code implementations • 18 Mar 2020 • Ernie Chang, David Ifeoluwa Adelani, Xiaoyu Shen, Vera Demberg

In this work, we develop techniques targeted at bridging the gap between Pidgin English and English in the context of natural language generation.

Data-to-Text Generation Machine Translation +1

Improving Latent Alignment in Text Summarization by Generalizing the Pointer Generator

no code implementations • IJCNLP 2019 • Xiaoyu Shen, Yang Zhao, Hui Su, Dietrich Klakow

Pointer Generators have been the de facto standard for modern summarization systems.

Text Summarization Word Alignment

Select and Attend: Towards Controllable Content Selection in Text Generation

1 code implementation • IJCNLP 2019 • Xiaoyu Shen, Jun Suzuki, Kentaro Inui, Hui Su, Dietrich Klakow, Satoshi Sekine

As a result, the content to be described in the text cannot be explicitly controlled.

Headline Generation

Unsupervised Rewriter for Multi-Sentence Compression

no code implementations • ACL 2019 • Yang Zhao, Xiaoyu Shen, Wei Bi, Akiko Aizawa

First, the word graph approach that simply concatenates fragments from multiple sentences may yield non-fluent or ungrammatical compression.

Sentence Sentence Compression

Improving Multi-turn Dialogue Modelling with Utterance ReWriter

1 code implementation • ACL 2019 • Hui Su, Xiaoyu Shen, Rongzhi Zhang, Fei Sun, Pengwei Hu, Cheng Niu, Jie Zhou

To properly train the utterance rewriter, we collect a new dataset with human annotations and introduce a Transformer-based utterance rewriting architecture using the pointer network.

Coreference Resolution Dialogue Rewriting

NEXUS Network: Connecting the Preceding and the Following in Dialogue Generation

no code implementations • EMNLP 2018 • Hui Su, Xiaoyu Shen, Wenjie Li, Dietrich Klakow

Sequence-to-Sequence (seq2seq) models have become overwhelmingly popular in building end-to-end trainable dialogue systems.

Dialogue Generation

Improving Variational Encoder-Decoders in Dialogue Generation

no code implementations • 6 Feb 2018 • Xiaoyu Shen, Hui Su, Shuzi Niu, Vera Demberg

Variational encoder-decoders (VEDs) have shown promising results in dialogue generation.

Dialogue Generation

DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset

13 code implementations • IJCNLP 2017 • Yan-ran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, Shuzi Niu

We develop a high-quality multi-turn dialog dataset, DailyDialog, which is intriguing in several aspects.

A Conditional Variational Framework for Dialog Generation

no code implementations • ACL 2017 • Xiaoyu Shen, Hui Su, Yan-ran Li, Wenjie Li, Shuzi Niu, Yang Zhao, Akiko Aizawa, Guoping Long

Deep latent variable models have been shown to facilitate the response generation for open-domain dialog systems.

Attribute Open-Domain Dialog +1
