Search Results for author: Xiaoyu Shen

Found 35 papers, 11 papers with code

MovieChats: Chat like Humans in a Closed Domain

no code implementations EMNLP 2020 Hui Su, Xiaoyu Shen, Zhou Xiao, Zheng Zhang, Ernie Chang, Cheng Zhang, Cheng Niu, Jie Zhou

In this work, we take a close look at the movie domain and present a large-scale high-quality corpus with fine-grained annotations in hope of pushing the limit of movie-domain chatbots.

Chatbot, Retrieval

semiPQA: A Study on Product Question Answering over Semi-structured Data

no code implementations ECNLP (ACL) 2022 Xiaoyu Shen, Gianni Barlacchi, Marco del Tredici, Weiwei Cheng, Adrià De Gispert

To fill in this blank, here we study how to effectively incorporate semi-structured answer sources for PQA and focus on presenting answers in a natural, fluent sentence.

Question Answering

MDIA: A Benchmark for Multilingual Dialogue Generation in 46 Languages

1 code implementation 27 Aug 2022 Qingyu Zhang, Xiaoyu Shen, Ernie Chang, Jidong Ge, Pengke Chen

In this paper, we present mDIA, the first large-scale multilingual benchmark for dialogue generation across low- to high-resource languages.

Dialogue Generation

Low-Resource Dense Retrieval for Open-Domain Question Answering: A Comprehensive Survey

no code implementations 5 Aug 2022 Xiaoyu Shen, Svitlana Vakulenko, Marco del Tredici, Gianni Barlacchi, Bill Byrne, Adrià De Gispert

Dense retrieval (DR) approaches based on powerful pre-trained language models (PLMs) have achieved significant advances and have become a key component of modern open-domain question-answering systems.

Open-Domain Question Answering, Retrieval

Meta Self-Refinement for Robust Learning with Weak Supervision

no code implementations 15 May 2022 Dawei Zhu, Xiaoyu Shen, Michael A. Hedderich, Dietrich Klakow

However, labels from weak supervision can be rather noisy, and the high capacity of DNNs makes them prone to overfitting the noisy labels.

A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and Challenges

no code implementations 11 Apr 2022 Junyun Cui, Xiaoyu Shen, Feiping Nie, Zheng Wang, Jinglong Wang, Yulong Chen

In this paper, to address the current lack of a comprehensive survey of existing LJP tasks, datasets, models and evaluations, (1) we analyze 31 LJP datasets in 6 languages, present their construction process and define a classification method of LJP with 3 different attributes; (2) we summarize 14 evaluation metrics under four categories for different outputs of LJP tasks; (3) we review 12 legal-domain pretrained models in 3 languages and highlight 3 major research directions for LJP; (4) we show the state-of-the-art results for 8 representative datasets from different court cases and discuss the open challenges.

Deep Latent-Variable Models for Text Generation

no code implementations 3 Mar 2022 Xiaoyu Shen

Text generation aims to produce human-like natural language output for down-stream tasks.

Dialogue Generation, Document Summarization

Logical Fallacy Detection

1 code implementation 28 Feb 2022 Zhijing Jin, Abhinav Lalwani, Tejas Vaidhya, Xiaoyu Shen, Yiwen Ding, Zhiheng Lyu, Mrinmaya Sachan, Rada Mihalcea, Bernhard Schölkopf

In this paper, we propose the task of logical fallacy detection, and provide a new dataset (Logic) of logical fallacies generally found in text, together with an additional challenge set for detecting logical fallacies in climate change claims (LogicClimate).

Language Modelling, Logical Fallacies

Knowledge-enhanced Session-based Recommendation with Temporal Transformer

no code implementations 16 Dec 2021 Rongzhi Zhang, Yulong Gu, Xiaoyu Shen, Hui Su

We introduce a time interval embedding to represent the time pattern between the item to be predicted and the historical clicks, and use it to replace the position embedding in the original transformer (the resulting model is called the temporal transformer).

Graph Representation Learning, Session-Based Recommendations

Dependency Learning for Legal Judgment Prediction with a Unified Text-to-Text Transformer

1 code implementation 13 Dec 2021 Yunyun Huang, Xiaoyu Shen, Chuanyi Li, Jidong Ge, Bin Luo

Given the fact of a case, Legal Judgment Prediction (LJP) involves a series of sub-tasks such as predicting violated law articles, charges and term of penalty.

Preventing Author Profiling through Zero-Shot Multilingual Back-Translation

1 code implementation EMNLP 2021 David Ifeoluwa Adelani, Miaoran Zhang, Xiaoyu Shen, Ali Davody, Thomas Kleinbauer, Dietrich Klakow

Documents as short as a single sentence may inadvertently reveal sensitive information about their authors, including, e.g., their gender or ethnicity.

Style Transfer, Text Style Transfer

Learning Fine-grained Fact-Article Correspondence in Legal Cases

1 code implementation 21 Apr 2021 Jidong Ge, Yunyun Huang, Xiaoyu Shen, Chuanyi Li, Wei Hu

We believe that learning fine-grained correspondence between each single fact and law articles is crucial for an accurate and trustworthy AI system.

Text Matching

Neural Data-to-Text Generation with LM-based Text Augmentation

no code implementations EACL 2021 Ernie Chang, Xiaoyu Shen, Dawei Zhu, Vera Demberg, Hui Su

Our approach automatically augments the data available for training by (i) generating new text samples based on replacing specific values by alternative ones from the same category, (ii) generating new text samples based on GPT-2, and (iii) proposing an automatic method for pairing the new text samples with data samples.

Data-to-Text Generation, Text Augmentation

Cross-Domain Learning for Classifying Propaganda in Online Contents

2 code implementations 13 Nov 2020 Liqiang Wang, Xiaoyu Shen, Gerard de Melo, Gerhard Weikum

Prior work has focused on supervised learning with training data from the same domain.

DART: A Lightweight Quality-Suggestive Data-to-Text Annotation Tool

no code implementations COLING 2020 Ernie Chang, Jeriah Caplinger, Alex Marin, Xiaoyu Shen, Vera Demberg

We present a lightweight annotation tool, the Data AnnotatoR Tool (DART), for the general task of labeling structured data with textual descriptions.

Active Learning, Text Annotation

Integrating Image Captioning with Rule-based Entity Masking

no code implementations 22 Jul 2020 Aditya Mogadala, Xiaoyu Shen, Dietrich Klakow

Particularly, these image features are subdivided into global and local features, where global features are extracted from the global representation of the image, while local features are extracted from the objects detected locally in an image.

Image Captioning

Diversifying Dialogue Generation with Non-Conversational Text

1 code implementation ACL 2020 Hui Su, Xiaoyu Shen, Sanqiang Zhao, Xiao Zhou, Pengwei Hu, Randy Zhong, Cheng Niu, Jie Zhou

Neural network-based sequence-to-sequence (seq2seq) models strongly suffer from the low-diversity problem when it comes to open-domain dialogue generation.

Dialogue Generation, Translation

Unsupervised Pidgin Text Generation By Pivoting English Data and Self-Training

no code implementations 18 Mar 2020 Ernie Chang, David Ifeoluwa Adelani, Xiaoyu Shen, Vera Demberg

In this work, we develop techniques targeted at bridging the gap between Pidgin English and English in the context of natural language generation.

Data-to-Text Generation, Machine Translation

Unsupervised Rewriter for Multi-Sentence Compression

no code implementations ACL 2019 Yang Zhao, Xiaoyu Shen, Wei Bi, Akiko Aizawa

First, the word graph approach that simply concatenates fragments from multiple sentences may yield non-fluent or ungrammatical compression.

Sentence Compression

Improving Multi-turn Dialogue Modelling with Utterance ReWriter

1 code implementation ACL 2019 Hui Su, Xiaoyu Shen, Rongzhi Zhang, Fei Sun, Pengwei Hu, Cheng Niu, Jie Zhou

To properly train the utterance rewriter, we collect a new dataset with human annotations and introduce a Transformer-based utterance rewriting architecture using the pointer network.

Coreference Resolution, Dialogue Rewriting

NEXUS Network: Connecting the Preceding and the Following in Dialogue Generation

no code implementations EMNLP 2018 Hui Su, Xiaoyu Shen, Wenjie Li, Dietrich Klakow

Sequence-to-Sequence (seq2seq) models have become overwhelmingly popular in building end-to-end trainable dialogue systems.

Dialogue Generation

Improving Variational Encoder-Decoders in Dialogue Generation

no code implementations 6 Feb 2018 Xiaoyu Shen, Hui Su, Shuzi Niu, Vera Demberg

Variational encoder-decoders (VEDs) have shown promising results in dialogue generation.

Dialogue Generation

DailyDialog: A Manually Labelled Multi-turn Dialogue Dataset

13 code implementations IJCNLP 2017 Yan-ran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, Shuzi Niu

We develop a high-quality multi-turn dialog dataset, DailyDialog, which is intriguing in several aspects.
