no code implementations • ACL (NL4XAI, INLG) 2020 • Juliette Faille, Albert Gatt, Claire Gardent
End-to-end encoder-decoder approaches to data-to-text generation are often black boxes whose predictions are difficult to explain.
1 code implementation • Findings (NAACL) 2022 • Liam Cripwell, Joël Legrand, Claire Gardent
Different types of transformations have been used to model sentence simplification ranging from mainly local operations such as phrasal or lexical rewriting, deletion and re-ordering to the more global affecting the whole input sentence such as sentence rephrasing, copying and splitting.
no code implementations • NAACL (NLPMC) 2021 • Anna Liednikova, Philippe Jolivet, Alexandre Durand-Salmon, Claire Gardent
We focus on dialog models in the context of clinical studies where the goal is to help gather, in addition to the close information collected based on a questionnaire, serendipitous information that is medically relevant.
1 code implementation • ACL (WebNLG, INLG) 2020 • Diego Moussallem, Paramjot Kaur, Thiago Ferreira, Chris van der Lee, Anastasia Shimorina, Felix Conrads, Michael Röder, René Speck, Claire Gardent, Simon Mille, Nikolai Ilinykh, Axel-Cyrille Ngonga Ngomo
The RDF-to-text task has recently gained substantial attention due to the continuous growth of RDF knowledge graphs in number and size.
1 code implementation • Findings (EMNLP) 2021 • Liam Cripwell, Joël Legrand, Claire Gardent
Sentence splitting involves the segmentation of a sentence into two or more shorter sentences.
no code implementations • ACL 2022 • Angela Fan, Claire Gardent
To assess the impact of available web evidence on the output text, we compare the performance of our approach when generating biographies about women (for which less information is available on the web) vs. biographies generally.
no code implementations • ACL (WebNLG, INLG) 2020 • Thiago castro Ferreira, Claire Gardent, Nikolai Ilinykh, Chris van der Lee, Simon Mille, Diego Moussallem, Anastasia Shimorina
WebNLG+ offers two challenges: (i) mapping sets of RDF triples to English or Russian text (generation) and (ii) converting English or Russian text to sets of RDF triples (semantic parsing).
no code implementations • Findings (EMNLP) 2021 • Juliette Faille, Albert Gatt, Claire Gardent
While powerful pre-trained language models have improved the fluency of text generation models, semantic adequacy -the ability to generate text that is semantically faithful to the input- remains an unsolved issue.
no code implementations • LREC 2022 • Kelvin Han, Thiago castro Ferreira, Claire Gardent
Question generation from knowledge bases (or knowledge base question generation, KBQG) is the task of generating questions from structured database information, typically in the form of triples representing facts.
no code implementations • 11 Apr 2024 • Juliette Faille, Quentin Brabant, Gwenole Lecorve, Lina M. Rojas-Barahona, Claire Gardent
We explore question generation in the context of knowledge-grounded dialogs focusing on explainability and evaluation.
no code implementations • 4 Apr 2024 • Liam Cripwell, Joël Legrand, Claire Gardent
In this paper, we focus on the evaluation of document-level text simplification and compare existing models using distinct metrics for meaning preservation and simplification.
1 code implementation • 12 Oct 2023 • Liam Cripwell, Joël Legrand, Claire Gardent
Automatic evaluation for sentence simplification remains a challenging problem.
no code implementations • 5 Oct 2023 • Yao Dou, Philippe Laban, Claire Gardent, Wei Xu
In this tutorial, we focus on text-to-text generation, a class of natural language generation (NLG) tasks, that takes a piece of text as input and then generates a revision that is improved according to some specific criteria (e. g., readability or linguistic styles), while largely retaining the original meaning and the length of the text.
no code implementations • 29 Aug 2023 • Quentin Brabant, Gwenole Lecorve, Lina M. Rojas-Barahona, Claire Gardent
We present KGConv, a large, conversational corpus of 71k conversations where each question-answer pair is grounded in a Wikidata fact.
1 code implementation • 10 May 2023 • Liam Cripwell, Joël Legrand, Claire Gardent
To date, most work on text simplification has focused on sentence-level inputs.
no code implementations • 28 Feb 2023 • Teven Le Scao, Claire Gardent
A key feature of neural models is that they can produce semantic vector representations of objects (texts, images, speech, etc.)
no code implementations • 7 Jul 2022 • Quentin Brabant, Lina Maria Rojas-Barahona, Claire Gardent
In human conversations, ellipsis and coreference are commonly occurring linguistic phenomena.
no code implementations • 12 Apr 2022 • Angela Fan, Claire Gardent
To assess the impact of available web evidence on the output text, we compare the performance of our approach when generating biographies about women (for which less information is available on the web) vs. biographies generally.
no code implementations • COLING 2020 • Anna Liednikova, Philippe Jolivet, Alexandre Durand-Salmon, Claire Gardent
Not only does it allow for the semi-automatic creation of large quantities of training data.
no code implementations • EMNLP 2020 • Angela Fan, Claire Gardent
Generating text from structured data is challenging because it requires bridging the gap between (i) structure and natural language (NL) and (ii) semantically underspecified input and fully specified NL output.
no code implementations • 27 Apr 2020 • Angela Fan, Claire Gardent, Chloe Braud, Antoine Bordes
Various machine learning tasks can benefit from access to external information of different modalities, such as text and images.
1 code implementation • 29 Jan 2020 • Leonardo F. R. Ribeiro, Yue Zhang, Claire Gardent, Iryna Gurevych
Recent graph-to-text models generate text from graph-based data using either global or local aggregation to learn node representations.
Ranked #1 on Graph-to-Sequence on WebNLG
no code implementations • 11 Dec 2019 • Hoa T. Le, Christophe Cerisara, Claire Gardent
Work on summarization has explored both reinforcement learning (RL) optimization using ROUGE as a reward and syntax-aware models, such as models those input is enriched with part-of-speech (POS)-tags and dependency information.
no code implementations • WS 2019 • Anastasia Shimorina, Claire Gardent
This paper presents the LORIA / Lorraine University submission at the Multilingual Surface Realisation shared task 2019 for the shallow track.
no code implementations • IJCNLP 2019 • Anastasia Shimorina, Claire Gardent
Surface realisation (SR) maps a meaning representation to a sentence and can be viewed as consisting of three subtasks: word ordering, morphological inflection and contraction generation (e. g., clitic attachment in Portuguese or elision in French).
1 code implementation • IJCNLP 2019 • Angela Fan, Claire Gardent, Chloe Braud, Antoine Bordes
Query-based open-domain NLP tasks require information synthesis from long and diverse web results.
Ranked #4 on Open-Domain Question Answering on ELI5
no code implementations • WS 2019 • Yevgeniy Puzikov, Claire Gardent, Ido Dagan, Iryna Gurevych
End-to-end neural approaches have achieved state-of-the-art performance in many natural language processing (NLP) tasks.
no code implementations • WS 2019 • Emilie Colin, Claire Gardent
Surface realisation (SR) consists in generating a text from a meaning representations (MR).
no code implementations • 25 Sep 2019 • Angela Fan, Claire Gardent, Chloe Braud, Antoine Bordes
Various machine learning tasks can benefit from access to external information of different modalities, such as text and images.
1 code implementation • IJCNLP 2019 • Leonardo F. R. Ribeiro, Claire Gardent, Iryna Gurevych
Generating text from graph-based data, such as Abstract Meaning Representation (AMR), is a challenging task due to the inherent difficulty in how to properly encode the structure of a graph with labeled edges.
Ranked #11 on AMR-to-Text Generation on LDC2017T10
1 code implementation • WS 2019 • Anastasia Shimorina, Elena Khasanova, Claire Gardent
In this paper, we propose an approach for semi-automatically creating a data-to-text (D2T) corpus for Russian that can be used to learn a D2T natural language generation model.
2 code implementations • WS 2018 • Anastasia Shimorina, Claire Gardent
Neural approaches to data-to-text generation generally handle rare input items using either delexicalisation or a copy mechanism.
Ranked #5 on KG-to-Text Generation on WebNLG 2.0 (Constrained)
no code implementations • EMNLP 2018 • Emilie Colin, Claire Gardent
We study the automatic generation of syntactic paraphrases using four different models for generation: data-to-text generation, text-to-text generation, text reduction and text expansion, We derive training data for each of these tasks from the WebNLG dataset and we show (i) that conditioning generation on syntactic constraints effectively permits the generation of syntactically distinct paraphrases for the same input and (ii) that exploiting different types of input (data, text or data+text) further increases the number of distinct paraphrases that can be generated for a given input.
1 code implementation • 20 Sep 2018 • Chunyang Xiao, Marc Dymetman, Claire Gardent
Seq2seq models based on Recurrent Neural Networks (RNNs) have recently received a lot of attention in the domain of Semantic Parsing for Question Answering.
no code implementations • NAACL 2018 • Claire Gardent, Shashi Narayan
Each text production task raises a slightly different communication goal (e. g, how to take the dialogue context into account when producing a dialogue turn; how to detect and merge relevant information when summarising a text; or how to produce a well-formed text that correctly capture the information contained in some input data in the case of data-to-text generation).
no code implementations • WS 2017 • Claire Gardent, Anastasia Shimorina, Shashi Narayan, Laura Perez-Beltrachini
The WebNLG challenge consists in mapping sets of RDF triples to text.
2 code implementations • EMNLP 2017 • Shashi Narayan, Claire Gardent, Shay B. Cohen, Anastasia Shimorina
We propose a new sentence simplification task (Split-and-Rephrase) where the aim is to split a complex sentence into a meaning preserving sequence of shorter sentences.
no code implementations • ACL 2017 • Claire Gardent, Anastasia Shimorina, Shashi Narayan, Laura Perez-Beltrachini
In this paper, we present a novel framework for semi-automatically creating linguistically challenging micro-planning data-to-text corpora from existing Knowledge Bases.
no code implementations • WS 2017 • Laura Perez-Beltrachini, Claire Gardent
Recently, several data-sets associating data to text have been created to train data-to-text surface realisers.
no code implementations • CL 2017 • Claire Gardent, Laura Perez-Beltrachini
Although there has been much work in recent years on data-driven natural language generation, little attention has been paid to the fine-grained interactions that arise during microplanning between aggregation, surface realization, and sentence segmentation.
no code implementations • COLING 2016 • Laura Perez-Beltrachini, Rania Sayed, Claire Gardent
In Natural Language Generation (NLG), one important limitation is the lack of common benchmarks on which to train, evaluate and compare data-to-text generators.
1 code implementation • WS 2016 • Shashi Narayan, Claire Gardent
We present a novel approach to sentence simplification which departs from previous work in two main ways.
Ranked #2 on Text Simplification on PWKP / WikiSmall
no code implementations • LREC 2012 • Alex Denis, re, Ingrid Falk, Claire Gardent, Laura Perez-Beltrachini
There has been much debate, both theoretical and practical, on how to link ontologies and lexicons in natural language processing (NLP) applications.
no code implementations • LREC 2012 • Lina M. Rojas-Barahona, Alej Lorenzo, ra, Claire Gardent
We describe the acquisition of a dialog corpus for French based on multi-task human-machine interactions in a serious game setting.