Search Results for author: Thierry Poibeau

Found 44 papers, 3 papers with code

Sonnet Combinatorics with OuPoCo

no code implementations COLING (LaTeCHCLfL, CLFL, LaTeCH) 2020 Thierry Poibeau, Mylène Maignant, Frédérique Mélanie-Becquet, Clément Plancq, Matthieu Raffard, Mathilde Roussel

In this paper, we describe OuPoCo, a system producing new sonnets by recombining verses from existing sonnets, following an idea that Queneau described in his book “Cent Mille Milliards de poèmes” (Gallimard, 1961).

Text Zoning of Theater Reviews: How Different are Journalistic from Blogger Reviews?

no code implementations NLP4DH (ICON) 2021 Mylene Maignant, Thierry Poibeau, Gaëtan Brison

This paper aims at modeling the structure of theater reviews based on contemporary London performances by using text zoning.

Sentence

Multi-SimLex: A Large-Scale Evaluation of Multilingual and Crosslingual Lexical Semantic Similarity

no code implementations CL (ACL) 2020 Ivan Vulić, Simon Baker, Edoardo Maria Ponti, Ulla Petti, Ira Leviant, Kelly Wing, Olga Majewska, Eden Bar, Matt Malone, Thierry Poibeau, Roi Reichart, Anna Korhonen

We introduce Multi-SimLex, a large-scale lexical resource and evaluation benchmark covering data sets for 12 typologically diverse languages, including major languages (e.g., Mandarin Chinese, Spanish, Russian) as well as less-resourced ones (e.g., Welsh, Kiswahili).

Representation Learning Semantic Similarity +2

How to Evaluate Coreference in Literary Texts?

no code implementations 30 Dec 2023 Ana-Isabel Duron-Tejedor, Pascal Amsili, Thierry Poibeau

In this short paper, we examine the main metrics used to evaluate textual coreference and we detail some of their limitations.

On the Correspondence between Compositionality and Imitation in Emergent Neural Communication

no code implementations 22 May 2023 Emily Cheng, Mathieu Rita, Thierry Poibeau

Compositionality is a hallmark of human language that not only enables linguistic generalization, but also potentially facilitates acquisition.

Imitation Learning

Video Games as a Corpus: Sentiment Analysis using Fallout New Vegas Dialog

no code implementations 5 Dec 2022 Mika Hämäläinen, Khalid Alnajjar, Thierry Poibeau

We conduct experiments on multilingual, multilabel sentiment analysis on the extracted data set using multilingual BERT, XLM-RoBERTa and language-specific BERT models.

Sentiment Analysis

Word Order Matters when you Increase Masking

no code implementations 8 Nov 2022 Karim Lasri, Alessandro Lenci, Thierry Poibeau

We find that the necessity of position information increases with the amount of masking, and that masked language models without position encodings are not able to reconstruct this information on the task.

Language Modelling Position +1

Subject Verb Agreement Error Patterns in Meaningless Sentences: Humans vs. BERT

no code implementations COLING 2022 Karim Lasri, Olga Seminck, Alessandro Lenci, Thierry Poibeau

We compare the performance of BERT-base to that of humans, obtained with a psycholinguistic online crowdsourcing experiment.

Probing for the Usage of Grammatical Number

no code implementations ACL 2022 Karim Lasri, Tiago Pimentel, Alessandro Lenci, Thierry Poibeau, Ryan Cotterell

We also find that BERT uses a separate encoding of grammatical number for nouns and verbs.

Does BERT really agree ? Fine-grained Analysis of Lexical Dependence on a Syntactic Task

no code implementations Findings (ACL) 2022 Karim Lasri, Alessandro Lenci, Thierry Poibeau

Although transformer-based Neural Language Models demonstrate impressive performance on a variety of tasks, their generalization abilities are not well understood.

Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity

no code implementations 10 Mar 2020 Ivan Vulić, Simon Baker, Edoardo Maria Ponti, Ulla Petti, Ira Leviant, Kelly Wing, Olga Majewska, Eden Bar, Matt Malone, Thierry Poibeau, Roi Reichart, Anna Korhonen

We introduce Multi-SimLex, a large-scale lexical resource and evaluation benchmark covering datasets for 12 typologically diverse languages, including major languages (e.g., Mandarin Chinese, Spanish, Russian) as well as less-resourced ones (e.g., Welsh, Kiswahili).

Cross-Lingual Word Embeddings Representation Learning +3

SEx BiST: A Multi-Source Trainable Parser with Deep Contextualized Lexical Representations

1 code implementation CONLL 2018 KyungTae Lim, Cheoneum Park, Changki Lee, Thierry Poibeau

We describe the SEx BiST parser (Semantically EXtended Bi-LSTM parser) developed at Lattice for the CoNLL 2018 Shared Task (Multilingual Parsing from Raw Text to Universal Dependencies).

Dependency Parsing Event Extraction

Enjambment Detection in a Large Diachronic Corpus of Spanish Sonnets

no code implementations WS 2017 Pablo Ruiz, Clara Martínez Cantón, Thierry Poibeau, Elena González-Blanco

Enjambment takes place when a syntactic unit is broken up across two lines of poetry, giving rise to different stylistic effects.

A System for Multilingual Dependency Parsing based on Bidirectional LSTM Feature Representations

no code implementations CONLL 2017 KyungTae Lim, Thierry Poibeau

In this paper, we present our multilingual dependency parser developed for the CoNLL 2017 UD Shared Task dealing with “Multilingual Parsing from Raw Text to Universal Dependencies”.

Dependency Parsing Multilingual Word Embeddings

Exploring a Continuous and Flexible Representation of the Lexicon

no code implementations COLING 2016 Pierre Marchal, Thierry Poibeau

We aim at showing that lexical descriptions based on multifactorial and continuous models can be used by linguists and lexicographers (and not only by machines) so long as they are provided with a way to efficiently navigate data collections.

Navigate Semantic Textual Similarity

Introduction: Cognitive Issues in Natural Language Processing

no code implementations 24 Oct 2016 Thierry Poibeau, Shravan Vasishth

This special issue is dedicated to getting a better picture of the relationships between computational linguistics and cognitive science.

Language Modelling

Generating Navigable Semantic Maps from Social Sciences Corpora

no code implementations 8 Jul 2015 Thierry Poibeau, Pablo Ruiz

It is now commonplace to observe that we are facing a deluge of online information.

Navigate

Archaeology in the Digital Age: From Paper to Databases

no code implementations 8 Jul 2015 Frédérique Mélanie-Becquet, Johan Ferguth, Katherine Gruel, Thierry Poibeau

Research units in archaeology often manage large and precious archives containing various documents, including reports on fieldwork, scholarly studies and reference books.

Mapping the Economic Crisis: Some Preliminary Investigations

no code implementations 17 Jun 2014 Pierre Bourreau, Thierry Poibeau

In this paper we describe our contribution to the PoliInformatics 2014 Challenge on the 2007-2008 financial crisis.

Optimality Theory as a Framework for Lexical Acquisition

no code implementations 26 May 2014 Thierry Poibeau

This paper re-investigates a lexical acquisition system initially developed for French. We show that, interestingly, the architecture of the system reproduces and implements the main components of Optimality Theory.

Reconstructing the Semantic Landscape of Natural Language Processing

no code implementations LREC 2014 Elisa Omodei, Jean-Philippe Cointet, Thierry Poibeau

Then, the semantic network is built using a co-occurrence analysis of these keywords within the corpus.
