Search Results for author: Vincent Claveau

Found 38 papers, 3 papers with code

Décodage guidé par un discriminateur avec le Monte Carlo Tree Search pour la génération de texte contrainte (Discriminator-guided decoding with Monte Carlo Tree Search for constrained text generation )

no code implementations • JEP/TALN/RECITAL 2022 • Antoine Chaffin, Vincent Claveau, Ewa Kijak

Dans cet article, nous explorons comment contrôler la génération de texte au moment du décodage pour satisfaire certaines contraintes (e. g. être non toxique, transmettre certaines émotions...), sans nécessiter de ré-entrainer le modèle de langue.

Text Generation

Paper
Add Code

Choisir le bon co-équipier pour la génération coopérative de texte (Choosing The Right Teammate For Cooperative Text Generation)

no code implementations • JEP/TALN/RECITAL 2022 • Antoine Chaffin, Vincent Claveau, Ewa Kijak, Sylvain Lamprier, Benjamin Piwowarski, Thomas Scialom, Jacopo Staiano

Nous évaluons leurs avantages et inconvénients, en explorant leur précision respective sur des tâches de classification, ainsi que leur impact sur la génération coopérative et leur coût de calcul, dans le cadre d’une stratégie de décodage état de l’art, basée sur une recherche arborescente de Monte-Carlo (MCTS).

Text Generation

Paper
Add Code

La génération de textes artificiels en substitution ou en complément de données d’apprentissage (Generating artificial texts as substitution or complement of training data )

no code implementations • JEP/TALN/RECITAL 2021 • Vincent Claveau, Antoine Chaffin, Ewa Kijak

(ii) peuvent-elles remplacer les données d’origines quand ces dernières ne peuvent pas être distribuées, par exemple pour des raisons de confidentialité ?

Paper
Add Code

Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning

1 code implementation • 21 Feb 2024 • Antoine Chaffin, Ewa Kijak, Vincent Claveau

Secondly, they can serve as additional trajectories in the RL strategy, resulting in a teacher forcing loss weighted by the similarity of the GT to the image.

Cross-Modal Retrieval Image Captioning +2

Paper
Code

Measuring vagueness and subjectivity in texts: from symbolic to neural VAGO

no code implementations • 12 Sep 2023 • Benjamin Icard, Vincent Claveau, Ghislain Atemezing, Paul Égré

We present a hybrid approach to the automated measurement of vagueness and subjectivity in texts.

Paper
Add Code

Which Discriminator for Cooperative Text Generation?

1 code implementation • 25 Apr 2022 • Antoine Chaffin, Thomas Scialom, Sylvain Lamprier, Jacopo Staiano, Benjamin Piwowarski, Ewa Kijak, Vincent Claveau

Language models generate texts by successively predicting probability distributions for next tokens given past ones.

Language Modelling Text Generation

Paper
Code

Generative Cooperative Networks for Natural Language Generation

no code implementations • 28 Jan 2022 • Sylvain Lamprier, Thomas Scialom, Antoine Chaffin, Vincent Claveau, Ewa Kijak, Jacopo Staiano, Benjamin Piwowarski

Generative Adversarial Networks (GANs) have known a tremendous success for many continuous generation tasks, especially in the field of image generation.

Image Generation Text Generation

Paper
Add Code

Generating artificial texts as substitution or complement of training data

no code implementations • LREC 2022 • Vincent Claveau, Antoine Chaffin, Ewa Kijak

The quality of artificially generated texts has considerably improved with the advent of transformers.

Data Augmentation Fake News Detection +1

Paper
Add Code

PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided MCTS Decoding

1 code implementation • NAACL 2022 • Antoine Chaffin, Vincent Claveau, Ewa Kijak

without fine-tuning the LM.

Language Modelling Re-Ranking

Paper
Code

Query expansion with artificially generated texts

no code implementations • 16 Dec 2020 • Vincent Claveau

We rely on a well-known neural generative model, GPT-2, that comes with pre-trained models for English but can also be fine-tuned on specific corpora.

Retrieval Text Generation

Paper
Add Code

Construction de plongements de concepts m\'edicaux sans textes (Embedding medical concepts without texts)

no code implementations • JEPTALNRECITAL 2020 • Vincent Claveau

Les approches existantes pour g{\'e}n{\'e}rer ces plongements n{\'e}cessitent de grandes quantit{\'e}s de documents m{\'e}dicaux.

Paper
Add Code

On the Correlation of Word Embedding Evaluation Metrics

no code implementations • LREC 2020 • Fran{\c{c}}ois Torregrossa, Vincent Claveau, Nihel Kooli, Guillaume Gravier, Robin Allesiardo

Word embeddings intervene in a wide range of natural language processing tasks.

Word Embeddings

Paper
Add Code

Speculation and Negation detection in French biomedical corpora

no code implementations • RANLP 2019 • Cl{\'e}ment Dalloux, Vincent Claveau, Natalia Grabar

We reach up to 97. 21 {\%} and 91. 30 {\%} F-measure for the detection of negation and speculation cues, respectively, using CRFs.

Negation Negation Detection +1

Paper
Add Code

Clinical Case Reports for NLP

no code implementations • WS 2019 • Cyril Grouin, Natalia Grabar, Vincent Claveau, Thierry Hamon

Thus, we manually annotated a set of 717 files into four general categories (age, gender, outcome, and origin) for a total number of 2, 835 annotations.

Paper
Add Code

Corpus annot\'e de cas cliniques en fran\ccais (Annotated corpus with clinical cases in French)

no code implementations • JEPTALNRECITAL 2019 • Natalia Grabar, Cyril Grouin, Thierry Hamon, Vincent Claveau

Pour r{\'e}pondre {\`a} ce d{\'e}fi, nous pr{\'e}sentons dans cet article le corpus CAS contenant des cas cliniques de patients, r{\'e}els ou fictifs, que nous avons compil{\'e}s. Ces cas cliniques en fran{\c{c}}ais couvrent plusieurs sp{\'e}cialit{\'e}s m{\'e}dicales et focalisent donc sur diff{\'e}rentes situations cliniques.

Paper
Add Code

Recherche et extraction d'information dans des cas cliniques. Pr\'esentation de la campagne d'\'evaluation DEFT 2019 (Information Retrieval and Information Extraction from Clinical Cases)

no code implementations • JEPTALNRECITAL 2019 • Natalia Grabar, Cyril Grouin, Thierry Hamon, Vincent Claveau

Cet article pr{\'e}sente la campagne d{'}{\'e}valuation DEFT 2019 sur l{'}analyse de textes cliniques r{\'e}dig{\'e}s en fran{\c{c}}ais.

Information Retrieval Retrieval

Paper
Add Code

CAS: French Corpus with Clinical Cases

no code implementations • WS 2018 • Natalia Grabar, Vincent Claveau, Cl{\'e}ment Dalloux

Textual corpora are extremely important for various NLP applications as they provide information necessary for creating, setting and testing these applications and the corresponding tools.

Information Retrieval

Paper
Add Code

IRISA at SMM4H 2018: Neural Network and Bagging for Tweet Classification

no code implementations • WS 2018 • Anne-Lyse Minard, Christian Raymond, Vincent Claveau

This paper describes the systems developed by IRISA to participate to the four tasks of the SMM4H 2018 challenge.

General Classification Word Embeddings

Paper
Add Code

Port\'ee de la n\'egation : d\'etection par apprentissage supervis\'e en fran\ccais et portugais br\'esilien (Negation scope : sequence labeling by supervised learning in French and Brazilian-Portuguese)

no code implementations • JEPTALNRECITAL 2018 • Cl{\'e}ment Dalloux, Vincent Claveau, Natalia Grabar, Claudia Moro

Cet article pr{\'e}sente nos contributions concernant la d{\'e}tection de la port{\'e}e de la n{\'e}gation en fran{\c{c}}ais et portugais br{\'e}silien.

Negation

Paper
Add Code

Participation de l'IRISA \`a DeFT 2018 : classification et annotation d'opinion dans des tweets (IRISA at DeFT 2018: classifying and tagging opinion in tweets )

no code implementations • JEPTALNRECITAL 2018 • Anne-Lyse Minard, Christian Raymond, Vincent Claveau

L{'}{\'e}quipe a particip{\'e} {\`a} 3 des 4 t{\^a}ches de la campagne : (i) classification des tweets selon s{'}ils concernent les transports ou non, (ii) classification des tweets selon leur polarit{\'e} et (iii) annotation des marqueurs d{'}opinion et de l{'}objet {\`a} propos duquel est exprim{\'e}e l{'}opinion.

Paper
Add Code

DEFT2018 : recherche d'information et analyse de sentiments dans des tweets concernant les transports en \^Ile de France (DEFT2018 : Information Retrieval and Sentiment Analysis in Tweets about Public Transportation in \^Ile de France Region )

no code implementations • JEPTALNRECITAL 2018 • Patrick Paroubek, Cyril Grouin, Patrice Bellot, Vincent Claveau, Iris Eshkol-Taravella, Amel Fraisse, Agata Jackiewicz, Jihen Karoui, Laura Monceaux, Juan-Manuel Torres-Moreno

Cet article pr{\'e}sente l{'}{\'e}dition 2018 de la campagne d{'}{\'e}valuation DEFT (D{\'e}fi Fouille de Textes).

Information Retrieval Retrieval +1

Paper
Add Code

Crit\`eres num\'eriques dans les essais cliniques : annotation, d\'etection et normalisation (Numerical criteria in clinical trials : annotation, detection and normalization)

no code implementations • JEPTALNRECITAL 2017 • Natalia Grabar, Vincent Claveau

Les essais cliniques sont un {\'e}l{\'e}ment fondamental pour l{'}{\'e}valuation de nouvelles th{\'e}rapies ou techniques de diagnostic, de leur s{\'e}curit{\'e} et efficacit{\'e}.

Paper
Add Code

Direct vs. indirect evaluation of distributional thesauri

no code implementations • COLING 2016 • Vincent Claveau, Ewa Kijak

In this paper, we address the problem of the evaluation of such thesauri or embedding models and compare their results.

Information Retrieval Retrieval

Paper
Add Code

Extraction d'expressions-cibles de l'opinion : de l'anglais au fran\ccais (Opinion Target Expression extraction : from English to French)

no code implementations • JEPTALNRECITAL 2016 • Gr{\'e}goire Jadi, Laura Monceaux, Vincent Claveau, B{\'e}atrice Daille

Dans cet article, nous pr{\'e}sentons le d{\'e}veloppement d{'}un syst{\`e}me d{'}extraction d{'}expressions-cibles pour l{'}anglais et sa transposition au fran{\c{c}}ais.

Paper
Add Code

M\'edias traditionnels, m\'edias sociaux : caract\'eriser la r\'einformation (Traditional medias, social medias : characterizing reinformation)

no code implementations • JEPTALNRECITAL 2016 • C{\'e}dric Maigrot, Ewa Kijak, Vincent Claveau

Nous pr{\'e}sentons d{'}autre part quelques exp{\'e}riences de d{\'e}tection automatique des messages issus des m{\'e}dias de r{\'e}information, en {\'e}tudiant notamment l{'}influence d{'}attributs de surface et d{'}attributs portant plus sp{\'e}cifiquement sur le contenu de ces messages.

SENTER SENTS

Paper
Add Code

Distributional Thesauri for Information Retrieval and vice versa

no code implementations • LREC 2016 • Vincent Claveau, Ewa Kijak

In this paper, we address the problem of building and evaluating such thesauri with the help of Information Retrieval (IR) concepts.

Information Retrieval Retrieval

Paper
Add Code

Evaluating Lexical Similarity to build Sentiment Similarity

no code implementations • LREC 2016 • Gr{\'e}goire Jadi, Vincent Claveau, B{\'e}atrice Daille, Laura Monceaux

In this article, we propose to evaluate the lexical similarity information provided by word representations against several opinion resources using traditional Information Retrieval tools.

Information Retrieval Retrieval +3

Paper
Add Code

Strat\'egies de s\'election des exemples pour l'apprentissage actif avec des champs al\'eatoires conditionnels

no code implementations • JEPTALNRECITAL 2015 • Vincent Claveau, Ewa Kijak

D{'}autre part, nous d{\'e}taillons une m{\'e}thode originale de s{\'e}lection s{'}appuyant sur un crit{\`e}re de respect des proportions dans les jeux de donn{\'e}es manipul{\'e}s. Le bien- fond{\'e} de ces propositions est v{\'e}rifi{\'e} au travers de plusieurs t{\^a}ches et jeux de donn{\'e}es, incluant reconnaissance d{'}entit{\'e}s nomm{\'e}es, chunking, phon{\'e}tisation, d{\'e}sambigu{\"\i}sation de sens.

Active Learning Chunking

Paper
Add Code

Improving distributional thesauri by exploring the graph of neighbors

no code implementations • COLING 2014 • Vincent Claveau, Ewa Kijak, Olivier Ferret

Information Retrieval

Paper
Add Code

Exploring the neighbor graph to improve distributional thesauri (Explorer le graphe de voisinage pour am\'eliorer les th\'esaurus distributionnels) [in French]

no code implementations • JEPTALNRECITAL 2014 • Vincent Claveau, Ewa Kijak, Olivier Ferret

Information Retrieval

Paper
Add Code

Generating and using probabilistic morphological resources for the biomedical domain

no code implementations • LREC 2014 • Vincent Claveau, Ewa Kijak

In most Indo-European languages, many biomedical terms are rich morphological structures composed of several constituents mainly originating from Greek or Latin.

Information Retrieval Machine Translation +1

Paper
Add Code

IRISA participation to BioNLP-ST13: lazy-learning and information retrieval for information extraction tasks

no code implementations • WS 2013 • Vincent Claveau

Information Retrieval Language Modelling +1

Paper
Add Code

Unsupervised CRF for knowledge discovery (D\'ecouverte de connaissances dans les s\'equences par CRF non-supervis\'es) [in French]

no code implementations • JEPTALNRECITAL 2013 • Vincent Claveau, Abir Ncibi

Information Retrieval

Paper
Add Code

Unsupervised and Semi-Supervised Morphological Analysis for Information Retrieval in the Biomedical Domain

no code implementations • COLING 2012 • Vincent Claveau

Information Retrieval Morphological Analysis +1

Paper
Add Code

Participation de l'IRISA \`a DeFT2012 : recherche d'information et apprentissage pour la g\'en\'eration de mots-cl\'es (IRISA participation to DeFT2012: information retrieval and machine-learning for keyword generation) [in French]

no code implementations • JEPTALNRECITAL 2012 • Vincent Claveau, Christian Raymond

Information Retrieval Retrieval

Paper
Add Code

Annotation manuelle de matchs de foot : Oh la la la ! l'accord inter-annotateurs ! et c'est le but ! (Manual Annotation of Football Matches : Inter-annotator Agreement ! Gooooal !) [in French]

no code implementations • JEPTALNRECITAL 2012 • Kar{\"e}n Fort, Vincent Claveau

Paper
Add Code

Vectorisation, Okapi et calcul de similarit\'e pour le TAL : pour oublier enfin le TF-IDF (Vectorization, Okapi and Computing Similarity for NLP : Say Goodbye to TF-IDF) [in French]

no code implementations • JEPTALNRECITAL 2012 • Vincent Claveau

Information Retrieval

Paper
Add Code

Annotating Football Matches: Influence of the Source Medium on Manual Annotation

no code implementations • LREC 2012 • Kar{\"e}n Fort, Vincent Claveau

In this paper, we present an annotation campaign of football (soccer) matches, from a heterogeneous text corpus of both match minutes and video commentary transcripts, in French.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.