Search Results for author: Xavier Tannier

Found 48 papers, 2 papers with code

Classification multilabel de concepts médicaux pour l’identification du profil clinique du patient (Multilabel classification of medical concepts for patient’s clinical profile identification)

no code implementations • JEP/TALN/RECITAL 2021 • Christel Gérardin, Pascal Vaillant, Perceval Wajsbürt, Clément Gilavert, Ali Bellamine, Emmanuelle Kempf, Xavier Tannier

We also proposed an end-to-end model, with a first named entity extraction phase also based on a camembert-large transformer, and a gender classifier based on an AdaBoost model.

Classification

A Benchmark Evaluation of Clinical Named Entity Recognition in French

no code implementations • 28 Mar 2024 • Nesrine Bannour, Christophe Servan, Aurélie Névéol, Xavier Tannier

Objective: This paper presents an evaluation of masked language models for biomedical French on the task of clinical named entity recognition. Material and methods: We evaluate biomedical models CamemBERT-bio and DrBERT and compare them to standard French models CamemBERT, FlauBERT and FrALBERT as well as multilingual mBERT using three publicly available corpora for clinical named entity recognition in French.

named-entity-recognition Named Entity Recognition

Few shot clinical entity recognition in three languages: Masked language models outperform LLM prompting

no code implementations • 20 Feb 2024 • Marco Naguib, Xavier Tannier, Aurélie Névéol

Results are consistent across the three languages and suggest that few-shot learning using large language models is not production-ready for named entity recognition in the clinical domain.

Few-Shot Learning named-entity-recognition +1

Impact of translation on biomedical information extraction from real-life clinical notes

no code implementations • 3 Jun 2023 • Christel Gérardin, Yuhan Xiong, Perceval Wajsbürt, Fabrice Carrat, Xavier Tannier

The objective of our study is to determine whether using English tools to extract and normalize French medical concepts on translations provides comparable performance to French models trained on a set of annotated French clinical notes.

named-entity-recognition Named Entity Recognition +2

Detecting automatically the layout of clinical documents to enhance the performances of downstream natural language processing

no code implementations • 23 May 2023 • Christel Gérardin, Perceval Wajsbürt, Basile Dura, Alice Calliger, Alexandre Moucher, Xavier Tannier, Romain Bey

Using the results of the advanced body extraction algorithm, the per-document precision, recall, and F1 score of the acute infection detection algorithm were 82.54 (95% CI 72.86-91.60), 85.24 (95% CI 76.61-93.70), and 83.87 (95% CI 76.92-90.08), respectively.

Development and validation of a natural language processing algorithm to pseudonymize documents in the context of a clinical data warehouse

no code implementations • 23 Mar 2023 • Xavier Tannier, Perceval Wajsbürt, Alice Calliger, Basile Dura, Alexandre Mouchet, Martin Hilka, Romain Bey

The objective of this study is to address the critical issue of de-identification of clinical reports in order to allow access to data for research purposes, while ensuring patient privacy.

De-identification

Learning structures of the French clinical language: development and validation of word embedding models using 21 million clinical reports from electronic health records

no code implementations • 26 Jul 2022 • Basile Dura, Charline Jean, Xavier Tannier, Alice Calliger, Romain Bey, Antoine Neuraz, Rémi Flicoteaux

We used two French annotated medical datasets to compare our language models to the original CamemBERT network, evaluating the statistical significance of improvement with the Wilcoxon test.

Language Modelling Transfer Learning

Identifying causal relations in tweets using deep learning: Use case on diabetes-related tweets from 2017-2021

1 code implementation • 1 Nov 2021 • Adrian Ahne, Vivek Khetan, Xavier Tannier, Md Imbessat Hassan Rizvi, Thomas Czernichow, Francisco Orchard, Charline Bour, Andrew Fano, Guy Fagherazzi

A cause-effect-tweet dataset was manually labeled and used to train 1) a fine-tuned Bertweet model to detect causal sentences containing a causal association 2) a CRF model with BERT based features to extract possible cause-effect associations.

Effect of depth order on iterative nested named entity recognition models

no code implementations • 2 Apr 2021 • Perceval Wajsburt, Yoann Taillé, Xavier Tannier

We provide a set of experiments to study the model's capabilities and the effects of the order on performance.

named-entity-recognition Named Entity Recognition +2

Participation de l'équipe du LIMICS à DEFT 2020 (Participation of team LIMICS in the DEFT 2020 challenge)

no code implementations • JEPTALNRECITAL 2020 • Perceval Wajsbürt, Yoann Taillé, Guillaume Lainé, Xavier Tannier

In this article, we present the methods designed and the results obtained during our participation in Task 3 of the DEFT 2020 evaluation campaign, which consisted of named entity recognition in the medical domain.

Modèle neuronal pour la résolution de la coréférence dans les dossiers médicaux électroniques (Neural approach for coreference resolution in electronic health records)

no code implementations • JEPTALNRECITAL 2020 • Julien Tourille, Olivier Ferret, Aurélie Névéol, Xavier Tannier

Coreference resolution is an essential component for the automatic construction of medical timelines from electronic health records.

coreference-resolution

Participation de l'équipe LAI à DEFT 2019 (Participation of team LAI in the DEFT 2019 challenge)

no code implementations • JEPTALNRECITAL 2019 • Jacques Hilbey, Louise Deléger, Xavier Tannier

In this article, we present the methods designed and the results obtained during our participation in Task 3 of the DEFT 2019 evaluation campaign.

Terminologies augmented recurrent neural network model for clinical named entity recognition

no code implementations • 25 Apr 2019 • Ivan Lerner, Nicolas Paris, Xavier Tannier

On the APcNER corpus, the micro-average F-measure of the hybrid system on the 5 entities was 69.5% in exact match, and 84.1% in non-exact match.

named-entity-recognition Named Entity Recognition +1

Searching News Articles Using an Event Knowledge Graph Leveraged by Wikidata

no code implementations • 11 Apr 2019 • Charlotte Rudnik, Thibault Ehrhart, Olivier Ferret, Denis Teyssou, Raphaël Troncy, Xavier Tannier

News agencies produce thousands of multimedia stories describing events happening in the world that are either scheduled such as sports competitions, political summits and elections, or breaking events such as military conflicts, terrorist attacks, natural disasters, etc.

Hybrid Approaches for our Participation to the n2c2 Challenge on Cohort Selection for Clinical Trials

no code implementations • 19 Mar 2019 • Xavier Tannier, Nicolas Paris, Hugo Cisneros, Christel Daniel, Matthieu Doutreligne, Catherine Duclos, Nicolas Griffon, Claire Hassen-Khodja, Ivan Lerner, Adrien Parrot, Éric Sadou, Cyrina Saussol, Pascal Vaillant

Materials and Methods: The first method is a weakly supervised method using an unlabeled corpus (MIMIC) to build a silver standard, by producing semi-automatically a small and very precise set of rules to detect some samples of positive and negative patients.

Evaluation of a Sequence Tagging Tool for Biomedical Texts

1 code implementation • WS 2018 • Julien Tourille, Matthieu Doutreligne, Olivier Ferret, Aurélie Névéol, Nicolas Paris, Xavier Tannier

Many applications in biomedical natural language processing rely on sequence tagging as an initial step to perform more complex analysis.

named-entity-recognition Named Entity Recognition +4

Unsupervised Event Clustering and Aggregation from Newswire and Web Articles

no code implementations • WS 2017 • Swen Ribeiro, Olivier Ferret, Xavier Tannier

In this paper, we present an unsupervised pipeline approach for clustering news articles based on identified event instances in their content.

Clustering Document Summarization +1

LIMSI-COT at SemEval-2017 Task 12: Neural Architecture for Temporal Information Extraction from Clinical Narratives

no code implementations • SEMEVAL 2017 • Julien Tourille, Olivier Ferret, Xavier Tannier, Aurélie Névéol

In this paper, we present our participation in SemEval 2017 Task 12.

Domain Adaptation Entity Extraction using GAN +3

Neural Architecture for Temporal Relation Extraction: A Bi-LSTM Approach for Detecting Narrative Containers

no code implementations • ACL 2017 • Julien Tourille, Olivier Ferret, Aurélie Névéol, Xavier Tannier

We present a neural architecture for containment relation identification between medical events and/or temporal expressions.

Relation Temporal Information Extraction +1

Apprendre des représentations jointes de mots et d'entités pour la désambiguïsation d'entités (Combining Word and Entity Embeddings for Entity Linking)

no code implementations • JEPTALNRECITAL 2017 • José Moreno, Romaric Besançon, Romain Beaumont, Eva D'hondt, Anne-Laure Ligozat, Sophie Rosset, Xavier Tannier, Brigitte Grau

Entity disambiguation (or entity linking), which consists of linking entity mentions in a text to entities in a knowledge base, is a problem that arises, among other contexts, in the automatic population of knowledge bases from text.

Entity Embeddings Entity Linking

Temporal information extraction from clinical text

no code implementations • EACL 2017 • Julien Tourille, Olivier Ferret, Xavier Tannier, Aurélie Névéol

In this paper, we present a method for temporal relation extraction from clinical narratives in French and in English.

Relation Temporal Information Extraction +1

Extraction de relations temporelles dans des dossiers électroniques patient (Extracting Temporal Relations from Electronic Health Records)

no code implementations • JEPTALNRECITAL 2016 • Julien Tourille, Olivier Ferret, Aurélie Névéol, Xavier Tannier

This analysis relies on the extraction of events, temporal expressions, and the relations between them.

SemEval-2016 Task 5: Aspect Based Sentiment Analysis

no code implementations • SEMEVAL 2016 • Maria Pontiki, Dimitris Galanis, Haris Papageorgiou, Ion Androutsopoulos, Suresh Manandhar, Mohammad AL-Smadi, Mahmoud Al-Ayyoub, Yanyan Zhao, Bing Qin, Orphée De Clercq, Véronique Hoste, Marianna Apidianaki, Xavier Tannier, Natalia Loukachevitch, Evgeniy Kotelnikov, Nuria Bel, Salud María Jiménez-Zafra, Gülşen Eryiğit

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

LIMSI-COT at SemEval-2016 Task 12: Temporal relation identification using a pipeline of classifiers

no code implementations • SEMEVAL 2016 • Julien Tourille, Olivier Ferret, Aurélie Névéol, Xavier Tannier

Entity Extraction using GAN Relation +2

A Dataset for Open Event Extraction in English

no code implementations • LREC 2016 • Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret, Romaric Besançon

We detail the methodology used for building the corpus and evaluate some existing systems on this new data.

Event Extraction

Datasets for Aspect-Based Sentiment Analysis in French

no code implementations • LREC 2016 • Marianna Apidianaki, Xavier Tannier, Cécile Richart

Aspect Based Sentiment Analysis (ABSA) is the task of mining and summarizing opinions from text about specific entities and their aspects.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA)

Redundancy in French Electronic Health Records: A preliminary study

no code implementations • WS 2015 • Eva D'hondt, Xavier Tannier, Aurélie Névéol

Automatic Extraction of Time Expressions Accross Domains in French Narratives

no code implementations • EMNLP 2015 • Mike Donald Tapi Nzali, Xavier Tannier, Aurélie Névéol

Domain Adaptation Relation Extraction +1

Generative Event Schema Induction with Entity Disambiguation

no code implementations • IJCNLP 2015 • Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret, Romaric Besançon

Entity Disambiguation

Désambiguïsation d'entités pour l'induction non supervisée de schémas événementiels (Entity disambiguation for unsupervised induction of event schemas)

no code implementations • JEPTALNRECITAL 2015 • Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret, Romaric Besançon

Previous methods in the literature use only the heads of phrases to represent entities. Yet the full phrase (for example, "an armed man") carries more discriminative information (than "man").

SENTER

Analyse d'expressions temporelles dans les dossiers électroniques patients (Analysis of temporal expressions in electronic patient records)

no code implementations • JEPTALNRECITAL 2015 • Mike Donald Tapi Nzali, Aurélie Névéol, Xavier Tannier

Drawing on a corpus of documents from several de-identified electronic patient records, we describe the construction of a resource annotated with temporal expressions according to the TimeML standard.

Ranking Multidocument Event Descriptions for Building Thematic Timelines

no code implementations • COLING 2014 • Kiem-Hieu Nguyen, Xavier Tannier, Veronique Moriceau

Evaluating Web-as-corpus Topical Document Retrieval with an Index of the OpenDirectory

no code implementations • LREC 2014 • Clément de Groc, Xavier Tannier

This article introduces a novel protocol and resource to evaluate Web-as-corpus topical document retrieval.

Information Retrieval Machine Translation +1

Thematic Cohesion: measuring terms discriminatory power toward themes

no code implementations • LREC 2014 • Clément de Groc, Xavier Tannier, Claude de Loupy

This graph can be interpreted as a recommendation graph, where two terms occurring in the same document are taken to recommend each other.

Opinion Mining Retrieval +2

Extracting News Web Page Creation Time with DCTFinder

no code implementations • LREC 2014 • Xavier Tannier

Web pages do not offer reliable metadata concerning their creation date and time.

Document Summarization Information Retrieval +2

French Resources for Extraction and Normalization of Temporal Expressions with HeidelTime

no code implementations • LREC 2014 • Véronique Moriceau, Xavier Tannier

French resources have been evaluated in two different ways: on the French TimeBank corpus, a corpus of newspaper articles in French annotated according to the ISO-TimeML standard, and on a user application for automatic building of event timelines.

Information Retrieval

Evaluating Temporal Graphs Built from Texts via Transitive Reduction

no code implementations • 16 Jan 2014 • Xavier Tannier, Philippe Muller

Temporal information has been the focus of recent attention in information extraction, leading to some standardization effort, in particular for the task of relating events in a text.