Search Results for author: Beatrice Daille

Found 11 papers, 4 papers with code

Cross-lingual and Cross-domain Transfer Learning for Automatic Term Extraction from Low Resource Data

no code implementations LREC 2022 Amir Hazem, Merieme Bouhandi, Florian Boudin, Beatrice Daille

Automatic Term Extraction (ATE) is a key component for domain knowledge understanding and an important basis for further natural language processing applications.

Term Extraction Transfer Learning

How Important Is Tokenization in French Medical Masked Language Models?

no code implementations22 Feb 2024 Yanis Labrak, Adrien Bazoge, Beatrice Daille, Mickael Rouvier, Richard Dufour

Subword tokenization has become the prevailing standard in the field of natural language processing (NLP) over recent years, primarily due to the widespread utilization of pre-trained language models.

DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical Domain

1 code implementation20 Feb 2024 Yanis Labrak, Adrien Bazoge, Oumaima El Khettari, Mickael Rouvier, Pacome Constant dit Beaufils, Natalia Grabar, Beatrice Daille, Solen Quiniou, Emmanuel Morin, Pierre-Antoine Gourraud, Richard Dufour

This limitation hampers the evaluation of the latest French biomedical models, as they are either assessed on a minimal number of tasks with non-standardized protocols or evaluated using general downstream tasks.

named-entity-recognition Named Entity Recognition +3

A Large-Scale Dataset for Biomedical Keyphrase Generation

1 code implementation22 Nov 2022 Mael Houbre, Florian Boudin, Beatrice Daille

Keyphrase generation is the task consisting in generating a set of words or phrases that highlight the main topics of a document.

Keyphrase Generation

Hierarchical Text Segmentation for Medieval Manuscripts

1 code implementation COLING 2020 Amir Hazem, Beatrice Daille, Dominique Stutzmann, Christopher Kermorvant, Louis Chevalier

In this paper, we address the segmentation of books of hours, Latin devotional manuscripts of the late Middle Ages, that exhibit challenging issues: a complex hierarchical entangled structure, variable content, noisy transcriptions with no sentence markers, and strong correlations between sections for which topical information is no longer sufficient to draw segmentation boundaries.

Hierarchical Text Segmentation Segmentation +2

TermEval 2020: TALN-LS2N System for Automatic Term Extraction

no code implementations LREC 2020 Amir Hazem, Bouh, M{\'e}rieme i, Florian Boudin, Beatrice Daille

Automatic terminology extraction is a notoriously difficult task aiming to ease effort demanded to manually identify terms in domain-specific corpora by automatically providing a ranked list of candidate terms.

Term Extraction

Towards Automatic Thesaurus Construction and Enrichment.

no code implementations LREC 2020 Amir Hazem, Beatrice Daille, Lanza Claudia

Thesaurus construction with minimum human efforts often relies on automatic methods to discover terms and their relations.

Semantic Similarity Semantic Textual Similarity

Mod\'elisation unifi\'ee du document et de son domaine pour une indexation par termes-cl\'es libre et contr\^ol\'ee (Unified document and domain-specific model for keyphrase extraction and assignment )

no code implementations JEPTALNRECITAL 2016 Adrien Bougouin, Florian Boudin, Beatrice Daille

Dans cet article, nous nous int{\'e}ressons {\`a} l{'}indexation de documents de domaines de sp{\'e}cialit{\'e} par l{'}interm{\'e}diaire de leurs termes-cl{\'e}s. Plus particuli{\`e}rement, nous nous int{\'e}ressons {\`a} l{'}indexation telle qu{'}elle est r{\'e}alis{\'e}e par les documentalistes de biblioth{\`e}ques num{\'e}riques.

Keyphrase Extraction

Cannot find the paper you are looking for? You can Submit a new open access paper.