Search Results for author: Christophe Servan

Found 26 papers, 5 papers with code

Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification

no code implementations17 Apr 2024 Pierre Lepagnol, Thomas Gerald, Sahar Ghannay, Christophe Servan, Sophie Rosset

This study is part of the debate on the efficiency of large versus small language models for text classification by prompting. We assess the performance of small language models in zero-shot text classification, challenging the prevailing dominance of large models. Across 15 datasets, our investigation benchmarks language models from 77M to 40B parameters using different architectures and scoring functions.

text-classification Text Classification +2

A Benchmark Evaluation of Clinical Named Entity Recognition in French

no code implementations28 Mar 2024 Nesrine Bannour, Christophe Servan, Aurélie Névéol, Xavier Tannier

Objective: This paper presentsan evaluation of masked language models for biomedical French on the task of clinical named entity recognition. Material and methods: We evaluate biomedical models CamemBERT-bio and DrBERT and compare them tostandard French models CamemBERT, FlauBERT and FrALBERT as well as multilingual mBERT using three publicallyavailable corpora for clinical named entity recognition in French.

named-entity-recognition Named Entity Recognition

mALBERT: Is a Compact Multilingual BERT Model Still Worth It?

no code implementations27 Mar 2024 Christophe Servan, Sahar Ghannay, Sophie Rosset

Within the current trend of Pretained Language Models (PLM), emerge more and more criticisms about the ethical andecological impact of such models.

Language Modelling Question Answering

Benchmarking Transformers-based models on French Spoken Language Understanding tasks

no code implementations19 Jul 2022 Oralie Cattan, Sahar Ghannay, Christophe Servan, Sophie Rosset

In this paper, we propose a unified benchmark, focused on evaluating models quality and their ecological impact on two well-known French spoken language understanding tasks.

Benchmarking Spoken Language Understanding

On the cross-lingual transferability of multilingual prototypical models across NLU tasks

no code implementations ACL (MetaNLP) 2021 Oralie Cattan, Christophe Servan, Sophie Rosset

Supervised deep learning-based approaches have been applied to task-oriented dialog and have proven to be effective for limited domain and language applications when a sufficient number of training examples are available.

Few-Shot Learning Natural Language Understanding +1

On the Usability of Transformers-based models for a French Question-Answering task

no code implementations RANLP 2021 Oralie Cattan, Christophe Servan, Sophie Rosset

In this paper, we establish a state-of-the-art of the efforts dedicated to the usability of Transformer-based models and propose to evaluate these improvements on the question-answering performances of French language which have few resources.

Cross-Lingual Transfer Data Augmentation +2

Using Whole Document Context in Neural Machine Translation

no code implementations EMNLP (IWSLT) 2019 Valentin Macé, Christophe Servan

In Machine Translation, considering the document as a whole can help to resolve ambiguities and inconsistencies.

Machine Translation Translation

Qwant Research @DEFT 2019: Document matching and information retrieval using clinical cases

no code implementations6 Jul 2019 Estelle Maudet, Oralie Cattan, Maureen de Seyssel, Christophe Servan

For this task, we propose an approach based on language models and evaluate the impact on the results of different preprocessings and matching techniques.

Information Retrieval Retrieval +3

Qwant Research @DEFT 2019 : appariement de documents et extraction d'informations \`a partir de cas cliniques (Document matching and information retrieval using clinical cases)

no code implementations JEPTALNRECITAL 2019 Estelle Maudet, Oralie Cattan, Maureen de Seyssel, Christophe Servan

Pour r{\'e}soudre cette t{\^a}che, nous proposons une approche reposant sur des mod{\`e}les de langue et {\'e}valuons l{'}impact de diff{\'e}rents pr{\'e}-traitements et de diff{\'e}rentes techniques d{'}appariement sur les r{\'e}sultats.

Information Retrieval Retrieval

Image search using multilingual texts: a cross-modal learning approach between image and text

no code implementations27 Mar 2019 Maxime Portaz, Hicham Randrianarivo, Adrien Nivaggioli, Estelle Maudet, Christophe Servan, Sylvain Peyronnet

Moreover, by using multilingual embeddings we ensure that words from two different languages have close descriptors and thus are attached to similar images.

Image Retrieval

SYSTRAN Purely Neural MT Engines for WMT2017

no code implementations WS 2017 Yongchao Deng, Jungi Kim, Guillaume Klein, Catherine Kobus, Natalia Segal, Christophe Servan, Bo wang, Dakun Zhang, Josep Crego, Jean Senellart

This paper describes SYSTRAN's systems submitted to the WMT 2017 shared news translation task for English-German, in both translation directions.

Machine Translation Translation

Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation

1 code implementation6 Dec 2016 Alexandre Berard, Olivier Pietquin, Christophe Servan, Laurent Besacier

This paper proposes a first attempt to build an end-to-end speech-to-text translation system, which does not use source language transcription during learning or decoding.

Speech-to-Text Translation Translation

SYSTRAN's Pure Neural Machine Translation Systems

no code implementations18 Oct 2016 Josep Crego, Jungi Kim, Guillaume Klein, Anabel Rebollo, Kathy Yang, Jean Senellart, Egor Akhanov, Patrice Brunelle, Aurelien Coquard, Yongchao Deng, Satoshi Enoue, Chiyo Geiss, Joshua Johanson, Ardas Khalsa, Raoum Khiari, Byeongil Ko, Catherine Kobus, Jean Lorieux, Leidiana Martins, Dang-Chuan Nguyen, Alexandra Priori, Thomas Riccardi, Natalia Segal, Christophe Servan, Cyril Tiquet, Bo wang, Jin Yang, Dakun Zhang, Jing Zhou, Peter Zoldan

Since the first online demonstration of Neural Machine Translation (NMT) by LISA, NMT development has recently moved from laboratory to production systems as demonstrated by several entities announcing roll-out of NMT engines to replace their existing technologies.

Machine Translation NMT +1

Word2Vec vs DBnary: Augmenting METEOR using Vector Representations or Lexical Resources?

1 code implementation COLING 2016 Christophe Servan, Alexandre Berard, Zied Elloumi, Hervé Blanchon, Laurent Besacier

This paper presents an approach combining lexico-semantic resources and distributed representations of words applied to the evaluation in machine translation (MT).

Machine Translation Translation

MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP

1 code implementation LREC 2016 Alex B{\'e}rard, re, Christophe Servan, Olivier Pietquin, Laurent Besacier

We present MultiVec, a new toolkit for computing continuous representations for text at different granularity levels (word-level or sequences of words).

Document Classification General Classification +2

Adaptation par enrichissement terminologique en traduction automatique statistique fond\'ee sur la g\'en\'eration et le filtrage de bi-segments virtuels

no code implementations JEPTALNRECITAL 2015 Christophe Servan, Marc Dymetman

Nous pr{\'e}sentons des travaux pr{\'e}liminaires sur une approche permettant d{'}ajouter des termes bilingues {\`a} un syst{\`e}me de Traduction Automatique Statistique (TAS) {\`a} base de segments.

Cannot find the paper you are looking for? You can Submit a new open access paper.