Search Results for author: Christophe Servan

Found 26 papers, 5 papers with code

Small Language Models are Good Too: An Empirical Study of Zero-Shot Classification

no code implementations • 17 Apr 2024 • Pierre Lepagnol, Thomas Gerald, Sahar Ghannay, Christophe Servan, Sophie Rosset

This study is part of the debate on the efficiency of large versus small language models for text classification by prompting. We assess the performance of small language models in zero-shot text classification, challenging the prevailing dominance of large models. Across 15 datasets, our investigation benchmarks language models from 77M to 40B parameters using different architectures and scoring functions.

text-classification Text Classification +2

Paper
Add Code

A Benchmark Evaluation of Clinical Named Entity Recognition in French

no code implementations • 28 Mar 2024 • Nesrine Bannour, Christophe Servan, Aurélie Névéol, Xavier Tannier

Objective: This paper presentsan evaluation of masked language models for biomedical French on the task of clinical named entity recognition. Material and methods: We evaluate biomedical models CamemBERT-bio and DrBERT and compare them tostandard French models CamemBERT, FlauBERT and FrALBERT as well as multilingual mBERT using three publicallyavailable corpora for clinical named entity recognition in French.

named-entity-recognition Named Entity Recognition

Paper
Add Code

New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark

1 code implementation • 28 Mar 2024 • Nadège Alavoine, Gaëlle Laperriere, Christophe Servan, Sahar Ghannay, Sophie Rosset

A combination ofmultiple datasets, including the MEDIA dataset, was suggested for training this joint model.

intent-classification Intent Classification +4

Paper
Code

mALBERT: Is a Compact Multilingual BERT Model Still Worth It?

no code implementations • 27 Mar 2024 • Christophe Servan, Sahar Ghannay, Sophie Rosset

Within the current trend of Pretained Language Models (PLM), emerge more and more criticisms about the ethical andecological impact of such models.

Language Modelling Question Answering

Paper
Add Code

Benchmarking Transformers-based models on French Spoken Language Understanding tasks

no code implementations • 19 Jul 2022 • Oralie Cattan, Sahar Ghannay, Christophe Servan, Sophie Rosset

In this paper, we propose a unified benchmark, focused on evaluating models quality and their ecological impact on two well-known French spoken language understanding tasks.

Benchmarking Spoken Language Understanding

Paper
Add Code

On the cross-lingual transferability of multilingual prototypical models across NLU tasks

no code implementations • ACL (MetaNLP) 2021 • Oralie Cattan, Christophe Servan, Sophie Rosset

Supervised deep learning-based approaches have been applied to task-oriented dialog and have proven to be effective for limited domain and language applications when a sufficient number of training examples are available.

Few-Shot Learning Natural Language Understanding +1

Paper
Add Code

On the Usability of Transformers-based models for a French Question-Answering task

no code implementations • RANLP 2021 • Oralie Cattan, Christophe Servan, Sophie Rosset

In this paper, we establish a state-of-the-art of the efforts dedicated to the usability of Transformer-based models and propose to evaluate these improvements on the question-answering performances of French language which have few resources.

Cross-Lingual Transfer Data Augmentation +2

Paper
Add Code

Neural Networks approaches focused on French Spoken Language Understanding: application to the MEDIA Evaluation Task

1 code implementation • COLING 2020 • Sahar Ghannay, Christophe Servan, Sophie Rosset

In this paper, we present a study on a French Spoken Language Understanding (SLU) task: the MEDIA task.

Spoken Language Understanding Word Embeddings

Paper
Code

Conception d'un syst\`eme de d\'etection d'intention pour un moteur de recherche sur Internet (Designing a User Intention Detection system for a Web Search Engine)

no code implementations • JEPTALNRECITAL 2020 • Estelle Maudet, Christophe Servan

Cet article pr{\'e}sente notre {\'e}tude pour proposer un nouveau syst{\`e}me de d{\'e}tection d{'}intention pour le moteur de recherche sur Internet Qwant.

Paper
Add Code

Using Whole Document Context in Neural Machine Translation

no code implementations • EMNLP (IWSLT) 2019 • Valentin Macé, Christophe Servan

In Machine Translation, considering the document as a whole can help to resolve ambiguities and inconsistencies.

Machine Translation Translation

Paper
Add Code

Qwant Research @DEFT 2019: Document matching and information retrieval using clinical cases

no code implementations • 6 Jul 2019 • Estelle Maudet, Oralie Cattan, Maureen de Seyssel, Christophe Servan

For this task, we propose an approach based on language models and evaluate the impact on the results of different preprocessings and matching techniques.

Information Retrieval Retrieval +3

Paper
Add Code

Qwant Research @DEFT 2019 : appariement de documents et extraction d'informations \`a partir de cas cliniques (Document matching and information retrieval using clinical cases)

no code implementations • JEPTALNRECITAL 2019 • Estelle Maudet, Oralie Cattan, Maureen de Seyssel, Christophe Servan

Pour r{\'e}soudre cette t{\^a}che, nous proposons une approche reposant sur des mod{\`e}les de langue et {\'e}valuons l{'}impact de diff{\'e}rents pr{\'e}-traitements et de diff{\'e}rentes techniques d{'}appariement sur les r{\'e}sultats.

Information Retrieval Retrieval

Paper
Add Code

Image search using multilingual texts: a cross-modal learning approach between image and text

no code implementations • 27 Mar 2019 • Maxime Portaz, Hicham Randrianarivo, Adrien Nivaggioli, Estelle Maudet, Christophe Servan, Sylvain Peyronnet

Moreover, by using multilingual embeddings we ensure that words from two different languages have close descriptors and thus are attached to similar images.

Image Retrieval

Paper
Add Code

SYSTRAN Purely Neural MT Engines for WMT2017

no code implementations • WS 2017 • Yongchao Deng, Jungi Kim, Guillaume Klein, Catherine Kobus, Natalia Segal, Christophe Servan, Bo wang, Dakun Zhang, Josep Crego, Jean Senellart

This paper describes SYSTRAN's systems submitted to the WMT 2017 shared news translation task for English-German, in both translation directions.

Machine Translation Translation

Paper
Add Code

Adaptation incr\'ementale de mod\`eles de traduction neuronaux (Incremental adaptation of neural machine translation models)

no code implementations • JEPTALNRECITAL 2017 • Christophe Servan, Josep Crego, Jean Senellart

L{'}adaptation au domaine est un verrou scientifique en traduction automatique.

Machine Translation Translation

Paper
Add Code

Conception d'une solution de d\'etection d'\'ev\'enements bas\'ee sur Twitter (Design of a solution for event detection from Tweeter)

no code implementations • JEPTALNRECITAL 2017 • Christophe Servan, Catherine Kobus, Yongchao Deng, Cyril Touffet, Jungi Kim, In{\`e}s Kapp, Djamel Mostefa, Josep Crego, Aur{\'e}lien Coquard, Jean Senellart

Cet article pr{\'e}sente un syst{\`e}me d{'}alertes fond{\'e} sur la masse de donn{\'e}es issues de Tweeter.

Event Detection

Paper
Add Code

Domain specialization: a post-training domain adaptation for Neural Machine Translation

no code implementations • 19 Dec 2016 • Christophe Servan, Josep Crego, Jean Senellart

Domain adaptation is a key feature in Machine Translation.

Domain Adaptation Machine Translation +2

Paper
Add Code

Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation

1 code implementation • 6 Dec 2016 • Alexandre Berard, Olivier Pietquin, Christophe Servan, Laurent Besacier

This paper proposes a first attempt to build an end-to-end speech-to-text translation system, which does not use source language transcription during learning or decoding.

Speech-to-Text Translation Translation

387

Paper
Code

SYSTRAN's Pure Neural Machine Translation Systems

no code implementations • 18 Oct 2016 • Josep Crego, Jungi Kim, Guillaume Klein, Anabel Rebollo, Kathy Yang, Jean Senellart, Egor Akhanov, Patrice Brunelle, Aurelien Coquard, Yongchao Deng, Satoshi Enoue, Chiyo Geiss, Joshua Johanson, Ardas Khalsa, Raoum Khiari, Byeongil Ko, Catherine Kobus, Jean Lorieux, Leidiana Martins, Dang-Chuan Nguyen, Alexandra Priori, Thomas Riccardi, Natalia Segal, Christophe Servan, Cyril Tiquet, Bo wang, Jin Yang, Dakun Zhang, Jing Zhou, Peter Zoldan

Since the first online demonstration of Neural Machine Translation (NMT) by LISA, NMT development has recently moved from laboratory to production systems as demonstrated by several entities announcing roll-out of NMT engines to replace their existing technologies.

Machine Translation NMT +1

Paper
Add Code

Word2Vec vs DBnary: Augmenting METEOR using Vector Representations or Lexical Resources?

1 code implementation • COLING 2016 • Christophe Servan, Alexandre Berard, Zied Elloumi, Hervé Blanchon, Laurent Besacier

This paper presents an approach combining lexico-semantic resources and distributed representations of words applied to the evaluation in machine translation (MT).

Machine Translation Translation

Paper
Code

Word2Vec vs DBnary ou comment (r\'e)concilier repr\'esentations distribu\'ees et r\'eseaux lexico-s\'emantiques ? Le cas de l'\'evaluation en traduction automatique (Word2Vec vs DBnary or how to bring back together vector representations and lexical resources ? A case study for machine translation evaluation)

no code implementations • JEPTALNRECITAL 2016 • Christophe Servan, Zied Elloumi, Herv{\'e} Blanchon, Laurent Besacier

Cet article pr{\'e}sente une approche associant r{\'e}seaux lexico-s{\'e}mantiques et repr{\'e}sentations distribu{\'e}es de mots appliqu{\'e}e {\`a} l{'}{\'e}valuation de la traduction automatique.

Machine Translation

Paper
Add Code

MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP

1 code implementation • LREC 2016 • Alex B{\'e}rard, re, Christophe Servan, Olivier Pietquin, Laurent Besacier

We present MultiVec, a new toolkit for computing continuous representations for text at different granularity levels (word-level or sequences of words).

Document Classification General Classification +2

116

Paper
Code

Adaptation par enrichissement terminologique en traduction automatique statistique fond\'ee sur la g\'en\'eration et le filtrage de bi-segments virtuels

no code implementations • JEPTALNRECITAL 2015 • Christophe Servan, Marc Dymetman

Nous pr{\'e}sentons des travaux pr{\'e}liminaires sur une approche permettant d{'}ajouter des termes bilingues {\`a} un syst{\`e}me de Traduction Automatique Statistique (TAS) {\`a} base de segments.

Paper
Add Code

Calculation of Phrase Probabilities for Statistical Machine Translation by using Belief Functions

no code implementations • COLING 2012 • Christophe Servan, Simon Petitrenaud

Machine Translation Translation

Paper
Add Code

Utilisation des fonctions de croyance pour l'estimation de param\`etres en traduction automatique (Feature calculation for Statistical Machine Translation by using belief functions) [in French]

no code implementations • JEPTALNRECITAL 2012 • Christophe Servan, Simon Petitrenaud

Machine Translation

Paper
Add Code

LIUM's SMT Machine Translation Systems for WMT 2012

no code implementations • WS 2012 • Christophe Servan, Patrik Lambert, Anthony Rousseau, Holger Schwenk, Lo{\"\i}c Barrault

Information Retrieval Language Modelling +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.