Search Results for author: Patrick Paroubek

Found 37 papers, 5 papers with code

A Rough Set Formalization of Quantitative Evaluation with Ambiguity

no code implementations • LREC 2012 • Patrick Paroubek, Xavier Tannier

In this paper, we present the founding elements of a formal model of the evaluation paradigm in natural language processing.

Information Retrieval Machine Translation +2

Paper
Add Code

Indexation libre et contr\^ol\'ee d'articles scientifiques. Pr\'esentation et r\'esultats du d\'efi fouille de textes DEFT2012 (Controlled and free indexing of scientific papers. Presentation and results of the DEFT2012 text-mining challenge) [in French]

no code implementations • JEPTALNRECITAL 2012 • Patrick Paroubek, Pierre Zweigenbaum, Dominic Forest, Cyril Grouin

Lemmatization

Paper
Add Code

Improving Minor Opinion Polarity Classification with Named Entity Analysis (L'apport des Entit\'es Nomm\'ees pour la classification des opinions minoritaires) [in French]

no code implementations • JEPTALNRECITAL 2013 • Amel Fraisse, Patrick Paroubek, Gil Francopoulo

Classification General Classification +2

Paper
Add Code

Converting dependencies for syntactic analysis of French into PASSAGE functional relations (Convertir des analyses syntaxiques en d\'ependances vers les relations fonctionnelles PASSAGE) [in French]

no code implementations • JEPTALNRECITAL 2013 • Patrick Paroubek, Munshi Asadullah, Anne Vilnat

Paper
Add Code

Facing the Identification Problem in Language-Related Scientific Data Analysis.

no code implementations • LREC 2014 • Joseph Mariani, Christopher Cieri, Gil Francopoulo, Patrick Paroubek, Marine Delaborde

This paper describes the problems that must be addressed when studying large amounts of data over time which require entity normalization applied not to the usual genres of news or political speech, but to the genre of academic discourse about language resources, technologies and sciences.

Language Identification

Paper
Add Code

Bidirectionnal converter between syntactic annotations : from French Treebank Dependencies to PASSAGE annotations, and back

no code implementations • LREC 2014 • Munshi Asadullah, Patrick Paroubek, Anne Vilnat

We shall illustrate the mapping of important syntactic phenomena using the corpus made of the examples of the FTB - DEP annotation guidelines, which we have hand-annotated with PASSAGE annotations and used to compute quantitative performance measures on the FTB - DEP guidelines. n this paper we will briefly introduce the two annotation formats.

Paper
Add Code

Toward a unifying model for Opinion, Sentiment and Emotion information extraction

no code implementations • LREC 2014 • Amel Fraisse, Patrick Paroubek

This paper presents a logical formalization of a set 20 semantic categories related to opinion, emotion and sentiment.

Opinion Mining Sentiment Analysis

Paper
Add Code

Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis

no code implementations • LREC 2014 • Joseph Mariani, Patrick Paroubek, Gil Francopoulo, Olivier Hamon

It follows similar exercises that have been conducted, such as the survey on the IEEE ICASSP conference series from 1976 to 1990, which served in the launching of the ESCA Eurospeech conference, a survey of the Association of Computational Linguistics (ACL) over 50 years of existence, which was presented at the ACL conference in 2012, or a survey over the 25 years (1987-2012) of the conferences contained in the ISCA Archive, presented at Interspeech 2013.

Speech Recognition

Paper
Add Code

Automatic Analysis of Scientific and Literary Texts. Presentation and Results of the DEFT2014 Text Mining Challenge (Analyse automatique de textes litt\'eraires et scientifiques : pr\'esentation et r\'esultats du d\'efi fouille de texte DEFT2014) [in French]

no code implementations • JEPTALNRECITAL 2014 • Thierry Hamon, Quentin Plepl{\'e}, Patrick Paroubek, Pierre Zweigenbaum, Cyril Grouin

Opinion Mining

Paper
Add Code

Utiliser les interjections pour d\'etecter les \'emotions

no code implementations • JEPTALNRECITAL 2015 • Amel Fraisse, Patrick Paroubek

Des travaux en analyse de sentiments ont montr{\'e} l{'}int{\'e}r{\^e}t des {\'e}motic{\^o}nes et r{\'e}cemment des mots-di{\`e}ses, qui s{'}av{\`e}rent {\^e}tre tr{\`e}s utiles pour la classification en polarit{\'e}.

Paper
Add Code

A Study of Reuse and Plagiarism in LREC papers

no code implementations • LREC 2016 • Gil Francopoulo, Joseph Mariani, Patrick Paroubek

The aim of this experiment is to present an easy way to compare fragments of texts in order to detect (supposed) results of copy {\&} paste operations between articles in the domain of Natural Language Processing (NLP).

Paper
Add Code

Predictive Modeling: Guessing the NLP Terms of Tomorrow

no code implementations • LREC 2016 • Gil Francopoulo, Joseph Mariani, Patrick Paroubek

Predictive modeling, often called {``}predictive analytics{''} in a commercial context, encompasses a variety of statistical techniques that analyze historical and present facts to make predictions about unknown events.

Paper
Add Code

A Study of Reuse and Plagiarism in Speech and Natural Language Processing papers

no code implementations • WS 2016 • Joseph Mariani, Gil Francopoulo, Patrick Paroubek

Information Retrieval

Paper
Add Code

AppFM, une plate-forme de gestion de modules de TAL (AppFM, a tool for managing NLP modules)

no code implementations • JEPTALNRECITAL 2016 • Paul Bui-Quang, Brigitte Grau, Patrick Paroubek

AppFM 1 est un outil {\`a} mi-chemin entre un environnement de cr{\'e}ation de cha{\^\i}nes modulaires de TAL et un gestionnaire de services syst{\`e}mes.

Paper
Add Code

Providing and Analyzing NLP Terms for our Community

no code implementations • WS 2016 • Gil Francopoulo, Joseph Mariani, Patrick Paroubek, Fr{\'e}d{\'e}ric Vernier

By its own nature, the Natural Language Processing (NLP) community is a priori the best equipped to study the evolution of its own publications, but works in this direction are rare and only recently have we seen a few attempts at charting the field.

Named Entity Recognition (NER) Optical Character Recognition (OCR)

Paper
Add Code

Annotating Spin in Biomedical Scientific Publications : the case of Random Controlled Trials (RCTs)

no code implementations • LREC 2018 • Anna Koroleva, Patrick Paroubek

Decision Making

Paper
Add Code

Measuring Innovation in Speech and Language Processing Publications.

no code implementations • LREC 2018 • Joseph Mariani, Gil Francopoulo, Patrick Paroubek

Information Retrieval Optical Character Recognition (OCR)

Paper
Add Code

DEFT2018 : recherche d'information et analyse de sentiments dans des tweets concernant les transports en \^Ile de France (DEFT2018 : Information Retrieval and Sentiment Analysis in Tweets about Public Transportation in \^Ile de France Region )

no code implementations • JEPTALNRECITAL 2018 • Patrick Paroubek, Cyril Grouin, Patrice Bellot, Vincent Claveau, Iris Eshkol-Taravella, Amel Fraisse, Agata Jackiewicz, Jihen Karoui, Laura Monceaux, Juan-Manuel Torres-Moreno

Cet article pr{\'e}sente l{'}{\'e}dition 2018 de la campagne d{'}{\'e}valuation DEFT (D{\'e}fi Fouille de Textes).

Information Retrieval Retrieval +1

Paper
Add Code

Extracting relations between outcomes and significance levels in Randomized Controlled Trials (RCTs) publications

no code implementations • WS 2019 • Anna Koroleva, Patrick Paroubek

Statistical hypothesis testing is used to test if the experimental intervention is superior to the control.

Two-sample testing

Paper
Add Code

NLP Analytics in Finance with DoRe: A French 250M Tokens Corpus of Corporate Annual Reports

no code implementations • LREC 2020 • Corentin Masson, Patrick Paroubek

Recent advances in neural computing and word embeddings for semantic processing open many new applications areas which had been left unaddressed so far because of inadequate language understanding capacity.

Stock Market Prediction Word Embeddings

Paper
Add Code

DeSpin: a prototype system for detecting spin in biomedical publications

no code implementations • WS 2020 • Anna Koroleva, Sanjay Kamath, Patrick Bossuyt, Patrick Paroubek

The proposed tool is the first tool for spin detection.

Relation Extraction Semantic Similarity +3

Paper
Add Code

Natural Language Processing for Cognitive Analysis of Emotions

no code implementations • 11 Oct 2022 • Gustave Cortal, Alain Finkel, Patrick Paroubek, Lina Ye

Emotion analysis in texts suffers from two major limitations: annotated gold-standard corpora are mostly small and homogeneous, and emotion identification is often simplified as a sentence-level classification problem.

Emotion Recognition Management +1

Paper
Add Code

Emotion Recognition based on Psychological Components in Guided Narratives for Emotion Regulation

no code implementations • 15 May 2023 • Gustave Cortal, Alain Finkel, Patrick Paroubek, Lina Ye

Emotion regulation is a crucial element in dealing with emotional events and has positive effects on mental health.

Emotion Classification Emotion Recognition

Paper
Add Code

Searching for Snippets of Open-Domain Dialogue in Task-Oriented Dialogue Datasets

no code implementations • 23 Nov 2023 • Armand Stricker, Patrick Paroubek

Most existing dialogue corpora and models have been designed to fit into 2 predominant categories : task-oriented dialogues portray functional goals, such as making a restaurant reservation or booking a plane ticket, while chit-chat/open-domain dialogues focus on holding a socially engaging talk with a user.

Paper
Add Code

Enhancing Task-Oriented Dialogues with Chitchat: a Comparative Study Based on Lexical Diversity and Divergence

1 code implementation • 23 Nov 2023 • Armand Stricker, Patrick Paroubek

As a recent development, task-oriented dialogues (TODs) have been enriched with chitchat in an effort to make dialogues more diverse and engaging.

Paper
Code

A Unified Approach to Emotion Detection and Task-Oriented Dialogue Modeling

1 code implementation • 24 Jan 2024 • Armand Stricker, Patrick Paroubek

In current text-based task-oriented dialogue (TOD) systems, user emotion detection (ED) is often overlooked or is typically treated as a separate and independent task, requiring additional training.

Language Modelling

Paper
Code

Chitchat as Interference: Adding User Backstories to Task-Oriented Dialogues

1 code implementation • 23 Feb 2024 • Armand Stricker, Patrick Paroubek

During task-oriented dialogues (TODs), human users naturally introduce chitchat that is beyond the immediate scope of the task, interfering with the flow of the conversation.

Paper
Code

A Dataset for Pharmacovigilance in German, French, and Japanese: Annotating Adverse Drug Reactions across Languages

2 code implementations • 27 Mar 2024 • Lisa Raithel, Hui-Syuan Yeh, Shuntaro Yada, Cyril Grouin, Thomas Lavergne, Aurélie Névéol, Patrick Paroubek, Philippe Thomas, Tomohiro Nishiyama, Sebastian Möller, Eiji Aramaki, Yuji Matsumoto, Roland Roller, Pierre Zweigenbaum

User-generated data sources have gained significance in uncovering Adverse Drug Reactions (ADRs), with an increasing number of discussions occurring in the digital world.

Attribute

Paper
Code

MAPA Project: Ready-to-Go Open-Source Datasets and Deep Learning Technology to Remove Identifying Information from Text Documents

no code implementations • LEGAL (LREC) 2022 • Victoria Arranz, Khalid Choukri, Montse Cuadros, Aitor García Pablos, Lucie Gianola, Cyril Grouin, Manuel Herranz, Patrick Paroubek, Pierre Zweigenbaum

This paper presents the outcomes of the MAPA project, a set of annotated corpora for 24 languages of the European Union and an open-source customisable toolkit able to detect and substitute sensitive information in text documents from any domain, using state-of-the art, deep learning-based named entity recognition techniques.

De-identification named-entity-recognition +2

Paper
Add Code

A Fine-Grained Annotated Corpus for Target-Based Opinion Analysis of Economic and Financial Narratives

no code implementations • EMNLP (ECONLP) 2021 • Jiahui Hu, Patrick Paroubek

In this paper, we present our pre-annotation models and evaluations of their performance, introduce our annotation scheme and report on the main characteristics of our corpus.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA)

Paper
Add Code

Définition et détection des incohérences du système dans les dialogues orientés tâche. (We present experiments on automatically detecting inconsistent behavior of task-oriented dialogue systems from the context)

no code implementations • JEP/TALN/RECITAL 2021 • Léon-Paul Schaub, Vojtech Hudecek, Daniel Stancl, Ondrej Dusek, Patrick Paroubek

Définition et détection des incohérences du système dans les dialogues orientés tâche.

Task-Oriented Dialogue Systems

Paper
Add Code

A Unifying View On Task-oriented Dialogue Annotation

1 code implementation • LREC 2022 • Vojtěch Hudeček, Leon-paul Schaub, Daniel Stancl, Patrick Paroubek, Ondřej Dušek

In this paper, we present a new dataset, obtained by merging four publicly available annotated corpora for task-oriented dialogues in several domains (MultiWOZ 2. 2, CamRest676, DSTC2 and Schema-Guided Dialogue Dataset).

Dialogue Generation Dialogue State Tracking +1

Paper
Code

Differential Evaluation: a Qualitative Analysis of Natural Language Processing System Behavior Based Upon Data Resistance to Processing

no code implementations • EMNLP (Eval4NLP) 2021 • Lucie Gianola, Hicham El Boukkouri, Cyril Grouin, Thomas Lavergne, Patrick Paroubek, Pierre Zweigenbaum

Paper
Add Code

A sequence to sequence transformer data logic experiment

no code implementations • FNP 2021 • Danxin Cui, Dominique Mariko, Estelle Labidurie, Hugues de Mazancourt, Patrick Paroubek

Paper
Add Code

Annotation model and corpus for opinionated economy and finance narrative detection

no code implementations • FNP 2021 • Jiahui Hu, Patrick Paroubek, Dirk Schumacher

Paper
Add Code

The Multilingual Anonymisation Toolkit for Public Administrations (MAPA) Project

no code implementations • EAMT 2020 • Ēriks Ajausks, Victoria Arranz, Laurent Bié, Aleix Cerdà-i-Cucó, Khalid Choukri, Montse Cuadros, Hans Degroote, Amando Estela, Thierry Etchegoyhen, Mercedes García-Martínez, Aitor García-Pablos, Manuel Herranz, Alejandro Kohan, Maite Melero, Mike Rosner, Roberts Rozis, Patrick Paroubek, Artūrs Vasiļevskis, Pierre Zweigenbaum

We describe the MAPA project, funded under the Connecting Europe Facility programme, whose goal is the development of an open-source de-identification toolkit for all official European Union languages.

De-identification

Paper
Add Code

Un corpus annoté pour la génération de questions et l’extraction de réponses pour l’enseignement (An annotated corpus for abstractive question generation and extractive answer for education)

no code implementations • JEP/TALN/RECITAL 2022 • Thomas Gerald, Sofiane Ettayeb, Ha Quang Le, Anne Vilnat, Gabriel Illouz, Patrick Paroubek

Dans cette démonstration, nous présenterons les travaux en cours pour l’annotation d’un nouveau corpus de questions-réponses en langue Française.

Question Generation Question-Generation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.