Search Results for author: Dominique Fohr

Found 25 papers, 3 papers with code

Identification des Expressions Polylexicales dans les Tweets (Identification of Multiword Expressions in Tweets)

no code implementations JEP/TALN/RECITAL 2022 Nicolas Zampieri, Carlos Ramisch, Irina Illina, Dominique Fohr

L’identification des expressions polylexicales (EP) dans les tweets est une tâche difficile en raison de la nature linguistique complexe des EP combinée à l’utilisation d’un langage non standard.

Identification of Multiword Expressions in Tweets for Hate Speech Detection

no code implementations LREC 2022 Nicolas Zampieri, Carlos Ramisch, Irina Illina, Dominique Fohr

In this article, we present joint experiments on these two related tasks on English Twitter data: first we focus on the MWE identification task, and then we observe the influence of MWE-based features on the HSD task.

Hate Speech Detection

Unsupervised Domain Adaptation in Cross-corpora Abusive Language Detection

no code implementations NAACL (SocialNLP) 2021 Tulika Bose, Irina Illina, Dominique Fohr

The state-of-the-art abusive language detection models report great in-corpus performance, but underperform when evaluated on abusive comments that differ from the training scenario.

Abusive Language Language Modelling +1

Transferring Knowledge via Neighborhood-Aware Optimal Transport for Low-Resource Hate Speech Detection

no code implementations17 Oct 2022 Tulika Bose, Irina Illina, Dominique Fohr

The concerning rise of hateful content on online platforms has increased the attention towards automatic hate speech detection, commonly formulated as a supervised classification task.

Hate Speech Detection

Placing M-Phasis on the Plurality of Hate: A Feature-Based Corpus of Hate Online

1 code implementation LREC 2022 Dana Ruiter, Liane Reiners, Ashwin Geet D'Sa, Thomas Kleinbauer, Dominique Fohr, Irina Illina, Dietrich Klakow, Christian Schemer, Angeliki Monnier

Even though hate speech (HS) online has been an important object of research in the last decade, most HS-related corpora over-simplify the phenomenon of hate by attempting to label user comments as "hate" or "neutral".

Hate Speech Detection

Dynamically Refined Regularization for Improving Cross-corpora Hate Speech Detection

1 code implementation Findings (ACL) 2022 Tulika Bose, Nikolaos Aletras, Irina Illina, Dominique Fohr

In this paper, we propose to automatically identify and reduce spurious correlations using attribution methods with dynamic refinement of the list of terms that need to be regularized during training.

Hate Speech Detection

Improving Automatic Hate Speech Detection with Multiword Expression Features

no code implementations1 Jun 2021 Nicolas Zampieri, Irina Illina, Dominique Fohr

To incorporate MWE features, we create a three-branch deep neural network: one branch for USE, one for MWE categories, and one for MWE embeddings.

Hate Speech Detection Sentence

Introduction d'informations s\'emantiques dans un syst\`eme de reconnaissance de la parole (Despite spectacular advances in recent years, the Automatic Speech Recognition (ASR) systems still make mistakes, especially in noisy environments)

no code implementations JEPTALNRECITAL 2020 St{\'e}phane Level, Irina Illina, Dominique Fohr

Malgr{\'e} les avanc{\'e}s spectaculaires ces derni{\`e}res ann{\'e}es, les syst{\`e}mes de Reconnaissance Automatique de Parole (RAP) commettent encore des erreurs, surtout dans des environnements bruit{\'e}s. Pour am{\'e}liorer la RAP, nous proposons de se diriger vers une contextualisation d{'}un syst{\`e}me RAP, car les informations s{\'e}mantiques sont importantes pour la performance de la RAP.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Towards non-toxic landscapes: Automatic toxic comment detection using DNN

no code implementations LREC 2020 Ashwin Geet D'Sa, Irina Illina, Dominique Fohr

The contribution of this paper is the design of binary classification and regression-based approaches aiming to predict whether a comment is toxic or not.

Binary Classification

An enhanced automatic speech recognition system for Arabic

no code implementations WS 2017 Mohamed Amine Menacer, Odile Mella, Dominique Fohr, Denis Jouvet, David Langlois, Kamel Smaili

Despite all the classical techniques for Automatic Speech Recognition (ASR), which can be efficiently applied to Arabic speech recognition, it is essential to take into consideration the language specificities to improve the system performance.

Arabic Speech Recognition Automatic Speech Recognition +2

Weakly-supervised text-to-speech alignment confidence measure

no code implementations COLING 2016 Guillaume Serri{\`e}re, Christophe Cerisara, Dominique Fohr, Odile Mella

This work proposes a new confidence measure for evaluating text-to-speech alignment systems outputs, which is a key component for many applications, such as semi-automatic corpus anonymization, lips syncing, film dubbing, corpus preparation for speech synthesis and speech recognition acoustic models training.

speech-recognition Speech Recognition +1

How Diachronic Text Corpora Affect Context based Retrieval of OOV Proper Names for Audio News

no code implementations LREC 2016 Imran Sheikh, Irina Illina, Dominique Fohr

Out-Of-Vocabulary (OOV) words missed by Large Vocabulary Continuous Speech Recognition (LVCSR) systems can be recovered with the help of topic and semantic context of the OOV words captured from a diachronic text corpus.

Retrieval speech-recognition +1

The IFCASL Corpus of French and German Non-native and Native Read Speech

no code implementations LREC 2016 Juergen Trouvain, Anne Bonneau, Vincent Colotte, Camille Fauth, Dominique Fohr, Denis Jouvet, Jeanin J{\"u}gler, Yves Laprie, Odile Mella, Bernd M{\"o}bius, Frank Zimmerer

The IFCASL corpus is a French-German bilingual phonetic learner corpus designed, recorded and annotated in a project on individualized feedback in computer-assisted spoken language learning.

Learning to retrieve out-of-vocabulary words in speech recognition

no code implementations17 Nov 2015 Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès

In this paper, we propose two neural network models targeted to retrieve OOV PNs relevant to an audio document: (a) Document level Continuous Bag of Words (D-CBOW), (b) Document level Continuous Bag of Weighted Words (D-CBOW2).

Retrieval speech-recognition +1

CoALT: A Software for Comparing Automatic Labelling Tools

no code implementations LREC 2012 Dominique Fohr, Odile Mella

In this paper, we propose a GPL software CoALT (Comparing Automatic Labelling Tools) for comparing two automatic labellers or two speech-text alignment tools, ranking them and displaying statistics about their differences.

Speech Recognition Speech Synthesis

Cannot find the paper you are looking for? You can Submit a new open access paper.