no code implementations • 7 Mar 2024 • Guillaume Staerman, Marta Campi, Gareth W. Peters
Functional Isolation Forest (FIF) is a recent state-of-the-art Anomaly Detection (AD) algorithm designed for functional data.
no code implementations • 20 Feb 2024 • Anas Himmi, Guillaume Staerman, Marine Picot, Pierre Colombo, Nuno M. Guerreiro
Hallucinated translations pose significant threats and safety concerns when it comes to the practical deployment of machine translation systems.
no code implementations • 21 Oct 2023 • Pierre Colombo, Nathan Noiry, Guillaume Staerman, Pablo Piantanida
One of the pursued objectives of deep learning is to provide tools that learn abstract representations of reality from the observation of multiple contextual situations.
1 code implementation • 21 Oct 2023 • Pierre Colombo, Marine Picot, Nathan Noiry, Guillaume Staerman, Pablo Piantanida
The landscape of available textual adversarial attacks keeps growing, posing severe threats and raising concerns regarding the deep NLP system's integrity.
1 code implementation • 6 Jun 2023 • Eduardo Dadalto, Pierre Colombo, Guillaume Staerman, Nathan Noiry, Pablo Piantanida
A key feature of out-of-distribution (OOD) detection is to exploit a trained neural network by extracting statistical patterns and relationships through the multi-layer classifier to detect shifts in the expected input data distribution.
no code implementations • 31 May 2023 • Anass Aghbalou, Guillaume Staerman
Indeed, HTL relies only on a hypothesis learnt from such source data, relieving the hurdle of expansive data storage and providing great practical benefits.
no code implementations • 20 Feb 2023 • Maxime Darrin, Guillaume Staerman, Eduardo Dadalto Câmara Gomes, Jackie CK Cheung, Pablo Piantanida, Pierre Colombo
More importantly, we show that the usual choice (the last layer) is rarely the best one for OOD detection and that far better results could be achieved if the best layer were picked.
no code implementations • 24 Nov 2022 • Pierre Colombo, Eduardo D. C. Gomes, Guillaume Staerman, Nathan Noiry, Pablo Piantanida
Deep learning methods have boosted the adoption of NLP systems in real-life applications.
no code implementations • 10 Oct 2022 • Guillaume Staerman, Cédric Allain, Alexandre Gramfort, Thomas Moreau
Temporal point processes (TPP) are a natural tool for modeling event-based data.
no code implementations • ACL 2022 • Pierre Colombo, Guillaume Staerman, Nathan Noiry, Pablo Piantanida
When working with textual data, a natural application of disentangled representations is fair classification where the goal is to make predictions without being biased (or influenced) by sensitive attributes that may be present in the data (e. g., age, gender or race).
no code implementations • 13 Jan 2022 • Guillaume Staerman, Eric Adjakossa, Pavlo Mozharovskyi, Vera Hofer, Jayant Sen Gupta, Stephan Clémençon
After an overview of the state-of-the-art and a visual-descriptive study, a variety of anomaly detection methods are compared.
2 code implementations • EMNLP 2021 • Pierre Colombo, Guillaume Staerman, Chloe Clavel, Pablo Piantanida
A new metric \texttt{BaryScore} to evaluate text generation based on deep contextualized embeddings e. g., BERT, Roberta, ELMo) is introduced.
no code implementations • 21 Jun 2021 • Guillaume Staerman, Pavlo Mozharovskyi, Stéphan Clémençon
Because it determines a center-outward ordering of observations in $\mathbb{R}^d$ with $d\geq 2$, the concept of statistical depth permits to define quantiles and ranks for multivariate data and use them for various statistical tasks (e. g. inference, hypothesis testing).
1 code implementation • 23 Mar 2021 • Guillaume Staerman, Pavlo Mozharovskyi, Pierre Colombo, Stéphan Clémençon, Florence d'Alché-Buc
a probability distribution or a data set.
no code implementations • 18 Jun 2020 • Guillaume Staerman, Pierre Laforgue, Pavlo Mozharovskyi, Florence d'Alché-Buc
Issued from Optimal Transport, the Wasserstein distance has gained importance in Machine Learning due to its appealing geometrical properties and the increasing availability of efficient approximations.
no code implementations • 9 Jun 2020 • Pierre Laforgue, Guillaume Staerman, Stephan Clémençon
In contrast to the empirical mean, the Median-of-Means (MoM) is an estimator of the mean $\theta$ of a square integrable r. v.
2 code implementations • 9 Oct 2019 • Guillaume Staerman, Pavlo Mozharovskyi, Stephan Clémençon
a statistical population may play a crucial role in this regard, anomalies corresponding to observations with 'small' depth.
1 code implementation • 9 Apr 2019 • Guillaume Staerman, Pavlo Mozharovskyi, Stephan Clémençon, Florence d'Alché-Buc
For the purpose of monitoring the behavior of complex infrastructures (e. g. aircrafts, transport or energy networks), high-rate sensors are deployed to capture multivariate data, generally unlabeled, in quasi continuous-time to detect quickly the occurrence of anomalies that may jeopardize the smooth operation of the system of interest.