no code implementations • 25 Oct 2024 • Mohamed Salim Aissi, Clement Romac, Thomas Carta, Sylvain Lamprier, Pierre-Yves Oudeyer, Olivier Sigaud, Laure Soulier, Nicolas Thome
Finally, we propose to use a contrastive loss to mitigate this sensitivity and improve the robustness and generalization capabilities of LLMs.
1 code implementation • 26 Sep 2024 • Yannis Karmim, Marc Lafon, Raphael Fournier S'niehotta, Nicolas Thome
Fully connected Graph Transformers (GT) have rapidly become prominent in the static graph community as an alternative to Message-Passing models, which suffer from a lack of expressivity, oversquashing, and under-reaching.
1 code implementation • 17 Jul 2024 • Yannis Karmim, Leshanshui Yang, Raphaël Fournier S'Niehotta, Clément Chatelain, Sébastien Adam, Nicolas Thome
Dynamic link prediction is a critical task in the analysis of evolving networks, with applications ranging from recommender systems to economic exchanges.
no code implementations • 3 Jul 2024 • Yannis Karmim, Elias Ramzi, Raphaël Fournier-S'niehotta, Nicolas Thome
Moreover, we extend the evaluation of GNN models for top-k recommendation tasks with an inductive user-centric protocol, providing a more accurate reflection of real-world applications.
1 code implementation • 2 Jul 2024 • Zakariae El Asri, Olivier Sigaud, Nicolas Thome
Applying reinforcement learning (RL) to real-world applications requires addressing a trade-off between asymptotic performance, sample efficiency, and inference time.
1 code implementation • 1 Jul 2024 • Marc Lafon, Elias Ramzi, Clément Rambour, Nicolas Audebert, Nicolas Thome
Despite their success, most prompt learning methods trade-off between classification accuracy and robustness, e. g. in domain generalization or out-of-distribution (OOD) detection.
1 code implementation • 5 Jun 2024 • Paul Couairon, Mustafa Shukor, Jean-Emmanuel Haugeard, Matthieu Cord, Nicolas Thome
Foundation models have emerged as powerful tools across various domains including language, vision, and multimodal tasks.
no code implementations • 15 Mar 2024 • Marc Lafon, Clément Rambour, Nicolas Thome
In this work, we study the out-of-distribution (OOD) detection problem through the use of the feature space of a pre-trained deep classifier.
Out-of-Distribution Detection Out of Distribution (OOD) Detection
no code implementations • 15 Jan 2024 • Nathan Painchaud, Jérémie Stym-Popper, Pierre-Yves Courand, Nicolas Thome, Pierre-Marc Jodoin, Nicolas Duchateau, Olivier Bernard
Deep learning enables automatic and robust extraction of cardiac function descriptors from echocardiographic sequences, such as ejection fraction or strain.
no code implementations • 19 Oct 2023 • William Ndzimbong, Cyril Fourniol, Loic Themyr, Nicolas Thome, Yvonne Keeza, Beniot Sauer, Pierre-Thierry Piechaud, Arnaud Mejean, Jacques Marescaux, Daniel George, Didier Mutter, Alexandre Hostettler, Toby Collins
To validate the dataset's utility, 5 competitive Deep Learning models for automatic kidney segmentation were benchmarked, yielding average DICE scores from 83. 2% to 89. 1% for CT, and 61. 9% to 79. 4% for US images.
2 code implementations • 15 Sep 2023 • Elias Ramzi, Nicolas Audebert, Clément Rambour, André Araujo, Xavier Bitot, Nicolas Thome
It provides an upperbound for rank losses and ensures robust training.
1 code implementation • 13 Jul 2023 • Denis Coquenet, Clément Rambour, Emanuele Dalsasso, Nicolas Thome
Vision-language foundation models such as CLIP have shown impressive zero-shot performance on many tasks and datasets, especially thanks to their free-text inputs.
no code implementations • 14 Jun 2023 • Paul Couairon, Clément Rambour, Jean-Emmanuel Haugeard, Nicolas Thome
In this work, we introduce VidEdit, a novel method for zero-shot text-based video editing that guarantees robust temporal and spatial consistency.
1 code implementation • 26 May 2023 • Marc Lafon, Elias Ramzi, Clément Rambour, Nicolas Thome
HEAT complements prior density estimators of the ID density, e. g. parametric models like the Gaussian Mixture Model (GMM), to provide an accurate yet robust density estimation.
1 code implementation • 16 Feb 2023 • Steeven Janny, Aurélien Béneteau, Madiha Nadri, Julie Digne, Nicolas Thome, Christian Wolf
To perform future forecasting of pressure and velocity on the challenging EAGLE dataset, we introduce a new mesh transformer.
1 code implementation • 7 Feb 2023 • Laura Calem, Hedi Ben-Younes, Patrick Pérez, Nicolas Thome
Predicting multiple trajectories for road users is important for automated driving systems: ego-vehicle motion planning indeed requires a clear view of the possible motions of the surrounding agents.
no code implementations • 15 Dec 2022 • Loic Themyr, Clement Rambour, Nicolas Thome, Toby Collins, Alexandre Hostettler
In particular, vision transformers construct compressed global representations through self-attention and learnable class tokens.
1 code implementation • 8 Dec 2022 • Mustafa Shukor, Nicolas Thome, Matthieu Cord
Finally, we validate the generalization of the approach to other tasks (i. e, Food Recognition) and domains with structured text such as the Medical domain on the ROCO dataset.
Ranked #1 on Cross-Modal Retrieval on Recipe1M+
no code implementations • 11 Oct 2022 • Loic Themyr, Clément Rambour, Nicolas Thome, Toby Collins, Alexandre Hostettler
Transformer models achieve state-of-the-art results for image segmentation.
no code implementations • 26 Aug 2022 • Simon Roburin, Charles Corbière, Gilles Puy, Nicolas Thome, Matthieu Aubry, Renaud Marlet, Patrick Pérez
Predictive performance of machine learning models trained with empirical risk minimization (ERM) can degrade considerably under distribution shifts.
1 code implementation • 8 Jul 2022 • Vincent Le Guen, Clément Rambour, Nicolas Thome
Since BC is an approximate physical model violated in several situations, we propose to train a physically-constrained network complemented with a data-driven network.
2 code implementations • 5 Jul 2022 • Elias Ramzi, Nicolas Audebert, Nicolas Thome, Clément Rambour, Xavier Bitot
Image Retrieval is commonly evaluated with Average Precision (AP) or Recall@k. Yet, those metrics, are limited to binary labels and do not take into account errors' severity.
Ranked #1 on Metric Learning on DyML-Animal
no code implementations • 20 May 2022 • Rémy Sun, Alexandre Ramé, Clément Masson, Nicolas Thome, Matthieu Cord
To solve this issue, we propose a novel unmixing step in MIMO architectures that allows subnetworks to properly share features.
no code implementations • 20 May 2022 • Rémy Sun, Clément Masson, Gilles Hénaff, Nicolas Thome, Matthieu Cord
Deep architecture have proven capable of solving many tasks provided a sufficient amount of labeled data.
1 code implementation • NeurIPS 2021 • Elias Ramzi, Nicolas Thome, Clément Rambour, Nicolas Audebert, Xavier Bitot
In image retrieval, standard evaluation metrics rely on score ranking, e. g. average precision (AP).
Ranked #2 on Image Retrieval on CUB-200-2011
no code implementations • 29 Sep 2021 • Charles Corbière, Marc Lafon, Nicolas Thome, Matthieu Cord, Patrick Perez
A crucial property of KLoS is to be a class-wise divergence measure built from in-distribution samples and to not require OOD training data, in contrast to current second-order uncertainty measures.
1 code implementation • 9 Apr 2021 • Vincent Le Guen, Nicolas Thome
This paper addresses the problem of multi-step time series forecasting for non-stationary signals that can present sudden changes.
2 code implementations • 10 Mar 2021 • Olivier Petit, Nicolas Thome, Clément Rambour, Luc Soler
Medical image segmentation remains particularly challenging for complex and low-contrast anatomical structures.
no code implementations • 11 Dec 2020 • Charles Corbière, Nicolas Thome, Antoine Saporta, Tuan-Hung Vu, Matthieu Cord, Patrick Pérez
In this paper, we introduce a novel target criterion for model confidence, namely the true class probability (TCP).
1 code implementation • NeurIPS 2020 • Vincent Le Guen, Nicolas Thome
We introduce the STRIPE model for representing structured diversity based on shape and time features, ensuring both probable predictions while being sharp and accurate.
1 code implementation • 14 Oct 2020 • Vincent Le Guen, Nicolas Thome
We introduce the STRIPE model for representing structured diversity based on shape and time features, ensuring both probable predictions while being sharp and accurate.
2 code implementations • ICLR 2021 • Yuan Yin, Vincent Le Guen, Jérémie Dona, Emmanuel de Bézenac, Ibrahim Ayed, Nicolas Thome, Patrick Gallinari
In this work, we introduce the APHYNITY framework, a principled approach for augmenting incomplete physical dynamics described by differential equations with deep data-driven models.
3 code implementations • CVPR 2020 • Vincent Le Guen, Nicolas Thome
Leveraging physical knowledge described by partial differential equations (PDEs) is an appealing way to improve unsupervised video prediction methods.
Ranked #2 on Video Prediction on SynpickVP
1 code implementation • NeurIPS 2019 • Charles Corbière, Nicolas Thome, Avner Bar-Hen, Matthieu Cord, Patrick Pérez
In this paper, we propose a new target criterion for model confidence, corresponding to the True Class Probability (TCP).
1 code implementation • NeurIPS 2019 • Charles Corbière, Nicolas Thome, Avner Bar-Hen, Matthieu Cord, Patrick Pérez
In this paper, we propose a new target criterion for model confidence, corresponding to the True Class Probability (TCP).
3 code implementations • NeurIPS 2019 • Vincent Le Guen, Nicolas Thome
We introduce a differentiable loss function suitable for training deep neural nets, and provide a custom back-prop implementation for speeding up optimization.
1 code implementation • 3 Jun 2019 • Thomas Robert, Nicolas Thome, Matthieu Cord
To effectively separate the information, we propose to use a combination of regular and adversarial classifiers to guide the two branches in specializing for class and attribute information respectively.
1 code implementation • CVPR 2019 • Remi Cadene, Hedi Ben-Younes, Matthieu Cord, Nicolas Thome
In this paper, we propose MuRel, a multimodal relational network which is learned end-to-end to reason over real images.
Ranked #1 on Visual Question Answering (VQA) on TDIUC
1 code implementation • 31 Jan 2019 • Hedi Ben-Younes, Rémi Cadene, Nicolas Thome, Matthieu Cord
We demonstrate the practical interest of our fusion model by using BLOCK for two challenging tasks: Visual Question Answering (VQA) and Visual Relationship Detection (VRD), where we design end-to-end learnable architectures for representing relevant interactions between modalities.
1 code implementation • NeurIPS 2018 • Taylor Mordan, Nicolas Thome, Gilles Henaff, Matthieu Cord
Multi-Task Learning (MTL) is appealing for deep learning regularization.
no code implementations • ECCV 2018 • Thomas Robert, Nicolas Thome, Matthieu Cord
In this paper, we introduce a new model for leveraging unlabeled data to improve generalization performances of image classifiers: a two-branch encoder-decoder architecture called HybridNet.
Ranked #52 on Image Classification on STL-10
no code implementations • CVPR 2018 • Ãloi Mehr, André Lieutier, Fernando Sanchez Bermudez, Vincent Guitteny, Nicolas Thome, Matthieu Cord
Typically, we propose to quotient the space of 3D models by the action of rotations.
1 code implementation • 14 May 2018 • Michael Blot, Thomas Robert, Nicolas Thome, Matthieu Cord
Regularization is a big issue for training deep neural networks.
1 code implementation • 30 Apr 2018 • Micael Carvalho, Rémi Cadène, David Picard, Laure Soulier, Nicolas Thome, Matthieu Cord
Designing powerful tools that support cooking activities has rapidly gained popularity due to the massive amounts of available data, as well as recent advances in machine learning that are capable of analyzing them.
Ranked #9 on Cross-Modal Retrieval on Recipe1M
no code implementations • 29 Apr 2018 • Michael Blot, Thomas Robert, Nicolas Thome, Matthieu Cord
Regularization is a big issue for training deep neural networks.
no code implementations • ICLR 2018 • Michael Blot, Thomas Robert, Nicolas Thome, Matthieu Cord
Regularization is a big issue for training deep neural networks.
no code implementations • 19 Jul 2017 • Taylor Mordan, Nicolas Thome, Matthieu Cord, Gilles Henaff
Existing region-based object detectors are limited to regions with fixed box geometry to represent objects, even if those are highly non-rectangular.
2 code implementations • CVPR 2017 • Thibaut Durand, Taylor Mordan, Nicolas Thome, Matthieu Cord
This paper introduces WILDCAT, a deep learning method which jointly aims at aligning image regions for gaining spatial invariance and learning strongly localized features.
Ranked #3 on Weakly Supervised Object Detection on MS COCO
6 code implementations • ICCV 2017 • Hedi Ben-Younes, Rémi Cadene, Matthieu Cord, Nicolas Thome
Bilinear models provide an appealing framework for mixing and merging information in Visual Question Answering (VQA) tasks.
Ranked #35 on Visual Question Answering (VQA) on VQA v2 test-std
1 code implementation • 29 Nov 2016 • Michael Blot, David Picard, Matthieu Cord, Nicolas Thome
We address the issue of speeding up the training of convolutional networks.
no code implementations • 25 Oct 2016 • Michael Blot, Matthieu Cord, Nicolas Thome
Convolutional neural networks (CNN) are widely used in computer vision, especially in image classification.
1 code implementation • 18 Oct 2016 • Rémi Cadène, Nicolas Thome, Matthieu Cord
Our last contribution is a framework, build on top of Torch7, for training and testing deep models on any visual recognition tasks and on datasets of any scale.
1 code implementation • 18 Oct 2016 • Rémi Cadène, Thomas Robert, Nicolas Thome, Matthieu Cord
Our approach is among the three best to tackle the M2CAI Workflow challenge.
1 code implementation • CVPR 2016 • Thibaut Durand, Nicolas Thome, Matthieu Cord
In this paper, we introduce a novel framework for WEakly supervised Learning of Deep cOnvolutional neural Networks (WELDON).
1 code implementation • 11 May 2016 • Micael Carvalho, Matthieu Cord, Sandra Avila, Nicolas Thome, Eduardo Valle
In recent years, deep architectures have been used for transfer learning with state-of-the-art performance in many datasets.
no code implementations • ICCV 2015 • Thibaut Durand, Nicolas Thome, Matthieu Cord
For ranking, we propose efficient solutions to exactly solve the inference and the loss-augmented problems.
1 code implementation • IEEE 2015 • Xin Wang, Devinder Kumar, Nicolas Thome, Matthieu Cord, Frederic Precioso
We present deep experiments of recipe recognition on our dataset using visual, textual information and fusion.
no code implementations • CVPR 2014 • Marc T. Law, Nicolas Thome, Matthieu Cord
This paper introduces a regularization method to explicitly control the rank of a learned symmetric positive semidefinite distance matrix in distance metric learning.
no code implementations • 20 Dec 2013 • Gabriel Dulac-Arnold, Ludovic Denoyer, Nicolas Thome, Matthieu Cord, Patrick Gallinari
In this paper, we investigate a new framework for image classification that adaptively generates spatial representations.
no code implementations • NeurIPS 2013 • Hanlin Goh, Nicolas Thome, Matthieu Cord, Joo-Hwee Lim
We suggest a deep learning strategy that bridges the gap between the two phases, resulting in a three-phase learning procedure.
no code implementations • CVPR 2013 • Christian Theriault, Nicolas Thome, Matthieu Cord
In this paper, we address the challenging problem of categorizing video sequences composed of dynamic natural scenes.