no code implementations • 9 Sep 2024 • Chloé Hashimoto-Cullen, Benjamin Guedj
By taking into account performance on GPs trained on a dataset and designing a random walk on these, we mitigate the training cost of our entire Bayesian decision-making procedure.
no code implementations • 30 May 2024 • Antoine Picard-Weibel, Gabriel Capson-Tojo, Benjamin Guedj, Roman Moscoviz
This article advocates for more systematic consideration of uncertainty for anaerobic digestion models and showcases a new, computationally efficient Bayesian method.
no code implementations • 15 Feb 2024 • Théophile Cantelobre, Carlo Ciliberto, Benjamin Guedj, Alessandro Rudi
Sequential Bayesian Filtering aims to estimate the current state distribution of a Hidden Markov Model, given the past observations.
no code implementations • 13 Feb 2024 • Maxime Haddouche, Paul Viallard, Umut Simsekli, Benjamin Guedj
Modern machine learning usually involves predictors in the overparametrised setting (number of trained parameters greater than dataset size), and their training yield not only good performances on training data, but also good generalisation capacity.
no code implementations • 7 Feb 2024 • Paul Viallard, Maxime Haddouche, Umut Şimşekli, Benjamin Guedj
We also instantiate our bounds as training objectives, yielding non-trivial guarantees and practical performances.
no code implementations • 20 Dec 2023 • Eugenio Clerico, Benjamin Guedj
We establish explicit dynamics for neural networks whose training objective has a regularising term that constrains the parameters to remain close to their initial value.
no code implementations • 17 Oct 2023 • Pierre Jobic, Maxime Haddouche, Benjamin Guedj
We introduce a novel strategy to train randomised predictors in federated learning, where each node of the network aims at preserving its privacy by releasing a local predictor but keeping secret its training dataset with respect to the other nodes.
1 code implementation • 16 Oct 2023 • Fredrik Hellström, Benjamin Guedj
We derive generic information-theoretic and PAC-Bayesian generalization bounds involving an arbitrary convex comparator function, which measures the discrepancy between the training and population loss.
no code implementations • 8 Sep 2023 • Fredrik Hellström, Giuseppe Durisi, Benjamin Guedj, Maxim Raginsky
Over the past decades, the PAC-Bayesian approach has been established as a flexible framework to address the generalization capabilities of machine learning algorithms, and design new ones.
no code implementations • 14 Apr 2023 • Maxime Haddouche, Benjamin Guedj
PAC-Bayes learning is an established framework to both assess the generalisation ability of learning algorithms, and design new learning algorithm by exploiting generalisation bounds as training objectives.
no code implementations • 18 Jan 2023 • Maxime Haddouche, Olivier Wintenberger, Benjamin Guedj
Optimistic Online Learning algorithms have been developed to exploit expert advices, assumed optimistically to be always useful.
no code implementations • 20 Oct 2022 • Felix Biggs, Benjamin Guedj
We introduce a modified version of the excess risk, which can be used to obtain tighter, fast-rate PAC-Bayesian generalisation bounds.
no code implementations • 3 Oct 2022 • Maxime Haddouche, Benjamin Guedj
While PAC-Bayes is now an established learning framework for light-tailed losses (\emph{e. g.}, subgaussian or subexponential), its extension to the case of heavy-tailed losses remains largely uncharted and has attracted a growing interest in recent years.
no code implementations • 6 Sep 2022 • Eugenio Clerico, Tyler Farghly, George Deligiannidis, Benjamin Guedj, Arnaud Doucet
We establish disintegrated PAC-Bayesian generalisation bounds for models trained with gradient descent methods or continuous gradient flows.
4 code implementations • 18 Jun 2022 • Antonin Schrab, Ilmun Kim, Benjamin Guedj, Arthur Gretton
We derive non-asymptotic uniform separation rates for MMDAggInc and HSICAggInc, and quantify exactly the trade-off between computational efficiency and the attainable rates: this result is novel for tests based on incomplete $U$-statistics, to our knowledge.
1 code implementation • 9 Jun 2022 • Felix Biggs, Valentina Zantedeschi, Benjamin Guedj
We study the generalisation properties of majority voting on finite ensembles of classifiers, proving margin-based generalisation bounds via the PAC-Bayes theory.
no code implementations • 31 May 2022 • Maxime Haddouche, Benjamin Guedj
Most PAC-Bayesian bounds hold in the batch learning setting where data is collected at once, prior to inference or prediction.
1 code implementation • 26 Apr 2022 • Jiale Wei, Qiyuan Chen, Pai Peng, Benjamin Guedj, Le Li
This paper presents REPRINT, a simple and effective hidden-space data augmentation method for imbalanced data classification.
no code implementations • 23 Feb 2022 • Badr-Eddine Chérief-Abdellatif, Yuyang Shi, Arnaud Doucet, Benjamin Guedj
Despite its wide use and empirical successes, the theoretical understanding and study of the behaviour and performance of the variational autoencoder (VAE) have only emerged in the past few years.
no code implementations • 11 Feb 2022 • Antoine Picard-Weibel, Benjamin Guedj
We propose new change of measure inequalities based on $f$-divergences (of which the Kullback-Leibler divergence is a particular case).
no code implementations • 11 Feb 2022 • Reuben Adams, John Shawe-Taylor, Benjamin Guedj
Current PAC-Bayes generalisation bounds are restricted to scalar metrics of performance, such as the loss or error rate.
1 code implementation • 11 Feb 2022 • Théophile Cantelobre, Carlo Ciliberto, Benjamin Guedj, Alessandro Rudi
Measures of similarity (or dissimilarity) are a key ingredient to many machine learning algorithms.
1 code implementation • 3 Feb 2022 • Felix Biggs, Benjamin Guedj
We focus on a specific class of shallow neural networks with a single hidden layer, namely those with $L_2$-normalised data and either a sigmoid-shaped Gaussian error function ("erf") activation or a Gaussian Error Linear Unit (GELU) activation.
2 code implementations • 2 Feb 2022 • Antonin Schrab, Benjamin Guedj, Arthur Gretton
KSDAgg avoids splitting the data to perform kernel selection (which leads to a loss in test power), and rather maximises the test power over a collection of kernels.
no code implementations • 15 Nov 2021 • Maria Perez-Ortiz, Omar Rivasplata, Emilio Parrado-Hernandez, Benjamin Guedj, John Shawe-Taylor
We then show that in data starvation regimes, holding out data for the test set bounds adversely affects generalisation performance, while self-certified strategies based on PAC-Bayes bounds do not suffer from this drawback, proving that they might be a suitable choice for the small data regime.
3 code implementations • NeurIPS 2023 • Antonin Schrab, Ilmun Kim, Mélisande Albert, Béatrice Laurent, Benjamin Guedj, Arthur Gretton
In practice, this parameter is unknown and, hence, the optimal MMD test with this particular kernel cannot be used.
no code implementations • 21 Sep 2021 • Maria Perez-Ortiz, Omar Rivasplata, Benjamin Guedj, Matthew Gleeson, Jingyu Zhang, John Shawe-Taylor, Miroslaw Bober, Josef Kittler
We experiment on 6 datasets with different strategies and amounts of data to learn data-dependent PAC-Bayes priors, and we compare them in terms of their effect on test performance of the learnt predictors and tightness of their risk certificate.
no code implementations • 8 Jul 2021 • Felix Biggs, Benjamin Guedj
We give a general recipe for derandomising PAC-Bayesian bounds using margins, with the critical ingredient being that our randomised predictions concentrate around some value.
1 code implementation • NeurIPS 2021 • Valentina Zantedeschi, Paul Viallard, Emilie Morvant, Rémi Emonet, Amaury Habrard, Pascal Germain, Benjamin Guedj
We investigate a stochastic counterpart of majority votes over finite ensembles of classifiers, and study its generalization properties.
no code implementations • 18 Dec 2020 • Maxime Haddouche, Benjamin Guedj, John Shawe-Taylor
Principal Component Analysis (PCA) is a popular method for dimension reduction and has attracted an unfailing interest for decades.
1 code implementation • 7 Dec 2020 • Théophile Cantelobre, Benjamin Guedj, María Pérez-Ortiz, John Shawe-Taylor
Many practical machine learning tasks can be framed as Structured prediction problems, where several output variables are predicted and considered interdependent.
1 code implementation • 24 Nov 2020 • Florent Dewez, Benjamin Guedj, Arthur Talpaert, Vincent Vandewalle
Many real-world problems require to optimise trajectories under constraints.
1 code implementation • 16 Nov 2020 • Arthur Leroy, Pierre Latouche, Benjamin Guedj, Servane Gey
A variational EM algorithm is derived for dealing with the optimisation of the hyper-parameters along with the hyper-posteriors' estimation of latent variables and processes.
1 code implementation • 22 Sep 2020 • Antoine Vendeville, Benjamin Guedj, Shi Zhou
We are able to perform time-evolving estimates of the model parameters and use these to forecast the vote shares for each party in any election.
1 code implementation • 21 Jul 2020 • Arthur Leroy, Pierre Latouche, Benjamin Guedj, Servane Gey
A novel multi-task Gaussian process (GP) framework is proposed, by using a common mean process for sharing information across tasks.
no code implementations • NeurIPS 2020 • Zakaria Mhammedi, Benjamin Guedj, Robert C. Williamson
Conditional Value at Risk (CVaR) is a family of "coherent risk measures" which generalize the traditional mathematical expectation.
no code implementations • 22 Jun 2020 • Felix Biggs, Benjamin Guedj
We make three related contributions motivated by the challenge of training stochastic neural networks, particularly in a PAC-Bayesian setting: (1) we show how averaging over an ensemble of stochastic neural networks enables a new class of \emph{partially-aggregated} estimators; (2) we show that these lead to provably lower-variance gradient estimates for non-differentiable signed-output networks; (3) we reformulate a PAC-Bayesian bound for these networks to derive a directly optimisable, differentiable objective and a generalisation guarantee, without using a surrogate loss or loosening the bound.
1 code implementation • 12 Jun 2020 • Antoine Vendeville, Benjamin Guedj, Shi Zhou
We explore a method to influence or even control the diversity of opinions within a polarised social group.
no code implementations • 12 Jun 2020 • Maxime Haddouche, Benjamin Guedj, Omar Rivasplata, John Shawe-Taylor
We present new PAC-Bayesian generalisation bounds for learning problems with unbounded loss functions.
no code implementations • 11 May 2020 • Florent Dewez, Benjamin Guedj, Vincent Vandewalle
We illustrate our approach by focusing on the estimation of the drag and lift coefficients from recorded flight data.
1 code implementation • 17 Dec 2019 • Benjamin Guedj, Bhargav Srinivasa Desikan
We propose a new supervised learning algorithm, for classification and regression problems where two or more preliminary predictors are available.
no code implementations • 10 Oct 2019 • Benjamin Guedj, Louis Pujol
"No free lunch" results state the impossibility of obtaining meaningful bounds on the error of a learning algorithm without prior assumptions and modelling.
1 code implementation • 10 Oct 2019 • Kento Nozawa, Pascal Germain, Benjamin Guedj
Contrastive unsupervised representation learning (CURL) is the state-of-the-art technique to learn representations (as a set of features) from unlabelled data.
no code implementations • 15 Sep 2019 • Vincent Cohen-Addad, Benjamin Guedj, Varun Kanade, Guy Rom
The specific formulation we use is the $k$-means objective: At each time step the algorithm has to maintain a set of k candidate centers and the loss incurred is the squared distance between the new point and the closest center.
1 code implementation • NeurIPS 2019 • Zakaria Mhammedi, Peter D. Grunwald, Benjamin Guedj
We present a new PAC-Bayesian generalization bound.
1 code implementation • NeurIPS 2019 • Gaël Letarte, Pascal Germain, Benjamin Guedj, François Laviolette
We present a comprehensive study of multilayer neural networks with binary activation, relying on the PAC-Bayesian theory.
no code implementations • 24 May 2019 • Jie M. Zhang, Mark Harman, Benjamin Guedj, Earl T. Barr, John Shawe-Taylor
MV mutates training data labels, retrains the model against the mutated data, then uses the metamorphic relation that captures the consequent training performance changes to assess model fit.
2 code implementations • 1 Apr 2019 • Benjamin Guedj, Juliette Rengot
We introduce a novel aggregation method to efficiently perform image denoising.
no code implementations • 11 Mar 2019 • Stéphane Chrétien, Benjamin Guedj
This paper studies clustering for possibly high dimensional data (e. g. images, time series, gene expression data, and many other settings), and rephrase it as low rank matrix estimation in the PAC-Bayesian framework.
no code implementations • 16 Jan 2019 • Benjamin Guedj
Generalised Bayesian learning algorithms are increasingly popular in machine learning, due to their PAC generalisation properties and flexibility.
no code implementations • 18 May 2018 • Benjamin Guedj, Le Li
When confronted with massive data streams, summarizing data with dimension reduction methods such as PCA raises theoretical and algorithmic pitfalls.
1 code implementation • 26 Apr 2018 • John Klein, Mahmoud Albardan, Benjamin Guedj, Olivier Colot
We examine a network of learners which address the same classification task but must learn from different data sets.
1 code implementation • 25 Apr 2017 • Benjamin Guedj, Bhargav Srinivasa Desikan
We introduce \texttt{pycobra}, a Python library devoted to ensemble learning (regression and classification) and visualisation.
no code implementations • 23 Oct 2016 • Pierre Alquier, Benjamin Guedj
In these bounds the Kullack-Leibler divergence is replaced with a general version of Csisz\'ar's $f$-divergence.
no code implementations • 23 Aug 2016 • Alain Celisse, Benjamin Guedj
The present paper provides a new generic strategy leading to non-asymptotic theoretical guarantees on the Leave-one-Out procedure applied to a broad class of learning algorithms.
no code implementations • 1 Feb 2016 • Le Li, Benjamin Guedj, Sébastien Loustau
When faced with high frequency streams of data, clustering raises theoretical and algorithmic pitfalls.
no code implementations • 6 Jan 2016 • Pierre Alquier, Benjamin Guedj
The aim of this paper is to provide some theoretical understanding of quasi-Bayesian aggregation methods non-negative matrix factorization.
no code implementations • 9 Nov 2015 • Benjamin Guedj, Sylvain Robbiano
This paper is devoted to the bipartite ranking problem, a classical statistical learning task, in a high dimensional setting.