Search Results for author: Riccardo Della Vecchia

Found 8 papers, 0 papers with code

AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents

no code implementations19 Jun 2023 Timothée Mathieu, Riccardo Della Vecchia, Alena Shilova, Matheus Medeiros Centa, Hector Kohler, Odalric-Ambrym Maillard, Philippe Preux

When comparing several RL algorithms, a major question is how many executions must be made and how can we ensure that the results of such a comparison are theoretically sound.

Reinforcement Learning (RL)

Stochastic Online Instrumental Variable Regression: Regrets for Endogeneity and Bandit Feedback

no code implementations18 Feb 2023 Riccardo Della Vecchia, Debabrota Basu

Endogeneity, i. e. the dependence of noise and covariates, is a common phenomenon in real data due to omitted variables, strategic behaviours, measurement errors etc.

Causal Inference regression

Cooperative Online Learning with Feedback Graphs

no code implementations9 Jun 2021 Nicolò Cesa-Bianchi, Tommaso R. Cesari, Riccardo Della Vecchia

We study the interplay between feedback and communication in a cooperative online learning setting where a network of agents solves a task in which the learners' feedback is determined by an arbitrary graph.

Finding Stable Matchings in PhD Markets with Consistent Preferences and Cooperative Partners

no code implementations23 Feb 2021 Maximilian Mordig, Riccardo Della Vecchia, Nicolò Cesa-Bianchi, Bernhard Schölkopf

Our setting is motivated by a PhD market of students, advisors, and co-advisors, and can be generalized to supply chain networks viewed as $n$-sided markets.

Computer Science and Game Theory Theoretical Economics Combinatorics

Two-Sided Matching Markets in the ELLIS 2020 PhD Program

no code implementations28 Jan 2021 Maximilian Mordig, Riccardo Della Vecchia

In this work we summarize the procedure that, in its final step, matches students to advisors in the ELLIS 2020 PhD program.

Computer Science and Game Theory Theoretical Economics Combinatorics

An Efficient Algorithm for Cooperative Semi-Bandits

no code implementations5 Oct 2020 Riccardo Della Vecchia, Tommaso Cesari

Furthermore, we prove that this is only $\sqrt$ k log k-away from the best achievable rate and that Coop-FTPL has a state-of-the-art T 3/2 worst-case computational complexity.

Combinatorial Optimization

Clustering of solutions in the symmetric binary perceptron

no code implementations15 Nov 2019 Carlo Baldassi, Riccardo Della Vecchia, Carlo Lucibello, Riccardo Zecchina

The geometrical features of the (non-convex) loss landscape of neural network models are crucial in ensuring successful optimization and, most importantly, the capability to generalize well.

Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.