no code implementations • 5 Feb 2024 • Lorenzo Masoero, Mario Beraha, Thomas Richardson, Stefano Favaro
In online randomized experiments or A/B tests, accurate predictions of participant inclusion rates are of paramount importance.
no code implementations • 26 Jan 2024 • Mario Beraha, Lorenzo Masoero, Stefano Favaro, Thomas S. Richardson
We derive closed-form expressions for the number of new users expected in a given period, and a simple Monte Carlo algorithm targeting the posterior distribution of the number of days needed to attain a desired number of users; the latter is important for experimental planning.
no code implementations • 27 Sep 2023 • Mario Beraha, Stefano Favaro, Matteo Sesia
We study how to recover the frequency of a symbol in a large discrete data set, using only a compressed representation, or sketch, of those data obtained via random hashing.
1 code implementation • 22 Jan 2021 • Matteo Pegoraro, Mario Beraha
As a byproduct of our approach, we are also able to derive faster routines for previous work on PCA for distributions.
no code implementations • 29 Jul 2020 • Mario Beraha, Matteo Pegoraro, Riccardo Peli, Alessandra Guglielmi
We consider the problem of spatially dependent areal data, where for each area independent observations are available, and propose to model the density of each area through a finite mixture of Gaussian distributions.
Methodology Applications
1 code implementation • 17 Jul 2019 • Mario Beraha, Alberto Maria Metelli, Matteo Papini, Andrea Tirinzoni, Marcello Restelli
Mutual information has been successfully adopted in filter feature-selection methods to assess both the relevancy of a subset of features in predicting the target variable and the redundancy with respect to other variables.