Paper

Quantifying the Evidence for a Planet in Radial Velocity Data

We present results from a data challenge posed to the radial velocity (RV) community: namely, to quantify the "evidence" for $n=\{0, 1, 2, 3\}$ planets in a set of synthetically generated RV datasets containing a range of planet signals. Participating teams were provided the same likelihood function and set of priors to use in their analysis. They applied a variety of methods to estimate $\widehat{\mathcal{Z}}$, the marginal likelihood for each $n$-planet model, including cross-validation, the Laplace approximation, importance sampling, and nested sampling. We found the dispersion in $\widehat{\mathcal{Z}}$ across different methods grew with increasing $n$-planet models: $\sim\,$3 for 0-planets, $\sim\,$10 for 1-planet, $\sim\,$$10^2$-$10^3$ for 2-planets, and $>\,$$10^4$ for 3-planets. Most internal estimates of uncertainty in $\widehat{\mathcal{Z}}$ for individual methods significantly underestimated the observed dispersion across all methods. Methods that adopted a Monte Carlo approach by comparing estimates from multiple runs yielded plausible uncertainties. Finally, two classes of numerical algorithms (those based on importance and nested samplers) arrived at similar conclusions regarding the ratio of $\widehat{\mathcal{Z}}$s for $n$ and ($n+1$)-planet models. One analytic method (the Laplace approximation) demonstrated comparable performance. We express both optimism and caution: we demonstrate that it is practical to perform rigorous Bayesian model comparison for $\leq$3-planet models, yet robust planet discoveries require researchers to better understand the uncertainty in $\widehat{\mathcal{Z}}$ and its connections to model selection. Future research should address greater $n$-planet models, RVs with stellar activity indicators, and many more simulated datasets with a wider range of planet and noise properties.

Results in Papers With Code
(↓ scroll down to see all results)