no code implementations • 7 Feb 2024 • Lihu Chen, Alexandre Perez-Lebel, Fabian M. Suchanek, Gaël Varoquaux
In this work, we construct a new evaluation dataset derived from a knowledge base to assess confidence scores given to answers of Mistral and LLaMA.
2 code implementations • 28 Oct 2022 • Alexandre Perez-Lebel, Marine Le Morvan, Gaël Varoquaux
Yet calibration is not enough: even a perfectly calibrated classifier with the best possible accuracy can have confidence scores that are far from the true posterior probabilities.
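A toy calculation may make this claim concrete. The sketch below is not taken from the paper: it assumes a hypothetical population of two equally likely subgroups whose true posteriors are 0.6 and 1.0, and a classifier that outputs a confidence of 0.8 for every sample. All names and numbers are illustrative assumptions.

```python
from fractions import Fraction

# Hypothetical population (illustrative assumption, not from the paper):
# two equally likely subgroups with different true posteriors P(Y=1 | group).
groups = [
    {"weight": Fraction(1, 2), "posterior": Fraction(3, 5)},
    {"weight": Fraction(1, 2), "posterior": Fraction(1)},
]

# The classifier assigns the same confidence score to every sample.
score = Fraction(4, 5)

# Calibration: among samples scored 0.8, the fraction of positives should be 0.8.
positive_rate = sum(g["weight"] * g["posterior"] for g in groups)
calibration_gap = abs(positive_rate - score)

# Accuracy of the classifier, which predicts class 1 whenever score > 1/2.
predicts_positive = score > Fraction(1, 2)
accuracy = sum(
    g["weight"] * (g["posterior"] if predicts_positive else 1 - g["posterior"])
    for g in groups
)

# Best achievable accuracy: predict the more likely class within each subgroup.
bayes_accuracy = sum(
    g["weight"] * max(g["posterior"], 1 - g["posterior"]) for g in groups
)

# Average distance between the confidence score and the true posterior.
posterior_gap = sum(g["weight"] * abs(score - g["posterior"]) for g in groups)

print("calibration gap:", calibration_gap)
print("accuracy:", accuracy, "best possible:", bayes_accuracy)
print("mean |score - true posterior|:", posterior_gap)
```

In this toy setting the calibration gap is zero and the accuracy matches the best achievable one, yet the score is off by 0.2 from the true posterior in both subgroups.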
1 code implementation • 17 Feb 2022 • Alexandre Perez-Lebel, Gaël Varoquaux, Marine Le Morvan, Julie Josse, Jean-Baptiste Poline
Using gradient-boosted trees, we compare native support for missing values with simple and state-of-the-art imputation prior to learning.
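As a rough illustration of the two strategies being compared, the sketch below fits a single one-split tree (a stump) on a hypothetical toy feature. It is not the paper's benchmark or code: "native support" is modeled here in one common form, where missing values are routed as a block to whichever side of a split fits best, and the imputation baseline is plain mean imputation. The data and the helper `stump_accuracy` are assumptions made for illustration.

```python
from statistics import mean

# Toy data (illustrative assumption): None marks a missing value, and
# missingness happens to be informative about the label.
X = [1, 2, 3, 4, 5, 6, 7, None, None, None]
y = [0, 0, 0, 0, 1, 1, 1, 1, 1, 1]


def stump_accuracy(x, y, native_missing):
    """Best training accuracy of a one-split stump on a single feature.

    With native_missing=True, missing values are sent as a block to
    whichever side of the split gives the higher accuracy.
    """
    observed = sorted({v for v in x if v is not None})
    thresholds = [(a + b) / 2 for a, b in zip(observed, observed[1:])]
    directions = (True, False) if native_missing else (True,)
    best = 0.0
    for t in thresholds:
        for missing_goes_left in directions:
            sides = {True: [], False: []}
            for v, label in zip(x, y):
                go_left = missing_goes_left if v is None else v <= t
                sides[go_left].append(label)
            # Each leaf predicts its majority class.
            correct = sum(
                max(labels.count(0), labels.count(1))
                for labels in sides.values()
            )
            best = max(best, correct / len(y))
    return best


# Baseline: replace missing values with the mean of the observed ones.
fill = mean(v for v in X if v is not None)
X_imputed = [fill if v is None else v for v in X]

native_acc = stump_accuracy(X, y, native_missing=True)
imputed_acc = stump_accuracy(X_imputed, y, native_missing=False)
print("native:", native_acc, "mean imputation:", imputed_acc)
```

The toy data are built so that the imputed value collides with an observed value of the opposite class, so the gap shown here only illustrates the mechanism and says nothing about how the approaches compare on real data.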