Published September 19, 2023 | Version v1
Dataset Open

On double-descent in uncertainty quantification in overparametrized models (code)

  • 1. École Polytechnique Fédérale de Lausanne (EPFL), Statistical Physics of Computation lab., CH-1015 Lausanne, Switzerland
  • 2. Département d'Informatique, École Normale Supérieure - PSL & CNRS, Paris, France
  • 3. École Polytechnique Fédérale de Lausanne (EPFL), Information, Learning and Physics lab., CH-1015 Lausanne, Switzerland

* Contact person

Description

Uncertainty quantification is a central challenge in reliable and trustworthy machine learning. Naive measures such as last-layer scores are well-known to yield overconfident estimates in the context of overparametrized neural networks. Several methods, ranging from temperature scaling to different Bayesian treatments of neural networks, have been proposed to mitigate overconfidence, most often supported by the numerical observation that they yield better calibrated uncertainty measures. In this work, we provide a sharp comparison between popular uncertainty measures for binary classification in a mathematically tractable model for overparametrized neural networks: the random features model. We discuss a trade-off between classification accuracy and calibration, unveiling a double descent like behavior in the calibration curve of optimally regularized estimators as a function of overparametrization. This is in contrast with the empirical Bayes method, which we show to be well calibrated in our setting despite the higher generalization error and overparametrization. This record provides the code to reproduce the numerical experiments of the related paper "On double-descent in uncertainty quantification in overparametrized models".

Files

File preview

files_description.md

All files

Files (521.6 KiB)

Name Size
md5:8ccfb89c76b279b78bf6765c838ebf9d
330 Bytes Preview Download
md5:eb0de263154e9b45d2248211414d9815
520.1 KiB Preview Download
md5:b51340aa9b662d3826492211878a02f2
1.2 KiB Preview Download

References

Journal reference
L. Clarte, B. Loureiro, F. Krzakala, L. Zdeborova, Proceedings of Machine Learning Research 206, 7089-7125 (2023), doi: 10.48550/arXiv.2210.12760