Consistency and Finite Sample Behavior of Binary Class Probability Estimation

Conference Paper (2021)
Author(s)

A. Mey (TU Delft - Interactive Intelligence)

M. Loog (TU Delft - Pattern Recognition and Bioinformatics)

Research Group
Pattern Recognition and Bioinformatics
Copyright
© 2021 A. Mey, M. Loog
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 A. Mey, M. Loog
Research Group
Pattern Recognition and Bioinformatics
Pages (from-to)
8967-8974
ISBN (electronic)
978-1-57735-866-4
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

We investigate to which extent one can recover class probabilities within the empirical risk minimization (ERM) paradigm. We extend existing results and emphasize the tight relations between empirical risk minimization and class probability estimation. Following previous literature on excess risk bounds and proper scoring rules, we derive a class probability estimator based on empirical risk minimization. We then derive conditions under which this estimator will converge with high probability to the true class probabilities with respect to the L1-norm. One of our core contributions is a novel way to derive finite sample L1-convergence rates of this estimator for different surrogate loss functions. We also study in detail which commonly used loss functions are suitable for this estimation problem and briefly address the setting of model-misspecification.

Files

License info not available