Projected estimators for robust semi-supervised classification

Journal Article (2017)

Author(s)

J.H. Krijthe (TU Delft - Pattern Recognition and Bioinformatics, Leiden University Medical Center)

M. Loog (University of Copenhagen, TU Delft - Pattern Recognition and Bioinformatics)

Research Group

Pattern Recognition and Bioinformatics

DOI related publication

https://doi.org/10.1007/s10994-017-5626-8

Keywords

Projection, Least squares classification, Semi-supervised learning

To reference this document use:

https://resolver.tudelft.nl/uuid:becaa8c9-7e5a-46c3-aaca-87763b45a28a

Publication Year

2017

Language

English

Issue number

7

Volume number

106

Pages (from-to)

993-1008

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

For semi-supervised techniques to be applied safely in practice, we at least want methods to outperform their supervised counterparts. We study this question for classification using the well-known quadratic surrogate loss function. Unlike other approaches to semi-supervised learning, the procedure proposed in this work does not rely on assumptions that are not intrinsic to the classifier at hand. Using a projection of the supervised estimate onto a set of constraints imposed by the unlabeled data, we find we can safely improve over the supervised solution in terms of this quadratic loss. More specifically, we prove that, measured on the labeled and unlabeled training data, this semi-supervised procedure never gives a higher quadratic loss than the supervised alternative. To our knowledge this is the first approach that offers such strong, albeit conservative, guarantees for improvement over the supervised solution. The characteristics of our approach are explicated using benchmark datasets to further understand the similarities and differences between the quadratic loss criterion used in the theoretical results and the classification accuracy typically considered in practice.
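The projection idea in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not spell out the constraint set or the distance used for the projection, so the choices below are assumptions made for illustration: the constraint set is taken to be all least squares solutions obtainable by assigning soft labels in [0, 1] to the unlabeled points, and the projection is taken in the metric induced by the combined labeled and unlabeled design matrix. The names `projected_least_squares`, `quadratic_loss` and `make_data` are hypothetical, and a simple projected gradient loop stands in for a proper quadratic programming solver.

```python
import numpy as np


def quadratic_loss(X, y, w):
    """Sum of squared residuals of the linear model X @ w against targets y."""
    r = X @ w - y
    return float(r @ r)


def projected_least_squares(X, y, X_u, n_iter=2000):
    """Sketch: project the supervised solution onto an assumed constraint set.

    X   : (n_l, d) labeled design matrix (include a constant column for a bias)
    y   : (n_l,)   labels encoded as 0/1
    X_u : (n_u, d) unlabeled design matrix
    Returns (w_sup, w_semi).
    """
    # Supervised least squares solution, using the labeled data only.
    w_sup = np.linalg.lstsq(X, y, rcond=None)[0]

    # Assumed constraint set: all solutions w(q) = pinv(X_e) @ [y; q] that can
    # be reached by giving the unlabeled points soft labels q in [0, 1].
    X_e = np.vstack([X, X_u])
    P = np.linalg.pinv(X_e)
    n_l = X.shape[0]
    a = P[:, :n_l] @ y   # contribution of the known labels
    B = P[:, n_l:]       # maps soft labels q to a shift in w
    G = X_e.T @ X_e      # assumed metric for the projection

    # Projected gradient descent on f(q) = (w(q) - w_sup)^T G (w(q) - w_sup).
    # B^T G B is a principal block of an orthogonal projection matrix, so its
    # eigenvalues lie in [0, 1] and a step size of 0.5 is safe.
    q = np.clip(X_u @ w_sup, 0.0, 1.0)
    for _ in range(n_iter):
        grad = 2.0 * (B.T @ (G @ (a + B @ q - w_sup)))
        q = np.clip(q - 0.5 * grad, 0.0, 1.0)

    return w_sup, a + B @ q


def make_data(n, rng):
    """Toy two-class data: class 1 is shifted by one unit in every feature."""
    y = rng.integers(0, 2, size=n).astype(float)
    X = rng.normal(size=(n, 4)) + y[:, None]
    return np.hstack([np.ones((n, 1)), X]), y


rng = np.random.default_rng(0)
X, y = make_data(8, rng)          # few labeled points
X_u, y_u = make_data(200, rng)    # many unlabeled points; y_u is held out

w_sup, w_semi = projected_least_squares(X, y, X_u)

# Compare quadratic loss on the labeled and unlabeled training data together.
X_all, y_all = np.vstack([X, X_u]), np.concatenate([y, y_u])
print("supervised loss:", quadratic_loss(X_all, y_all, w_sup))
print("projected loss: ", quadratic_loss(X_all, y_all, w_semi))
```

Under these assumptions, the held-out labels `y_u` are used only to evaluate the loss, never to fit the model. The comparison is made on the combined labeled and unlabeled training data because that is where the abstract states the guarantee applies; it says nothing about classification accuracy, which can behave differently.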

Files

10.1007_s10994_017_5626_8.pdf

(pdf | 0.755 Mb)