A soft-labeled self-training approach

Conference Paper (2016)
Author(s)

Alexander Mey (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Marco Loog (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Pattern Recognition and Bioinformatics
DOI related publication
https://doi.org/10.1109/ICPR.2016.7900028 Final published version
More Info
expand_more
Publication Year
2016
Language
English
Research Group
Pattern Recognition and Bioinformatics
Pages (from-to)
2604-2609
ISBN (print)
978-1-5090-4848-9
ISBN (electronic)
978-1-5090-4847-2
Event
ICPR 2016 (2016-12-04 - 2016-12-08), Cancún, Mexico
Downloads counter
139

Abstract

Semi-supervised classification methods try to improve a supervised learned classifier with the help of unlabeled data. In many cases one assumes a certain structure on the data, as for example the manifold assumption, the smoothness assumption or the cluster assumption. Self-training is a method that does not need any assumptions on the data itself. The idea is to use the supervised trained classifier to label the unlabeled points and to enlarge this way the training data. This paper aims to show that a self-training approach with soft-labeling is preferable in many cases in terms of expected loss (risk) minimization. The main idea is to use a soft-labeling to minimize the risk on labeled and unlabeled data together, in which the hard-labeled self-training is an extreme case.