A soft-labeled self-training approach

Conference Paper (2016)

Author(s)

Alexander Mey (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Marco Loog (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group

Pattern Recognition and Bioinformatics

Keywords

Linear programming, Minimization, Pattern recognition, Mathematical model, Labeling, Risk management, Probability distribution

DOI related publication

https://doi.org/10.1109/ICPR.2016.7900028 (final published version)

To reference this document use

https://resolver.tudelft.nl/uuid:ebe87900-2654-484b-a69b-f2844cfd11bf

Publication Year

2016

Language

English

Pages (from-to)

2604-2609

ISBN (print)

978-1-5090-4848-9

ISBN (electronic)

978-1-5090-4847-2

Event

ICPR 2016 (2016-12-04 - 2016-12-08), Cancún, Mexico

Downloads counter

139

Abstract

Semi-supervised classification methods try to improve a classifier trained on labeled data with the help of unlabeled data. In many cases one assumes a certain structure in the data, for example the manifold assumption, the smoothness assumption, or the cluster assumption. Self-training is a method that needs no such assumptions on the data itself. The idea is to use the supervised classifier to label the unlabeled points and thereby enlarge the training set. This paper aims to show that a self-training approach with soft labeling is preferable in many cases in terms of expected loss (risk) minimization. The main idea is to use soft labels to minimize the risk on labeled and unlabeled data together; hard-labeled self-training is an extreme case of this formulation.
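
The contrast between soft and hard labeling in the abstract can be illustrated with a minimal sketch. It is not the authors' formulation: it assumes a plain logistic regression fitted by gradient descent on cross-entropy, and every name in it (`fit_logistic`, `predict_proba`, `self_train`, the `soft` flag, the toy data) is hypothetical. With `soft=True` the unlabeled points keep their predicted probabilities as targets; with `soft=False` those probabilities are rounded to 0 or 1, which corresponds to the hard-labeled extreme case.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def fit_logistic(X, q, steps=500, lr=0.1):
    """Fit logistic regression by gradient descent on cross-entropy.

    The targets q may be hard labels (0 or 1) or soft labels in [0, 1].
    """
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])  # append a bias column
    w = np.zeros(Xb.shape[1])
    for _ in range(steps):
        grad = Xb.T @ (sigmoid(Xb @ w) - q) / Xb.shape[0]
        w = w - lr * grad
    return w


def predict_proba(X, w):
    """Predicted probability of class 1 for each row of X."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    return sigmoid(Xb @ w)


def self_train(X_l, y_l, X_u, soft=True, rounds=5):
    """Generic self-training loop (illustrative assumption, rounds >= 1).

    Each round labels the unlabeled points with the current classifier and
    refits on labeled and unlabeled data together.
    """
    y_l = y_l.astype(float)
    w = fit_logistic(X_l, y_l)  # supervised starting point
    for _ in range(rounds):
        q_u = predict_proba(X_u, w)  # soft labels in [0, 1]
        if not soft:
            q_u = (q_u >= 0.5).astype(float)  # hard labels: extreme case
        X_all = np.vstack([X_l, X_u])
        q_all = np.concatenate([y_l, q_u])
        w = fit_logistic(X_all, q_all)
    return w, q_u


# Toy data: two Gaussian blobs, few labeled points, many unlabeled ones.
rng = np.random.default_rng(0)
X_l = np.vstack([rng.normal(-1.0, 1.0, (5, 2)), rng.normal(1.0, 1.0, (5, 2))])
y_l = np.array([0] * 5 + [1] * 5)
X_u = np.vstack([rng.normal(-1.0, 1.0, (50, 2)), rng.normal(1.0, 1.0, (50, 2))])

w_soft, q_soft = self_train(X_l, y_l, X_u, soft=True)
w_hard, q_hard = self_train(X_l, y_l, X_u, soft=False)
```

The sketch only shows how the two labeling schemes differ mechanically; it makes no claim about which one achieves lower risk on this toy data.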