Semi-supervised learning, causality, and the conditional cluster assumption

None, None; None, None; None, None; None, None

Semi-supervised learning, causality, and the conditional cluster assumption

Conference Paper (2020)

Author(s)

Julius von Kügelgen (University of Cambridge, Max Planck Institute for Intelligent Systems)

Alexander Mey (TU Delft - Interactive Intelligence)

Marco Loog (TU Delft - Pattern Recognition and Bioinformatics, University of Copenhagen)

Bernhard Schölkopf (Max Planck Institute for Intelligent Systems)

Research Group

Pattern Recognition and Bioinformatics

To reference this document use:

https://resolver.tudelft.nl/uuid:9c1d31b6-2a36-48af-8fe5-9aa1bfb75ddf

More Info

expand_more

Publication Year

2020

Language

English

Research Group

Pattern Recognition and Bioinformatics

Volume number

124

Pages (from-to)

1-10

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

While the success of semi-supervised learning (SSL) is still not fully understood, Schölkopf et al. (2012) have established a link to the principle of independent causal mechanisms. They conclude that SSL should be impossible when predicting a target variable from its causes, but possible when predicting it from its effects. Since both these cases are restrictive, we extend their work by considering classification using cause and effect features at the same time, such as predicting a disease from both risk factors and symptoms. While standard SSL exploits information contained in the marginal distribution of all inputs (to improve the estimate of the conditional distribution of the target given inputs), we argue that in our more general setting we should use information in the conditional distribution of effect features given causal features. We explore how this insight generalises the previous understanding, and how it relates to and can be exploited algorithmically for SSL.

Files

11_main_paper.pdf

(pdf | 0.516 Mb)

License info not available