One-Class Classification

for high-dimensional data

Master Thesis (2019)
Author(s)

F.M.A.A. Elghlan (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

David M.J. Tax – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

MJT Reinders – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)

Mathijs M. de Weerdt – Graduation committee member (TU Delft - Algorithmics)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2019 Faris Elghlan
More Info
expand_more
Publication Year
2019
Language
English
Copyright
© 2019 Faris Elghlan
Graduation Date
27-08-2019
Awarding Institution
Delft University of Technology
Programme
['Computer Science | Data Science and Technology']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

This M.Sc. thesis report investigates the application of one-class classification techniques to complex high-dimensional data. The aim of a one-class classifier is to separate target data from non-target data, but only a dataset containing target data is available for training. The issue with high-dimensional data is that it is difficult to perform density estimation due to the `curse of dimensionality'. Most conventional method for one-class classification rely on density estimation.

This thesis focusses on the use of autoencoders and generative adversarial networks (GANs) for one-class classification problems involving image data. Autoencoders can learn encoding and decoding functions for samples from the target dataset. These encoding and decoding functions are, however, expected to not perform well for non-target samples, as they have never been seen during the training phase. This makes it possible to separate target and non-target data. For GANs, the discriminator is used to distinguish between target and non-target data.

Autoencoders and GANs are evaluated extensively in this report. Their behavior, desired parameters and strengths and weaknesses are evaluated by performing experiments. The main findings are that GANs do not perform well for one-class classification tasks, because of mode collapse and insufficient sampling of the non-target data. Even for extremely simple datasets these issues were observed. Autoencoders are shown to perform much better and behave according to the theoretical expectations.

Files

Report.pdf
(pdf | 4.31 Mb)
License info not available