An Alternative Exploitation of Isolation Forests for Outlier Detection

Conference Paper (2021)
Author(s)

Antonella Mensi (University of Verona)

Alessio Franzoni (University of Verona)

David M.J. Tax (TU Delft - Pattern Recognition and Bioinformatics)

Manuele Bicego (University of Verona)

Research Group
Pattern Recognition and Bioinformatics
DOI related publication
https://doi.org/10.1007/978-3-030-73973-7_4
More Info
expand_more
Publication Year
2021
Language
English
Research Group
Pattern Recognition and Bioinformatics
Pages (from-to)
34-44
ISBN (print)
9783030739720

Abstract

Isolation Forests are one of the most successful outlier detection techniques: they isolate outliers by performing random splits in each node. It has been recently shown that a trained Random Forest-based model can also be used to define and extract informative distance measures between objects. Although their success has been shown mainly in the clustering field, we propose to extract these pairwise distances between the objects from an Isolation Forest and use them as input to a distance or density-based outlier detector. We show that the extracted distances from Isolation Forests are able to describe outliers meaningfully. We evaluate our technique on ten benchmark datasets for outlier detection: we employ three different distance measures and evaluate the obtained representation using a density-based classifier, the Local Outlier Factor. We also compare the methodology to the standard Isolation Forests scheme.

No files available

Metadata only record. There are no files for this record.