Proximity isolation forests

Conference Paper (2020)
Author(s)

Antonella Mensi (Università degli Studi di Verona)

Manuele Bicego (Università degli Studi di Verona)

D.M.J. Tax (TU Delft - Pattern Recognition and Bioinformatics)

Research Group
Pattern Recognition and Bioinformatics
DOI related publication
https://doi.org/10.1109/ICPR48806.2021.9412322
More Info
expand_more
Publication Year
2020
Language
English
Research Group
Pattern Recognition and Bioinformatics
Pages (from-to)
8021-8028
ISBN (electronic)
9781728188089

Abstract

Isolation Forests are a very successful approach for solving outlier detection tasks. Isolation Forests are based on classical Random Forest classifiers that require feature vectors as input. There are many situations where vectorial data is not readily available, for instance when dealing with input sequences or strings. In these situations, one can extract higher level characteristics from the input, which is typically hard and often loses valuable information. An alternative is to define a proximity between the input objects, which can be more intuitive. In this paper we propose the Proximity Isolation Forests that extend the Isolation Forests to non-vectorial data. The introduced methodology has been thoroughly evaluated on 8 different problems and it achieves very good results also when compared to other techniques.

No files available

Metadata only record. There are no files for this record.