The Similarity Between Dissimilarities

Conference Paper (2016)
Author(s)

David Tax (TU Delft - Pattern Recognition and Bioinformatics)

Veronika Cheplygina (Erasmus MC, TU Delft - Pattern Recognition and Bioinformatics)

Bob Duin (TU Delft - Pattern Recognition and Bioinformatics)

Jan van de Poll (Transparency Lab)

DOI related publication
https://doi.org/10.1007/978-3-319-49055-7_8 Final published version
More Info
expand_more
Publication Year
2016
Language
English
Pages (from-to)
84-94
Publisher
Springer
ISBN (print)
978-3-319-49054-0
ISBN (electronic)
978-3-319-49055-7
Event
Downloads counter
154

Abstract

When characterizing teams of people, molecules, or general graphs, it is difficult to encode all information using a single feature vector only. For these objects dissimilarity matrices that do capture the interaction or similarity between the sub-elements (people, atoms, nodes), can be used. This paper compares several representations of dissimilarity matrices, that encode the cluster characteristics, latent dimensionality, or outliers of these matrices. It appears that both the simple eigenvalue spectrum, or histogram of distances are already quite effective, and are able to reach high classification performances in multiple instance learning (MIL) problems. Finally, an analysis on teams of people is given, illustrating the potential use of dissimilarity matrix characterization for business consultancy.