An MBO scheme for clustering and semi-supervised clustering of signed networks

Journal Article (2021)
Author(s)

Mihai Cucuringu (The Alan Turing Institute, University of Oxford)

Andrea Pizzoferrato (Imperial College London, Queen Mary University of London, The Alan Turing Institute)

Yves van Gennip (TU Delft - Mathematical Physics)

Research Group
Mathematical Physics
Copyright
© 2021 Mihai Cucuringu, Andrea Pizzoferrato, Y. van Gennip
DOI related publication
https://doi.org/10.4310/CMS.2021.v19.n1.a4
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 Mihai Cucuringu, Andrea Pizzoferrato, Y. van Gennip
Research Group
Mathematical Physics
Issue number
1
Volume number
19
Pages (from-to)
73-109
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

We introduce a principled method for the signed clustering problem, where the goal is to partition a weighted undirected graph whose edge weights take both positive and negative values, such that edges within the same cluster are mostly positive, while edges spanning across clusters are mostly negative. Our method relies on a graph-based diffuse interface model formulation utilizing the Ginzburg-Landau functional, based on an adaptation of the classic numerical Merriman-Bence-Osher (MBO) scheme for minimizing such graph-based functionals. The proposed object ive function aims to minimize the total weight of inter-cluster positively-weighted edges, while maximizing the total weight of the inter-cluster negatively-weighted edges. Our method scales to large sparse networks, and can be easily adjusted to incorporate labelled data information, as is often the case in the context of semisupervised learning. We tested our method on a number of both synthetic stochastic block models and real-world data sets (including financial correlation matrices), and obtained promising results that compare favourably against a number of state-of-the-art approaches from the recent literature.

Files

1901.03091_1.pdf
(pdf | 5.71 Mb)
License info not available