Multi-label Node Classification On Graph-Structured Data

Journal Article (2023)

Author(s)

T. Zhao (TU Delft - Multimedia Computing)

Ngan Thi Dong (L3S Research Center)

A. Hanjalic (TU Delft - Intelligent Systems)

Megha Khosla (TU Delft - Multimedia Computing)

Multimedia Computing

To reference this document use:

https://resolver.tudelft.nl/uuid:ceb87edc-b316-465f-9417-280d08c55c08

Publication Year

2023

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Graph Neural Networks (GNNs) have achieved state-of-the-art results in node classification tasks on graphs. While these improvements have largely been demonstrated in the multi-class classification scenario, the more general and realistic scenario in which each node can have multiple labels has so far received little attention. The first challenge in conducting focused studies on multi-label node classification is the limited number of publicly available multi-label graph datasets. Therefore, as our first contribution, we collect and release three real-world biological datasets and develop a multi-label graph generator to generate datasets with tunable properties. While the success of GNNs is usually attributed to high label similarity between connected nodes (high homophily), we argue that the multi-label scenario does not follow the usual semantics of homophily and heterophily defined so far for the multi-class scenario. As our second contribution, we define homophily and Cross-Class Neighborhood Similarity for the multi-label scenario and provide a thorough analysis of the collected multi-label datasets. Finally, we perform a large-scale comparative study across methods and datasets and analyse the performance of the methods to assess the progress made by the current state of the art in multi-label node classification. We release our benchmark at https://github.com/Tianqi-py/MLGNC.
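
The abstract does not spell out how homophily is extended to label sets. As a minimal sketch of one plausible formulation, the snippet below averages the Jaccard similarity of the two endpoint label sets over all edges. The use of Jaccard similarity, the function names, and the toy graph are assumptions made for illustration only and may differ from the definition used in the paper.

```python
from typing import Dict, Iterable, Set, Tuple


def jaccard(a: Set[int], b: Set[int]) -> float:
    """Jaccard similarity of two label sets; 0.0 when both are empty."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def multilabel_edge_homophily(
    edges: Iterable[Tuple[int, int]],
    labels: Dict[int, Set[int]],
) -> float:
    """Average Jaccard similarity of label sets over all edges.

    Assumption: this is an illustrative generalisation of edge homophily,
    not necessarily the definition given in the paper.
    """
    edge_list = list(edges)
    if not edge_list:
        return 0.0
    total = sum(jaccard(labels[u], labels[v]) for u, v in edge_list)
    return total / len(edge_list)


# Hypothetical toy graph: a path of 4 nodes, each with a set of label ids.
edges = [(0, 1), (1, 2), (2, 3)]
labels = {0: {0, 1}, 1: {1}, 2: {1, 2}, 3: {3}}

h = multilabel_edge_homophily(edges, labels)
print(f"multi-label edge homophily: {h:.3f}")
```

In the multi-class case every label set has exactly one element, so each edge contributes either 0 or 1 and the measure reduces to the usual fraction of edges joining same-class nodes. With multiple labels per node, partially overlapping label sets contribute fractional values, which is one way to see why the multi-class notions of homophily and heterophily do not carry over directly.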