Degree-biased random walk for large-scale network embedding

Journal Article (2019)
Author(s)

Yunyi Zhang (Huazhong University of Science and Technology)

Zhan Shi (Huazhong University of Science and Technology)

Dan Feng (Huazhong University of Science and Technology)

Xiuxiu Zhan (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Multimedia Computing
DOI related publication
https://doi.org/10.1016/j.future.2019.05.033 Final published version
More Info
expand_more
Publication Year
2019
Language
English
Research Group
Multimedia Computing
Volume number
100
Pages (from-to)
198-209
Downloads counter
243
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Network embedding aims at learning node representation by preserving the network topology. Previous embedding methods do not scale for large real-world networks which usually contain millions of nodes. They generally adopt a one-size-fits-all strategy to collect information, resulting in a large amount of redundancy. In this paper, we propose DiaRW, a scalable network embedding method based on a degree-biased random walk with variable length to sample context information for learning. Our walk strategy can well adapt to the scale-free feature of real-world networks and extract information from them with much less redundancy. In addition, our method can greatly reduce the size of context information, which is efficient for large-scale network embedding. Empirical experiments on node classification and link prediction prove not only the effectiveness but also the efficiency of DiaRW on a variety of real-world networks. Our algorithm is able to learn the network representations with millions of nodes and edges in hours on a single machine, which is tenfold faster than previous methods.

Files

1_s2.0_S0167739X19300378_main_... (pdf)
(pdf | 0.987 Mb)
- Embargo expired in 01-07-2020
License info not available