Degree-biased random walk for large-scale network embedding

Journal article (2019)

Authors

Yunyi Zhang Huazhong University of Science and Technology

Zhan Shi Huazhong University of Science and Technology

Dan Feng Huazhong University of Science and Technology

X. Zhan

DOI

https://doi.org/10.1016/j.future.2019.05.033

Keywords

Random walks, Network embedding, Scale-free

To reference this document use:

http://resolver.tudelft.nl/uuid:affc7ad5-1b0e-4570-8ceb-1e0fe6fe1602

Published Date

2019

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Network embedding aims to learn node representations that preserve the network topology. Previous embedding methods do not scale to large real-world networks, which often contain millions of nodes. They generally adopt a one-size-fits-all strategy to collect context information, which produces a large amount of redundancy. In this paper, we propose DiaRW, a scalable network embedding method that uses a degree-biased random walk of variable length to sample context information for learning. Our walk strategy adapts well to the scale-free nature of real-world networks and extracts information from them with much less redundancy. In addition, our method greatly reduces the size of the context information, which makes it efficient for large-scale network embedding. Experiments on node classification and link prediction demonstrate both the effectiveness and the efficiency of DiaRW on a variety of real-world networks. Our algorithm learns representations of networks with millions of nodes and edges within hours on a single machine, which is ten times faster than previous methods.
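
As a rough illustration of what a degree-biased random walk with variable length could look like, the Python sketch below samples one walk from an undirected graph stored as an adjacency dictionary. The abstract does not state the transition probabilities or the rule that sets the walk length, so both are assumptions made here for illustration only: the exponent `beta`, the parameters `base_len` and `min_len`, and the function name `degree_biased_walk` are hypothetical and are not taken from DiaRW.

```python
import random


def degree_biased_walk(adj, start, base_len=10, min_len=2, beta=-1.0, rng=None):
    """Sample one walk from an undirected graph given as {node: [neighbors]}.

    Assumptions (not from the paper): every node is a key of `adj`, the next
    node is drawn with weight degree ** beta, and walks that start at
    high-degree nodes are shortened linearly.
    """
    rng = rng or random.Random()
    deg = {u: len(nbrs) for u, nbrs in adj.items()}
    max_deg = max(deg.values())

    # Hypothetical length rule: the higher the start degree, the shorter the walk.
    length = max(min_len, round(base_len * (1 - deg[start] / (max_deg + 1))))

    walk = [start]
    while len(walk) < length:
        nbrs = adj[walk[-1]]
        if not nbrs:
            break  # isolated node: nothing to step to
        # Hypothetical bias: beta < 0 favors low-degree neighbors,
        # beta > 0 favors hubs, beta = 0 gives a uniform walk.
        weights = [max(deg[v], 1) ** beta for v in nbrs]
        walk.append(rng.choices(nbrs, weights=weights, k=1)[0])
    return walk


# Small star graph: node 0 is the hub, nodes 1 to 4 are leaves.
star = {0: [1, 2, 3, 4], 1: [0], 2: [0], 3: [0], 4: [0]}
hub_walk = degree_biased_walk(star, 0, rng=random.Random(0))
leaf_walk = degree_biased_walk(star, 1, rng=random.Random(0))
```

In this toy setup the walk that starts at the hub is much shorter than the walk that starts at a leaf, which mirrors the idea of collecting less redundant context around highly connected nodes. The walks produced this way would then be fed to a context-based embedding learner.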

Files

1_s2.0_S0167739X19300378_main_... (.pdf)

(.pdf | 0.987 Mb)