SGD_Tucker

A Novel Stochastic Optimization Strategy for Parallel Sparse Tucker Decomposition

Journal Article (2021)
Author(s)

Hao Li (National Supercomputing Center in Changsha, TU Delft - Electrical Engineering, Mathematics and Computer Science, Hunan University)

Zixuan Li (National Supercomputing Center in Changsha, Hunan University)

Kenli Li (Hunan University, National Supercomputing Center in Changsha)

Jan S. Rellermeyer (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Lydia Chen (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Keqin Li (National Supercomputing Center in Changsha, Hunan University, State University of New York)

Research Group
Data-Intensive Systems
DOI related publication
https://doi.org/10.1109/TPDS.2020.3047460 Final published version
More Info
expand_more
Publication Year
2021
Language
English
Research Group
Data-Intensive Systems
Bibliographical Note
Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.
Journal title
IEEE Transactions on Parallel and Distributed Systems
Issue number
7
Volume number
32
Article number
9309187
Pages (from-to)
1828-1841
Downloads counter
305
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Sparse Tucker Decomposition (STD) algorithms learn a core tensor and a group of factor matrices to obtain an optimal low-rank representation feature for the High-Order, High-Dimension, and Sparse Tensor (HOHDST). However, existing STD algorithms face the problem of intermediate variables explosion which results from the fact that the formation of those variables, i.e., matrices Khatri-Rao product, Kronecker product, and matrix-matrix multiplication, follows the whole elements in sparse tensor. The above problems prevent deep fusion of efficient computation and big data platforms. To overcome the bottleneck, a novel stochastic optimization strategy (SGD__Tucker) is proposed for STD which can automatically divide the high-dimension intermediate variables into small batches of intermediate matrices. Specifically, SGD__Tucker only follows the randomly selected small samples rather than the whole elements, while maintaining the overall accuracy and convergence rate. In practice, SGD__Tucker features the two distinct advancements over the state of the art. First, SGD__Tucker can prune the communication overhead for the core tensor in distributed settings. Second, the low data-dependence of SGD__Tucker enables fine-grained parallelization, which makes SGD__Tucker obtaining lower computational overheads with the same accuracy. Experimental results show that SGD__Tucker runs at least 2XX faster than the state of the art.

Files

09309187.pdf
(pdf | 3.4 Mb)
License info not available