An Efficient Dual-Hierarchy t-SNE Minimization

Journal Article (2021)
Author(s)

Mark Van De Ruit (TU Delft - Computer Graphics and Visualisation)

M. Billeter (University of Leeds)

E. Eisemann (TU Delft - Computer Graphics and Visualisation)

Research Group
Computer Graphics and Visualisation
Copyright
© 2021 M. van de Ruit, M.J. Billeter, E. Eisemann
DOI related publication
https://doi.org/10.1109/TVCG.2021.3114817
More Info
expand_more
Publication Year
2021
Language
English
Copyright
© 2021 M. van de Ruit, M.J. Billeter, E. Eisemann
Research Group
Computer Graphics and Visualisation
Issue number
1
Volume number
28
Pages (from-to)
614-622
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

t-distributed Stochastic Neighbour Embedding (t-SNE) has become a standard for exploratory data analysis, as it is capable of revealing clusters even in complex data while requiring minimal user input. While its run-time complexity limited it to small datasets in the past, recent efforts improved upon the expensive similarity computations and the previously quadratic minimization. Nevertheless, t-SNE still has high runtime and memory costs when operating on millions of points. We present a novel method for executing the t-SNE minimization. While our method overall retains a linear runtime complexity, we obtain a significant performance increase in the most expensive part of the minimization. We achieve a significant improvement without a noticeable decrease in accuracy even when targeting a 3D embedding. Our method constructs a pair of spatial hierarchies over the embedding, which are simultaneously traversed to approximate many N-body interactions at once. We demonstrate an efficient GPGPU implementation and evaluate its performance against state-of-the-art methods on a variety of datasets.

Files

License info not available