Approximated and User Steerable tSNE for Progressive Visual Analytics

Journal Article (2016)
Author(s)

Nicola Pezzotti (TU Delft - Computer Graphics and Visualisation)

B. Lelieveldy (Leiden University Medical Center, TU Delft - Pattern Recognition and Bioinformatics)

L.J.P. van der Maaten (TU Delft - Pattern Recognition and Bioinformatics)

T. Höllt (TU Delft - Computer Graphics and Visualisation)

E. Eisemann (TU Delft - Computer Graphics and Visualisation)

A. Vilanova Bartroli (TU Delft - Computer Graphics and Visualisation)

Research Group
Computer Graphics and Visualisation
Copyright
© 2016 N. Pezzotti, B.P.F. Lelieveldt, L.J.P. van der Maaten, T. Höllt, E. Eisemann, A. Vilanova Bartroli
DOI related publication
https://doi.org/10.1109/TVCG.2016.2570755
More Info
expand_more
Publication Year
2016
Language
English
Copyright
© 2016 N. Pezzotti, B.P.F. Lelieveldt, L.J.P. van der Maaten, T. Höllt, E. Eisemann, A. Vilanova Bartroli
Research Group
Computer Graphics and Visualisation
Issue number
7
Volume number
23
Pages (from-to)
1739-1752
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Progressive Visual Analytics aims at improving the interactivity in existing analytics techniques by means of visualization as well as interaction with intermediate results. One key method for data analysis is dimensionality reduction, for example, to produce 2D embeddings that can be visualized and analyzed efficiently. t-Distributed Stochastic Neighbor Embedding (tSNE) is a well-suited technique for the visualization of high-dimensional data. tSNE can create meaningful intermediate results but suffers from a slow initialization that constrains its application in Progressive Visual Analytics. We introduce a controllable tSNE approximation (A-tSNE), which trades off speed and accuracy, to enable interactive data exploration. We offer real-time visualization techniques, including a density-based solution and a Magic Lens to inspect the degree of approximation. With this feedback, the user can decide on local refinements and steer the approximation level during the analysis. We demonstrate our technique with several datasets, in a real-world research scenario and for the real-time analysis of high-dimensional streams to illustrate its effectiveness for interactive data analysis.

Files

License info not available