Revisiting Hyperbolic t-SNE

None, None

Revisiting Hyperbolic t-SNE

gradient derivation and limitations of the t-distribution

Master Thesis (2025)

Author(s)

H.R. Xia (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

M. Skrodzki – Mentor (TU Delft - Computer Graphics and Visualisation)

K.A. Hildebrandt – Mentor (TU Delft - Computer Graphics and Visualisation)

Jasmijn A. Baaijens – Graduation committee member (TU Delft - Pattern Recognition and Bioinformatics)

Faculty

Electrical Engineering, Mathematics and Computer Science

Hyperbolic space Poincaré disk T-SNE Data visualization Dimensionality reduction Hyperbolic Geometry

To reference this document use:

https://resolver.tudelft.nl/uuid:6ec4fbf6-5141-4c8f-934b-931fe2adee4e

More Info

expand_more

Publication Year

2025

Language

English

Graduation Date

05-09-2025

Awarding Institution

Delft University of Technology

Programme

['Computer Science | Data Science and Technology']

Faculty

Electrical Engineering, Mathematics and Computer Science

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Dimensionality reduction and visualization methods have become an indispensable tool for the exploration of high-dimensional data. In this area t-SNE has established itself as a primary method; providing a means of visualizing high-dimensional data in a way that preserves local neighbourhood structures. However standard t-SNE embeds data into Euclidean space which struggles to capture properties of data that is network-like, tree-like, or contains hierarchies. As an alternative, the use of Hyperbolic space has been proposed to allow for effective embeddings of such data as connections exist between complex networks and Hyperbolic. In addition, Hyperbolic space has been effectively utilized for representing hierarchical relationships.
Given these endeavors, recent works have adapted t-SNE to Hyperbolic space using the Poincaré Disk model resulting in a method that can visually reveal network-like and hierarchical relationships. However, several related works contain an error in their derived gradient. As t-SNE uses the method of gradient descent for optimizing embeddings, this error may lead to incorrect results. In our work we first correct for this error and explore its consequences. One such consequence is that embeddings are strongly pushed outwards in the disk leading to unintelligible results. Since t-SNE uses a t-distribution to model embeddings, we propose the use of a Gaussian distribution instead as it encourages more compact embeddings. Finally we perform experiments comparing each method qualitatively by assessing resulting visualizations, and quantitatively via the PR-metric.

Files

Haoran_Xia_TU_Delft_Master_The... (pdf)

(pdf | 14.4 Mb)

License info not available