DimenFix

A novel meta-strategy to preserve user-defined data values on dimensionality reduction layouts

Journal Article (2025)
Author(s)

Zixuan Han (Student TU Delft)

Diede van der Hoorn (Eindhoven University of Technology)

Thomas Höllt (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Qiaodan Luo (Dalhousie University)

Leonardo Christino (Eindhoven University of Technology)

Evangelos Milios (Dalhousie University)

Fernando V. Paulovich (Eindhoven University of Technology)

Faculty
Electrical Engineering, Mathematics and Computer Science
DOI related publication
https://doi.org/10.1016/j.cag.2025.104231 Final published version
More Info
Publication Year
2025
Language
English
Journal title
Computers and Graphics
Volume number
130
Article number
104231
Downloads counter
153
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Dimensionality Reduction (DR) methods have become essential tools in the data analysis toolbox. Typically, DR methods combine the features of a multivariate dataset to produce dimensions in a reduced space while preserving some data properties, usually pairwise distances or local neighborhoods. Preserving such properties makes DR methods attractive, but it is also one of their weaknesses. When the embedded dimensions are calculated, usually through non-linear strategies, the original feature values are lost and are not explicitly represented in the spatialization of the produced layouts, making it challenging to interpret the results and understand the features’ contributions to the attained representations. Some strategies have been proposed to tackle this issue, such as coloring the DR layouts or generating explanations. Still, they are post-processes, so specific feature values are not guaranteed to be preserved or represented. This paper proposes DimenFix, a novel meta-DR strategy that explicitly preserves the values of a particular user-defined feature, or of external data not used to generate the layout, in one of the embedded axes. DimenFix can be used to preserve ordinal (e.g., numerical measures) and nominal (e.g., labels) values and works with virtually any gradient-descent DR method. It requires minimal changes to the underlying DR technique, running in time linear in the number of data instances. In our results, involving Force Scheme and t-SNE adaptations, DimenFix was capable of representing features without heavily impacting distance or neighborhood preservation, allowing the creation of hybrid layouts that join characteristics of scatter plots and DR methods.
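
To make the idea of fixing one embedded axis more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a plain stress-minimizing gradient descent as a stand-in for the underlying DR method (it is neither Force Scheme nor t-SNE), and it assumes that the chosen ordinal feature is min-max scaled and written back to the y axis after every iteration. The names `rescale` and `fixed_axis_layout`, and all parameter values, are hypothetical.

```python
import math
import random


def rescale(values, lo=0.0, hi=1.0):
    """Linearly map values to [lo, hi]; constant input maps to lo."""
    vmin, vmax = min(values), max(values)
    span = vmax - vmin
    if span == 0:
        return [lo for _ in values]
    return [lo + (hi - lo) * (v - vmin) / span for v in values]


def fixed_axis_layout(data, fixed, iterations=200, step=0.05, seed=0):
    """Gradient descent on raw stress with the y axis pinned to `fixed`.

    data  : list of feature vectors, used for the target distances
    fixed : one ordinal value per instance to show on the y axis
    (Hypothetical helper; an assumption-laden sketch of the idea.)
    """
    rng = random.Random(seed)
    n = len(data)
    target = [[math.dist(a, b) for b in data] for a in data]
    y_fixed = rescale(fixed)
    # x starts at random; y starts at the value it must keep.
    pos = [[rng.random(), y_fixed[i]] for i in range(n)]

    for _ in range(iterations):
        grad = [[0.0, 0.0] for _ in range(n)]
        for i in range(n):
            for j in range(n):
                if i == j:
                    continue
                dx = pos[i][0] - pos[j][0]
                dy = pos[i][1] - pos[j][1]
                d = math.hypot(dx, dy) or 1e-9  # avoid division by zero
                coeff = 2.0 * (d - target[i][j]) / d
                grad[i][0] += coeff * dx
                grad[i][1] += coeff * dy
        # Ordinary update of the stand-in DR method: both axes move.
        for i in range(n):
            pos[i][0] -= step * grad[i][0] / n
            pos[i][1] -= step * grad[i][1] / n
        # Fixing step: a single pass over the points restores the pinned axis.
        for i in range(n):
            pos[i][1] = y_fixed[i]
    return pos
```

In this sketch the only change to the base optimizer is the final loop, which touches each point once per iteration. The pairwise stress gradient used here is quadratic in the number of points, so any linear-time behavior refers to the added fixing pass alone. Nominal values would need a different rule for placing points on the axis, which this sketch does not attempt.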