Advanced Dimensionality Reduction for Imaging Mass Spectrometry of Human Eye Tissue through Low-Rank Modeling with Sparse and Dense Residuals

Journal Article (2025)
Author(s)

R.A.R. Moens (TU Delft - Team Raf Van de Plas)

L.G. Migas (TU Delft - Team Raf Van de Plas)

David M G Anderson (VanderBilt University)

Jeffrey D. Messinger (University of Alabama at Birmingham)

Olga S. Ovchinnikova (Oak Ridge National Laboratory, University of Tennessee)

Richard M. Caprioli (VanderBilt University)

Christine A. Curcio (University of Alabama at Birmingham)

Kevin L. Schey (VanderBilt University)

Jeffrey M. Spraggins (VanderBilt University, Vanderbilt University Medical Center)

Raf Van de Plas (TU Delft - Team Raf Van de Plas, VanderBilt University)

Research Group
Team Raf Van de Plas
DOI related publication
https://doi.org/10.1021/acs.analchem.4c06368
More Info
expand_more
Publication Year
2025
Language
English
Research Group
Team Raf Van de Plas
Journal title
Analytical Chemistry
Issue number
42
Volume number
97
Pages (from-to)
23040-23049
Downloads counter
111
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Imaging mass spectrometry (IMS) yields high-dimensional and large data sets commonly exceeding 100,000 pixels, each reporting a mass spectrum of 200,000 intensity values or more. Reducing the dimensionality and size of IMS data is often necessary to enable downstream analysis, and matrix-factorization-based approaches are often used for this purpose. However, the model underlying most of these techniques, decomposing measurements into the sum of a low-rank term (presumed signal) and a small entry-wise residual term (presumed noise), is often not optimal for IMS. For example, while spatially or spectrally sparse signals are common in IMS data, they can heavily distort the low-rank approximation. Therefore, we propose capturing the IMS data structure using low-rank models that, in addition to a dense residual, allow for sparse variation to be captured separately. We implement two such methods, principal component pursuit (PCP) and stable principal component pursuit (SPCP), apply them to IMS data, and compare them to a classical factorization method, principal component analysis (PCA). We investigate their dimensionality and noise reduction performance on MALDI Q-TOF IMS measurements of human cornea and retina tissue since the human eye is a complex organ with lots of small, tightly packed tissue substructures that are spatially sparse. Our results suggest that if parameters are set adequately, PCP and SPCP enable stronger dimensionality reduction and higher compression of IMS data compared to PCA, while concurrently reducing signal overestimation.