Unsupervised machine learning for exploratory data analysis in imaging mass spectrometry

Review (2019)
Author(s)

Nico Verbeeck (TU Delft - Team Raf Van de Plas, Aspect Analytics NV, Katholieke Universiteit Leuven)

Richard M. Caprioli (VanderBilt University)

R Van de Plas (TU Delft - Team Raf Van de Plas, VanderBilt University)

Research Group
Team Raf Van de Plas
Copyright
© 2019 N. Verbeeck, Richard M. Caprioli, Raf Van de Plas
DOI related publication
https://doi.org/10.1002/mas.21602
More Info
expand_more
Publication Year
2019
Language
English
Copyright
© 2019 N. Verbeeck, Richard M. Caprioli, Raf Van de Plas
Research Group
Team Raf Van de Plas
Issue number
3
Volume number
39
Pages (from-to)
245-291
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Imaging mass spectrometry (IMS) is a rapidly advancing molecular imaging modality that can map the spatial distribution of molecules with high chemical specificity. IMS does not require prior tagging of molecular targets and is able to measure a large number of ions concurrently in a single experiment. While this makes it particularly suited for exploratory analysis, the large amount and high-dimensional nature of data generated by IMS techniques make automated computational analysis indispensable. Research into computational methods for IMS data has touched upon different aspects, including spectral preprocessing, data formats, dimensionality reduction, spatial registration, sample classification, differential analysis between IMS experiments, and data-driven fusion methods to extract patterns corroborated by both IMS and other imaging modalities. In this work, we review unsupervised machine learning methods for exploratory analysis of IMS data, with particular focus on (a) factorization, (b) clustering, and (c) manifold learning. To provide a view across the various IMS modalities, we have attempted to include examples from a range of approaches including matrix assisted laser desorption/ionization, desorption electrospray ionization, and secondary ion mass spectrometry-based IMS. This review aims to be an entry point for both (i) analytical chemists and mass spectrometry experts who want to explore computational techniques; and (ii) computer scientists and data mining specialists who want to enter the IMS field.