scTopoGAN

unsupervised manifold alignment of single-cell data

Journal Article (2023)
Authors

A. Singh (TU Delft - Pattern Recognition and Bioinformatics)

K. Biharie (TU Delft - Pattern Recognition and Bioinformatics, Leiden University Medical Center)

Marcel J. T. Reinders (TU Delft - Pattern Recognition and Bioinformatics, Leiden University Medical Center)

A. Mahfouz (Leiden University Medical Center, TU Delft - Pattern Recognition and Bioinformatics)

T.R.M. Abdelaal (Leiden University Medical Center, TU Delft - Pattern Recognition and Bioinformatics)

Research Group
Pattern Recognition and Bioinformatics
Copyright
© 2023 A. Singh, K. Biharie, M.J.T. Reinders, A.M.E.T.A. Mahfouz, T.R.M. Abdelaal
To reference this document use:
https://doi.org/10.1093/bioadv/vbad171
More Info
expand_more
Publication Year
2023
Language
English
Copyright
© 2023 A. Singh, K. Biharie, M.J.T. Reinders, A.M.E.T.A. Mahfouz, T.R.M. Abdelaal
Research Group
Pattern Recognition and Bioinformatics
Issue number
1
Volume number
3
DOI:
https://doi.org/10.1093/bioadv/vbad171
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Motivation: Single-cell technologies allow deep characterization of different molecular aspects of cells. Integrating these modalities provides a comprehensive view of cellular identity. Current integration methods rely on overlapping features or cells to link datasets measuring different modalities, limiting their application to experiments where different molecular layers are profiled in different subsets of cells. Results: We present scTopoGAN, a method for unsupervised manifold alignment of single-cell datasets with non-overlapping cells or features. We use topological autoencoders (topoAE) to obtain latent representations of each modality separately. A topology-guided Generative Adversarial Network then aligns these latent representations into a common space. We show that scTopoGAN outperforms state-of-the-art manifold alignment methods in complete unsupervised settings. Interestingly, the topoAE for individual modalities also showed better performance in preserving the original structure of the data in the low-dimensional representations when compared to other manifold projection methods. Taken together, we show that the concept of topology preservation might be a powerful tool to align multiple single modality datasets, unleashing the potential of multi-omic interpretations of cells.