Unsupervised Manifold Alignment with TopoGAN

Singh, A.

Aligning multi-modal biological data without correspondence information available across modalities

Master thesis (2021)

Authors

A. Singh (Electrical Engineering, Mathematics and Computer Science)

Contributors

A. Mahfouz, Leiden University Medical Center (mentor)

Marcel J.T. Reinders, Pattern Recognition and Bioinformatics (graduation committee member)

Christoph Lofi, Web Information Systems (graduation committee member)

T.R.M. Abdelaal, Pattern Recognition and Bioinformatics (coach)

Faculty

Electrical Engineering, Mathematics and Computer Science

Keywords

Autoencoder, Generative Adversarial Networks, Alignment, Bioinformatics, Topology

To reference this document use:

http://resolver.tudelft.nl/uuid:e2399b98-930e-4e4e-b82c-b118452ad952

Published Date

26-08-2021

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Single-cell multi-modal omics promises to open new doors in bioinformatics by measuring different aspects of cells, offering multiple perspectives on the underlying biological phenomenon. Although simultaneous multi-modal measurement protocols exist, their inherent technical limitations necessitate a focus on single-modality measurements. Single-modality measurements, however, destroy the cell being measured, so the same cell cannot be measured again in a second modality. The result is an abundance of multi-modal biological data with no sample or feature correspondence between data sets. This work proposes a novel approach to aligning multi-modal data sets in an unsupervised fashion: an Autoencoder obtains latent embeddings of the modalities, and a Generative Adversarial Network aligns these latent representations. Minimising the topological error between the original and latent representations of a data set is central to this approach, enabling not just the superposition but also the alignment of the different modalities. Two recently published methods, UnionCom and MMD-MA, are used for comparison and benchmarking. The approach, termed TopoGAN, is shown to give consistently stable alignments, to achieve better quantitative performance in realistic unsupervised settings, and to scale much better in memory requirements than these state-of-the-art methods.
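To illustrate what a topological error between original and latent representations can look like, the following is a minimal sketch and not the thesis implementation. The abstract does not state how the error is measured. The sketch assumes a formulation in the style of 0-dimensional persistent homology, in which distances along the minimum spanning tree edges of one space are compared with the distances between the same point pairs in the other space. The function names (pairwise_distances, mst_edges, topological_error) and the use of NumPy are assumptions made for illustration.

```python
import numpy as np


def pairwise_distances(points):
    """Euclidean distance matrix for an (n_samples, n_features) array."""
    diff = points[:, None, :] - points[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def mst_edges(dist):
    """Edges (i, j) of a minimum spanning tree, found with Prim's algorithm."""
    n = dist.shape[0]
    in_tree = np.zeros(n, dtype=bool)
    in_tree[0] = True
    best_dist = dist[0].copy()          # cheapest known link into the tree
    best_from = np.zeros(n, dtype=int)  # tree node providing that link
    edges = []
    for _ in range(n - 1):
        # Pick the closest node that is not yet in the tree.
        candidates = np.where(in_tree, np.inf, best_dist)
        j = int(np.argmin(candidates))
        edges.append((int(best_from[j]), j))
        in_tree[j] = True
        # The new node may offer cheaper links to the remaining nodes.
        closer = dist[j] < best_dist
        best_dist = np.where(closer, dist[j], best_dist)
        best_from = np.where(closer, j, best_from)
    return edges


def topological_error(original, latent):
    """Assumed error: mismatch of distances on the MST edges of each space."""
    d_x = pairwise_distances(original)
    d_z = pairwise_distances(latent)
    error = 0.0
    for edges in (mst_edges(d_x), mst_edges(d_z)):
        idx = np.array(edges)
        rows, cols = idx[:, 0], idx[:, 1]
        error += 0.5 * np.sum((d_x[rows, cols] - d_z[rows, cols]) ** 2)
    return float(error)


# Hypothetical data: 20 samples with 5 features and a 2-dimensional "latent" view.
rng = np.random.default_rng(0)
x = rng.normal(size=(20, 5))
z = x[:, :2]
print(topological_error(x, x))  # identical representations
print(topological_error(x, z))  # projection that distorts some distances
```

Under this assumption the error is zero when the latent representation preserves the selected distances and grows as they are distorted. In a training setting such a term would need to be written in a differentiable framework and added to the Autoencoder's reconstruction loss; how TopoGAN weights or computes it is not specified in the abstract.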

Files

MSC_Thesis_Akash_5156416.pdf

(pdf | 5.65 Mb)

License info not available