Untangling biological factors influencing trajectory inference from single cell data

Journal Article (2020)
Authors

M. Charrout (TU Delft - Pattern Recognition and Bioinformatics, Leiden University Medical Center)

Marcel JT Reinders (TU Delft - Pattern Recognition and Bioinformatics, Leiden University Medical Center)

A. Mahfouz (TU Delft - Pattern Recognition and Bioinformatics, Leiden University Medical Center)

Research Group
Pattern Recognition and Bioinformatics
To reference this document use:
https://doi.org/10.1093/nargab/lqaa053
More Info
expand_more
Publication Year
2020
Language
English
Research Group
Pattern Recognition and Bioinformatics
Issue number
3
Volume number
2
DOI:
https://doi.org/10.1093/nargab/lqaa053
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Advances in single-cell RNA sequencing over the past decade has shifted the discussion of cell identity toward the transcriptional state of the cell. While the incredible resolution provided by single-cell RNA sequencing has led to great advances in unraveling tissue heterogeneity and inferring cell differentiation dynamics, it raises the question of which sources of variation are important for determining cellular identity. Here we show that confounding biological sources of variation, most notably the cell cycle, can distort the inference of differentiation trajectories. We show that by factorizing single cell data into distinct sources of variation, we can select a relevant set of factors that constitute the core regulators for trajectory inference, while filtering out confounding sources of variation (e.g. cell cycle) which can perturb the inferred trajectory. Script are available publicly on https://github.com/mochar/cell variation.