Untangling biological factors influencing trajectory inference from single cell data

Journal Article (2020)
Author(s)

Mohammed Charrout (TU Delft - Pattern Recognition and Bioinformatics, Leiden University Medical Center)

Marcel J.T. Reinders (TU Delft - Pattern Recognition and Bioinformatics, Leiden University Medical Center)

Ahmed Mahfouz (TU Delft - Pattern Recognition and Bioinformatics, Leiden University Medical Center)

DOI related publication
https://doi.org/10.1093/nargab/lqaa053 Final published version
More Info
expand_more
Publication Year
2020
Language
English
Journal title
NAR Genomics and Bioinformatics
Issue number
3
Volume number
2
Article number
lqaa053
Downloads counter
140
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Advances in single-cell RNA sequencing over the past decade has shifted the discussion of cell identity toward the transcriptional state of the cell. While the incredible resolution provided by single-cell RNA sequencing has led to great advances in unraveling tissue heterogeneity and inferring cell differentiation dynamics, it raises the question of which sources of variation are important for determining cellular identity. Here we show that confounding biological sources of variation, most notably the cell cycle, can distort the inference of differentiation trajectories. We show that by factorizing single cell data into distinct sources of variation, we can select a relevant set of factors that constitute the core regulators for trajectory inference, while filtering out confounding sources of variation (e.g. cell cycle) which can perturb the inferred trajectory. Script are available publicly on https://github.com/mochar/cell variation.