Analysis of cell deconvolution methods

A comparison of reference-based and reference-free cell deconvolution

Bachelor Thesis (2024)
Author(s)

S.M. Howard (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

S. Makrodimitris – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Marcel J. T. Reinders – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Bram Pronk – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Daan Hazelaar – Mentor (Erasmus MC)

JA Pouwelse – Graduation committee member (TU Delft - Data-Intensive Systems)

Faculty
Electrical Engineering, Mathematics and Computer Science
More Info
expand_more
Publication Year
2024
Language
English
Graduation Date
28-06-2024
Awarding Institution
Delft University of Technology
Project
['CSE3000 Research Project']
Programme
['Computer Science and Engineering']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

In recent years, a new way of cancer diagnostics has emerged, the analysis of DNA fragments circulating in the blood of cancer patients known as fragmentomics. This DNA, known as cell-free DNA (cfDNA), is an easily available biomarker for cell types. Deducing the tissue origin of cfDNA can reveal anomalies in cell death caused by diseases. That holds great potential in cancer detection and monitoring. The process of establishing the cell composition of a blood sample is called cell deconvolution. This research paper focuses on the comparison of two methods of cell deconvolution. The first one UXM, solves this problem by employing a reference-based technique using a methylation atlas. The second one reference-free cfSort, utilizes a Deep Learning Neural Network to perform the sample analysis. In the paper, however, a simpler architecture was trained due to the difficulties in reproduction. Experiments have been conducted to assess the sensitivity of both methods. Experiments consisted of 5 major cell types together mixed with white blood cell DNA fragments to assess the sensitivity of each method. Furthermore, different metrics such as Pearson's correlation coefficient have been used to determine the accuracy of both methods. In the end, UXM outperformed cfSort in most metrics, including Pearson's correlation coefficient, indicating its superior accuracy in deconvolution tasks. However, cfSort showed potential for higher prediction accuracy with further development and better documentation. The findings highlight the strengths and limitations of both methods. This study suggests that while UXM is currently more reliable, future improvements in cfSort could make it a viable alternative. Continued research is recommended to enhance the accuracy and transparency of these methods, ensuring their effectiveness in real-world healthcare applications.

Files

Research_paper_final.pdf
(pdf | 0.568 Mb)
License info not available