Comparing De Novo and COSMIC Mutational Signatures in Single-Cell Sequencing Data

Bachelor Thesis (2025)
Author(s)

F.T.M. de Haas (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

S. Costa – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

I. Stresec – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Joana Gonçalves – Mentor (TU Delft - Pattern Recognition and Bioinformatics)

Catharine Oertel – Graduation committee member (TU Delft - Interactive Intelligence)

Faculty
Electrical Engineering, Mathematics and Computer Science
More Info
expand_more
Publication Year
2025
Language
English
Graduation Date
27-06-2025
Awarding Institution
Delft University of Technology
Project
['CSE3000 Research Project']
Programme
['Computer Science and Engineering']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Understanding mutational processes active in cancer at the single-cell level is essential for characterizing intra-tumor heterogeneity. Previous studies extracted these processes, called mutational signatures, and the known signatures can be found in the Catalogue of Somatic Mutations in Cancer (COSMIC) database. These signatures were derived based on bulk sequencing data of thousands of whole genomes. This study proposes and applies a systematic method to compare single-cell-derived de novo mutational signatures to the COSMIC signatures. Using two single-cell cancer datasets (breast and neck cancer), two stable signatures were extracted per dataset. Within each dataset, the de novo signatures were extremely similar (cosine similarity > 0.96), suggesting uniform mutational processes within individual tumors. No direct one-to-one matches were found between de novo and COSMIC signatures. However, the de novo signatures can be interpreted as combinations of known mutational processes. These results demonstrate the feasibility of extracting de novo signatures based on single-cell data, while also highlighting limitations due to possible overfitting. Future work should include simulation experiments, analysis of additional tumors, and evaluation of alternative signature extraction methods.

Files

License info not available