RUBIC identifies driver genes by detecting recurrent DNA copy number breaks

Journal Article (2016)
Author(s)

Ewald Van Dyk (TU Delft - Pattern Recognition and Bioinformatics, Nederlands Kanker Instituut - Antoni van Leeuwenhoek ziekenhuis)

Marlous Hoogstraat (Nederlands Kanker Instituut - Antoni van Leeuwenhoek ziekenhuis)

J ten Hoeve (Nederlands Kanker Instituut - Antoni van Leeuwenhoek ziekenhuis)

M.J.T. Reinders (TU Delft - Pattern Recognition and Bioinformatics)

L.F.A. Wessels (Nederlands Kanker Instituut - Antoni van Leeuwenhoek ziekenhuis, TU Delft - Pattern Recognition and Bioinformatics)

Research Group
Pattern Recognition and Bioinformatics
Copyright
© 2016 H.O. Van Dyk, M Hoogstraat, J ten Hoeve, M.J.T. Reinders, L.F.A. Wessels
DOI related publication
https://doi.org/10.1038/ncomms12159
More Info
expand_more
Publication Year
2016
Language
English
Copyright
© 2016 H.O. Van Dyk, M Hoogstraat, J ten Hoeve, M.J.T. Reinders, L.F.A. Wessels
Research Group
Pattern Recognition and Bioinformatics
Pages (from-to)
1-10
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The frequent recurrence of copy number aberrations across tumour samples is a reliable hallmark of certain cancer driver genes. However, state-of-the-art algorithms for detecting recurrent aberrations fail to detect several known drivers. In this study, we propose RUBIC, an approach that detects recurrent copy number breaks, rather than recurrently amplified or deleted regions. This change of perspective allows for a simplified approach as recursive peak splitting procedures and repeated re-estimation of the background model are avoided.
Furthermore, we control the false discovery rate on the level of called regions, rather than at the probe level, as in competing algorithms. We benchmark RUBIC against GISTIC2 (a stateof- the-art approach) and RAIG (a recently proposed approach) on simulated copy number data and on three SNP6 and NGS copy number data sets from TCGA. We show that RUBIC calls more focal recurrent regions and identifies a much larger fraction of known cancer genes.