Computational Challenges of Next Generation Sequencing Pipelines Using Heterogeneous Systems
E.J. Houtgast (TU Delft - Computer Engineering, Bluebee, Rijswijk)
V.M. Sima (Bluebee, Rijswijk)
KLM Bertels (TU Delft - Quantum & Computer Engineering, FTQC/Bertels Lab)
Z Al-Ars (TU Delft - Computer Engineering)
More Info
expand_more
Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
We are rapidly entering the era of genomics. The dramatic cost reduction of DNA sequencing due to the introduction of Next Generation Sequencing (NGS) techniques has resulted in an exponential growth of genetics data. The amount of data generated, and its associated processing into useful information, poses serious computational challenges. Here, we give a brief introduction of NGS, show a typical NGS processing pipeline, and show the associated challenges from a computational perspective. A case study is presented where one component of the NGS processing pipeline is accelerated: BWA-MEM, the de-facto industry-standard for the mapping stage. This is a first step in achieving a fully heterogeneously accelerated NGS pipeline.