Efficient Acceleration of the Pair-HMMs Forward Algorithm for GATK HaplotypeCaller on Graphics Processing Units

Journal Article (2018)
Authors

S. Ren (TU Delft - Computer Engineering)

K Bertels (FTQC/Bertels Lab, TU Delft - Quantum & Computer Engineering)

Zaid Al-Ars (TU Delft - Computer Engineering)

Research Group
Computer Engineering
Copyright
© 2018 S. Ren, K.L.M. Bertels, Z. Al-Ars
To reference this document use:
https://doi.org/10.1177/1176934318760543
More Info
expand_more
Publication Year
2018
Language
English
Copyright
© 2018 S. Ren, K.L.M. Bertels, Z. Al-Ars
Research Group
Computer Engineering
Volume number
14
Pages (from-to)
1-12
DOI:
https://doi.org/10.1177/1176934318760543
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

GATK HaplotypeCaller (HC) is a popular variant caller, which is widely used to identify variants in complex genomes. However, due to its high variants detection accuracy, it suffers from long execution time. In GATK HC, the pair-HMMs forward algorithm accounts for a large percentage of the total execution time. This article proposes to accelerate the pair-HMMs forward algorithm on graphics processing units (GPUs) to improve the performance of GATK HC. This article presents several GPU-based implementations of the pair-HMMs forward algorithm. It also analyzes the performance bottlenecks of the implementations on an NVIDIA Tesla K40 card with various data sets. Based on these results and the characteristics of GATK HC, we are able to identify the GPU-based implementations with the highest performance for the various analyzed data sets. Experimental results show that the GPU-based implementations of the pair-HMMs forward algorithm achieve a speedup of up to 5.47× over existing GPU-based implementations.