GPU Accelerated Sequence Alignment with Trace-back for GATK HaplotypeCaller

Journal Article (2019)
Authors

Shanshan Ren (TU Delft - Computer Engineering)

Nauman Ahmed (TU Delft - Computer Engineering)

K. Bertels (FTQC/Bertels Lab, (OLD)Quantum Computer Architectures)

Zaid Al-Ars (TU Delft - Computer Engineering)

Research Group
Computer Engineering
Copyright
© 2019 S. Ren, N. Ahmed, K.L.M. Bertels, Z. Al-Ars
To reference this document use:
https://doi.org/10.1186/s12864-019-5468-9
More Info
expand_more
Publication Year
2019
Language
English
Copyright
© 2019 S. Ren, N. Ahmed, K.L.M. Bertels, Z. Al-Ars
Research Group
Computer Engineering
Volume number
20
Pages (from-to)
103-116
DOI:
https://doi.org/10.1186/s12864-019-5468-9
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Background: Pairwise sequence alignment is widely used in many biological tools and applications. Existing GPU accelerated implementations mainly focus on calculating optimal alignment score and omit identifying the optimal alignment itself. In GATK HaplotypeCaller (HC), the semi-global pairwise sequence alignment with traceback has so far
been difficult to accelerate effectively on GPUs.
Results: We first analyze the characteristics of the semi-global alignment with traceback in GATK HC and then propose a new algorithm that allows for retrieving the optimal alignment efficiently on GPUs. For the first stage, we choose intra-task parallelization model to calculate the position of the optimal alignment score and the backtracking matrix. Moreover, in the first stage, our GPU implementation also records the length of consecutive matches/mismatches in
addition to lengths of consecutive insertions and deletions as in the CPU-based implementation. This helps efficiently
retrieve the backtracking matrix to obtain the optimal alignment in the second stage.
Conclusions: Experimental results show that our alignment kernel with traceback is up to 80x and 14.14x faster than its CPU counterpart with synthetic datasets and real datasets, respectively. When integrated into GATK HC (alongside a GPU accelerated pair-HMMs forward kernel), the overall acceleration is 2.3x faster than the baseline GATK HC
implementation, and 1.34x faster than the GATK HC implementation with the integrated GPU-based pair-HMMs forward algorithm. Although the methods proposed in this paper is to improve the performance of GATK HC, they can also be used in other pairwise alignments and applications.