Efficient GPU Acceleration for Computing Maximal Exact Matches in Long DNA Reads

None, None; None, None; None, None

Efficient GPU Acceleration for Computing Maximal Exact Matches in Long DNA Reads

Conference Paper (2020)

Author(s)

N. Ahmed (TU Delft - Quantum & Computer Engineering)

Koen L.M. Bertels (TU Delft - QCD/Almudever Lab, TU Delft - (OLD)Quantum Computer Architectures)

Z. Al-Ars (TU Delft - Computer Engineering)

Department

Quantum & Computer Engineering

Copyright

DOI related publication

https://doi.org/10.1145/3386052.3386066

GPU Seeding DNA analysis Maximal exact matches

To reference this document use:

https://resolver.tudelft.nl/uuid:712845ad-54ca-45bf-9c5f-dd75ae51ff17

More Info

expand_more

Publication Year

2020

Language

English

Copyright

Department

Quantum & Computer Engineering

Pages (from-to)

28-34

ISBN (electronic)

978-1-4503-7676-1

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The seeding heuristic is widely used in many DNA analysis applications to speed up the analysis time. In many applications, seeding takes a substantial amount of the total execution time. In this paper, we present an efficient GPU implementation for computing maximal exact matching (MEM) seeds in long DNA reads. We applied various optimizations to reduce the number of GPU global memory accesses and to avoid redundant computation. Our implementation also extracts maximum parallelism from the MEM computation tasks. We tested our implementation using data from the state-of-the-art third generation Pacbio DNA sequencers, which produces DNA reads that are tens of kilobases long. Our implementation is up to 9x faster for computing MEM seeds as compared to the fastest CPU implementation running on a server-grade machine with 24 threads. Computing suffix array intervals (first part of MEM computation) is up to 3x faster whereas calculating the location of the match (second part) is up to 9x faster. The implementation is publicly available at https://github.com/nahmedraja/GPUseed.

Files

Icbbb2020.pdf

(pdf | 0.874 Mb)

License info not available