Hardware Acceleration of High-Performance Computational Flow Dynamics Using High-Bandwidth Memory-Enabled Field-Programmable Gate Arrays

None, None; None, None; None, None; None, None; None, None; None, None

Hardware Acceleration of High-Performance Computational Flow Dynamics Using High-Bandwidth Memory-Enabled Field-Programmable Gate Arrays

Journal Article (2022)

Author(s)

Tom Hogervorst (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Razvan Nane (TU Delft - QCD/DiCarlo Lab)

Giacomo Marchiori (Big Data Accelerate B.V.)

Tong Dong Qiu (Big Data Accelerate B.V.)

Markus Blatt (OPM-OS AS)

Alf Birger Rustad (Equinor S.A.)

Research Group

Computer Engineering

FPGA CFD GPU HPC Iterative solvers BiCGStab ILU0

DOI related publication

https://doi.org/10.1145/3476229 Final published version

To reference this document use

https://resolver.tudelft.nl/uuid:695b9a87-d7b5-4076-aac4-1418c394da67

More Info

expand_more

Publication Year

2022

Language

English

Research Group

Computer Engineering

Issue number

2

Volume number

15

Article number

20

Pages (from-to)

1-35

Downloads counter

356

Collections

Institutional Repository

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Scientific computing is at the core of many High-Performance Computing applications, including computational flow dynamics. Because of the utmost importance to simulate increasingly larger computational models, hardware acceleration is receiving increased attention due to its potential to maximize the performance of scientific computing. Field-Programmable Gate Arrays could accelerate scientific computing because of the possibility to fully customize the memory hierarchy important in irregular applications such as iterative linear solvers. In this article, we study the potential of using Field-Programmable Gate Arrays in High-Performance Computing because of the rapid advances in reconfigurable hardware, such as the increase in on-chip memory size, increasing number of logic cells, and the integration of High-Bandwidth Memories on board. To perform this study, we propose a novel Sparse Matrix-Vector multiplication unit and an ILU0 preconditioner tightly integrated with a BiCGStab solver kernel. We integrate the developed preconditioned iterative solver in Flow from the Open Porous Media project, a state-of-the-art open source reservoir simulator. Finally, we perform a thorough evaluation of the FPGA solver kernel in both stand-alone mode and integrated in the reservoir simulator, using the NORNE field, a real-world case reservoir model using a grid with more than 105 cells and using three unknowns per cell.

Files

3476229.pdf

(pdf | 4.74 Mb)

- Embargo expired in 01-07-2023

License info not available