Print Email Facebook Twitter Closing the performance gap between an iterative frequency-domain solver and an explicit time-domain scheme for 3D migration on parallel architectures Title Closing the performance gap between an iterative frequency-domain solver and an explicit time-domain scheme for 3D migration on parallel architectures Author Knibbe, H.P. Mulder, W.A. Oosterlee, C.W. Vuik, C. Faculty Electrical Engineering, Mathematics and Computer Science Department Delft Institute of Applied Mathematics Date 2014-02-17 Abstract Three-dimensional reverse-time migration with the constant-density acoustic wave equation requires an efficient numerical scheme for the computation of wavefields. An explicit finite-difference scheme in the time domain is a common choice. However, it requires a significant amount of disk space for the imaging condition. The frequency-domain approach simplifies the correlation of the source and receiver wavefields, but requires the solution of a large sparse linear system of equations. For the latter, we use an iterative Krylov solver based on a shifted Laplace multigrid preconditioner with matrix-dependent prolongation. The question is whether migration in the frequency domain can compete with a time-domain implementation when both are performed on a parallel architecture. Both methods are naturally parallel over shots, but the frequency-domain method is also parallel over frequencies. If we have a sufficiently large number of compute nodes, we can compute the result for each frequency in parallel and the required time is dominated by the number of iterations for the highest frequency. As a parallel architecture, we consider a commodity hardware cluster that consists of multicore central processing units (CPUs), each of them connected to two graphics processing units (GPUs). Here, GPUs are used as accelerators and not as an independent compute node. The parallel implementation of the 3D migration in frequency domain is compared to a time-domain implementation. We optimize the throughput of the latter with dynamic load balancing, asynchronous I/O, and compression of snapshots. Because the frequency-domain solver uses matrix-dependent prolongation, the coarse-grid operators require more storage than available on GPUs for problems of realistic size. Due to data transfer, there is no significant speedup using GPU-accelerators. Therefore, we consider an implementation on CPUs only. Nevertheless, with the parallelization over shots and frequencies, this approach could compete with the time-domain implementation on multiple GPUs. Subject 3Dacousticfinite differenceimagingreverse time migration To reference this document use: http://resolver.tudelft.nl/uuid:a6a35526-0f1d-46a4-b8eb-3e4b289eba98 DOI https://doi.org/10.1190/geo2013-0214.1 Publisher Society of Exploration Geophysicists ISSN 0016-8033 Source Geophysics, 79 (2), 2014 Part of collection Institutional Repository Document type journal article Rights © 2014 Society of Exploration Geophysicists Files PDF Knibbe_2014.pdf 1.27 MB Close viewer /islandora/object/uuid:a6a35526-0f1d-46a4-b8eb-3e4b289eba98/datastream/OBJ/view