Generating high-performance FPGA accelerator designs for big data analytics with Fletcher and Apache Arrow

Journal article (2021)

Authors

J.W. Peltenburg, Computer Engineering

J. van Straten, Computer Engineering

M. Brobbel, Computer Engineering

Z. Al-Ars, Computer Engineering

H.P. Hofstee, IBM Austin, Computer Engineering

Research Group

Computer Engineering (TU Delft)

DOI: https://doi.org/10.1007/s11265-021-01650-6

FPGA, Big data, Apache Arrow, Fletcher, Accelerator, Analytics

To reference this document use:

http://resolver.tudelft.nl/uuid:13c2cbb7-f92f-4ab5-bd34-ebd2d0cf2308

More Info

Published Date

2021

Language

English

Faculty

Electrical Engineering, Mathematics and Computer Science

Department

Quantum & Computer Engineering

Abstract

As big data analytics systems squeeze the last bits of performance out of CPUs and GPUs, the next near-term and widely available alternative that industry is considering for higher performance in the data center and cloud is the FPGA accelerator. We discuss several challenges that developers face when designing and integrating FPGA accelerators for big data analytics pipelines. On the software side, we observe complex run-time systems, hardware-unfriendly in-memory layouts of data sets, and (de)serialization overhead. On the hardware side, we observe a relative lack of platform-agnostic open-source tooling, a high design effort for data structure-specific interfaces, and a high design effort for infrastructure. The open-source Fletcher framework addresses these challenges. It is built on top of Apache Arrow, which provides a common, hardware-friendly in-memory format that allows zero-copy communication of large tabular data, avoiding (de)serialization overhead. Fletcher adds FPGA accelerators to Arrow's list of over eleven supported software languages. To deal with the hardware challenges, we present Arrow-specific components that provide easy-to-use, high-performance interfaces to accelerated kernels. The components are combined based on a generic architecture that is specialized to the application through an extensive infrastructure generation framework, which is presented in this article. All generated hardware is vendor-agnostic, and software drivers add a platform-agnostic layer, allowing users to create portable implementations.
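
As a rough illustration of why a columnar in-memory format can be read without (de)serialization, the sketch below mimics the offsets-plus-values arrangement that Arrow uses for variable-length string columns, using only the Python standard library. It is not Fletcher or Arrow code: the function names are hypothetical, and validity bitmaps and buffer alignment are left out for brevity.

```python
import struct


def encode_string_column(values):
    """Pack a list of str into Arrow-style (offsets, data) buffers.

    Hypothetical helper for illustration; real Arrow buffers also carry
    a validity bitmap and padding, which are omitted here.
    """
    data = bytearray()
    offsets = [0]
    for v in values:
        data += v.encode("utf-8")
        offsets.append(len(data))
    # n + 1 little-endian int32 offsets for n strings
    offsets_buf = struct.pack(f"<{len(offsets)}i", *offsets)
    return offsets_buf, bytes(data)


def read_string(offsets_buf, data_buf, i):
    """Return element i as a memoryview slice, without copying the bytes."""
    start, end = struct.unpack_from("<2i", offsets_buf, 4 * i)
    return memoryview(data_buf)[start:end]


offsets_buf, data_buf = encode_string_column(["fpga", "", "arrow"])
view = read_string(offsets_buf, data_buf, 2)
print(bytes(view).decode("utf-8"))
```

Because every element is located by two fixed-width offsets into one contiguous buffer, a reader needs only address arithmetic to find it. That is the kind of access pattern a hardware interface can stream directly, rather than walking pointer-based objects.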

Files

Peltenburg2021_Article_Generat... (pdf)

(pdf | 3.24 Mb)