The Case for Polymorphic Registers in Dataflow Computing

Journal Article (2018)
Author(s)

C.B. Ciobanu (Universiteit van Amsterdam, TU Delft - Data-Intensive Systems)

Georgi Gaydadjiev (Maxeler Technologies, TU Delft - Data-Intensive Systems)

C. Pilato (University of Lugano)

Donatella Sciuto (Politecnico di Milano)

Research Group
Data-Intensive Systems
Copyright
© 2018 C.B. Ciobanu, G. Gaydadjiev, Christian Pilato, Donatella Sciuto
DOI related publication
https://doi.org/10.1007/s10766-017-0494-1
More Info
expand_more
Publication Year
2018
Language
English
Copyright
© 2018 C.B. Ciobanu, G. Gaydadjiev, Christian Pilato, Donatella Sciuto
Research Group
Data-Intensive Systems
Issue number
6
Volume number
46
Pages (from-to)
1185-1219
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Heterogeneous systems are becoming increasingly popular, delivering high performance through hardware specialization. However, sequential data accesses may have a negative impact on performance. Data parallel solutions such as Polymorphic Register Files (PRFs) can potentially accelerate applications by facilitating high-speed, parallel access to performance-critical data. This article shows how PRFs can be integrated into dataflow computational platforms. Our semi-automatic, compiler-based methodology generates customized PRFs and modifies the computational kernels to efficiently exploit them. We use a separable 2D convolution case study to evaluate the impact of memory latency and bandwidth on performance compared to a state-of-the-art NVIDIA Tesla C2050 GPU. We improve the throughput up to 56.17X and show that the PRF-augmented system outperforms the GPU for 9×9 or larger mask sizes, even in bandwidth-constrained systems.