Convolutional neural networks on dataflow engines
Nils Voss (Imperial College London, Maxeler Technologies)
Marco Bacis (Politecnico di Milano, Maxeler Technologies)
Oskar Mencer (Maxeler Technologies)
Georgi Gaydadjiev (Imperial College London, Maxeler Technologies)
Wayne Luk (Imperial College London)
More Info
expand_more
Abstract
In this paper we discuss a high performance implementation for Convolutional Neural Networks (CNNs) inference on the latest generation of Dataflow Engines (DFEs). We discuss the architectural choices made during the design phase taking into account the DFE chip properties. We then perform design space exploration, considering the memory bandwidth and resources utilisation constraints derived from the used DFE and the chosen architecture. Finally, we discuss the high performance implementation and compare the obtained performance against other implementations, showing that our proposed design reaches 2,450 GOPS when running VGG16 as a test case.