Exact and Heuristic Allocation of Multi-kernel Applications to Multi-FPGA Platforms

Shan, Junnan; Casu, Mario R.; Cortadella, Jordi; Lavagno, Luciano; Lazarescu, Mihai T.

Repository landing page

oai:iris.polito.it:11583/2740832

Exact and Heuristic Allocation of Multi-kernel Applications to Multi-FPGA Platforms

Authors: Junnan Shan
Mario R. Casu
Jordi Cortadella
Luciano Lavagno
Mihai T. Lazarescu
Publication date: 1 January 2019
Publisher: place:New York, NY
Doi

Abstract

FPGA-based accelerators demonstrated high energy efficiency compared to GPUs and CPUs. However, single FPGA designs may not achieve sufficient task parallelism. In this work, we optimize the mapping of high-performance multi-kernel applications, like Convolutional Neural Networks, to multi-FPGA platforms. First, we formulate the system level optimization problem, choosing within a huge design space the parallelism and number of compute units for each kernel in the pipeline. Then we solve it using a combination of Geometric Programming, producing the optimum performance solution given resource and DRAM bandwidth constraints, and a heuristic allocator of the compute units on the FPGA cluster

Similar works

Full text

Open in the Core reader

Download PDF

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

oai:iris.polito.it:11583/27408...

Last time updated on 30/10/2019

This paper was published in PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino).

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.