PARTANS: An autotuning framework for stencil computation on multi-GPU systems

Lutz, Thibaut; Fensch, Christian; Cole, Murray

research

oai:pure.ed.ac.uk:publications/82cb78c9-afcd-4e38-9379-b81e3fb92174

Authors: Thibaut Lutz, Christian Fensch, Murray Cole
Publication date: 1 January 2013

Abstract

GPGPUs are a powerful and energy-efficient solution for many problems. For higher performance or larger problems, it is necessary to distribute the problem across multiple GPUs, increasing the already high programming complexity. In this article, we focus on abstracting the complexity of multi-GPU programming for stencil computation. We show that the best strategy depends not only on the stencil operator, problem size, and GPU, but also on the PCI Express layout. This adds nonuniform characteristics to a seemingly homogeneous setup, causing up to 23% performance loss. We address this issue with an autotuner that optimizes the distribution across multiple GPUs.

article

Last time updated on 08/02/2015

This paper was published in Edinburgh Research Explorer.
