We are not able to resolve this OAI Identifier to the repository landing page. If you are the repository manager for this record, please head to the Dashboard and adjust the settings.
Institute of Mathematics and Informatics Bulgarian Academy of Sciences
Abstract
AMS subject classification: 68Q22, 90C90We discuss in this paper the problem of generating highly efficient code when a
n + 1-dimensional nested loop program is executed on a n-dimensional torus/grid
of distributed-memory general-purpose machines. We focus on a class of uniform
recurrences with non-negative components of the dependency matrix. Using tiling
the iteration space strategy we show that minimizing the total running time reduces
to solving a non-trivial non-linear integer optimization problem. For the later we
present a mathematical framework that enables us to derive an O(n log n) algorithm
for finding a good approximate solution. The theoretical evaluations and the experimental results show that the obtained solution approximates the original minimum
sufficiently well in the context of the considered problem. Such algorithm is realtime usable for very large values of n and can be used as optimization techniques in
parallelizing compilers as well as in performance tuning of parallel codes by hand
Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.