Dynamic automatic differentiation of GPU broadcast kernels

Revels, Jarrett; Besard, Tim; Churavy, Valentin; De Sutter, Bjorn; Edelman, Alan

Repository landing page

research

oai:archive.ugent.be:8607909

Dynamic automatic differentiation of GPU broadcast kernels

Authors: Jarrett Revels
Tim Besard
Valentin Churavy
Bjorn De Sutter
Alan Edelman
Publication date: 1 January 2018
Publisher

Abstract

We show how forward-mode automatic differentiation (AD) can be employed within larger reverse-mode computations to dynamically differentiate broadcast operations in a GPU-friendly manner. Our technique fully exploits the broadcast Jacobian's inherent sparsity structure, and unlike a pure reverse-mode approach, this "mixed-mode" approach does not require a backwards pass over the broadcasted operation's subgraph, obviating the need for several reverse-mode-specific programmability restrictions on user-authored broadcast operations. Most notably, this approach allows broadcast fusion in primal code despite the presence of data-dependent control flow. We discuss an experiment in which a Julia implementation of our technique outperformed pure reverse-mode TensorFlow and Julia implementations for differentiating through broadcast operations within an HM-LSTM cell update calculation

Similar works

Full text

Open in the Core reader

Download PDF

Ghent University Academic Bibliography

oai:archive.ugent.be:8607909

Last time updated on 17/03/2019

This paper was published in Ghent University Academic Bibliography.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.