Parallel dual-numbers reverse AD

Tom J. Smeding*, Matthijs I.L. Vákár*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Where dual-numbers forward-mode automatic differentiation (AD) pairs each scalar value with its tangent value, dual-numbers reverse-mode AD attempts to achieve reverse AD using a similarly simple idea: by pairing each scalar value with a backpropagator function. Its correctness and efficiency on higher-order input languages have been analysed by Brunel, Mazza and Pagani, but this analysis used a custom operational semantics for which it is unclear whether it can be implemented efficiently. We take inspiration from their use of linear factoring to optimise dual-numbers reverse-mode AD to an algorithm that has the correct complexity and enjoys an efficient implementation in a standard functional language with support for mutable arrays, such as Haskell. Aside from the linear factoring ingredient, our optimisation steps consist of well-known ideas from the functional programming community. We demonstrate the use of our technique by providing a practical implementation that differentiates most of Haskell98. Where previous work on dual numbers reverse AD has required sequentialisation to construct the reverse pass, we demonstrate that we can apply our technique to task-parallel source programs and generate a task-parallel derivative computation.

Original languageEnglish
Article numbere16
Number of pages66
JournalJournal of Functional Programming
Volume35
DOIs
Publication statusPublished - 15 Aug 2025

Bibliographical note

Publisher Copyright:
© The Author(s), 2025. Published by Cambridge University Press.

Fingerprint

Dive into the research topics of 'Parallel dual-numbers reverse AD'. Together they form a unique fingerprint.

Cite this