Start forward mode AD #389

gdalle · 2024-11-24T15:00:46Z

This is a very rough backbone of forward mode AD, based on #386 and the existing reverse mode implementation.

Will's edits (apologies for editing your thing @gdalle -- I just want to make sure that the todo list is at the top of the PR):

Todo:

~~make FunctionWrappers work correctly~~ not going to do this in this PR
add support for MistyClosures
add tests for Hessian vector products
define is_primitive separately for forwards and reverse pass.
do a complete pass to review design -- are there any high-level things we ought to modify?
improve DRY-ness of code, particularly in the testing infrastructure.

Once the above are complete, I'll request reviews.

codecov · 2024-11-24T16:44:26Z

Codecov Report

Attention: Patch coverage is 94.04070% with 82 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/interpreter/s2s_forward_mode_ad.jl	88.77%	22 Missing ⚠️
src/test_utils.jl	86.66%	16 Missing ⚠️
src/rrules/foreigncall.jl	75.75%	8 Missing ⚠️
src/rrules/memory.jl	87.69%	8 Missing ⚠️
src/utils.jl	76.92%	6 Missing ⚠️
src/rrules/tasks.jl	64.28%	5 Missing ⚠️
src/dual.jl	85.71%	3 Missing ⚠️
src/rrules/builtins.jl	97.82%	3 Missing ⚠️
src/developer_tools.jl	0.00%	2 Missing ⚠️
src/interpreter/s2s_reverse_mode_ad.jl	71.42%	2 Missing ⚠️
... and 5 more

Files with missing lines	Coverage Δ
src/Mooncake.jl	`100.00% <ø> (ø)`
src/interpreter/ir_utils.jl	`89.68% <100.00%> (+2.81%)`	⬆️
src/rrules/array_legacy.jl	`100.00% <100.00%> (ø)`
src/rrules/avoiding_non_differentiable_code.jl	`100.00% <100.00%> (ø)`
src/rrules/blas.jl	`99.64% <100.00%> (+0.84%)`	⬆️
src/rrules/fastmath.jl	`100.00% <100.00%> (ø)`
src/rrules/lapack.jl	`100.00% <100.00%> (+0.56%)`	⬆️
src/rrules/linear_algebra.jl	`100.00% <100.00%> (ø)`
src/rrules/low_level_maths.jl	`100.00% <100.00%> (ø)`
src/rrules/new.jl	`91.30% <100.00%> (+2.84%)`	⬆️
... and 20 more

... and 2 files with indirect coverage changes

willtebbutt

This is great. I've left a few comments, but if you're planning to do a bunch of additional stuff, then maybe they're redundant. Either way, don't feel the need to respond to them.

src/interpreter/s2s_forward_mode_ad.jl

test/forward.jl

src/frules/basic.jl

src/interpreter/s2s_forward_mode_ad.jl

Co-authored-by: Will Tebbutt <[email protected]> Signed-off-by: Guillaume Dalle <[email protected]>

gdalle · 2024-11-26T07:09:38Z

@willtebbutt following our discussion yesterday I scratched my head some more, and I decided that it would be infinitely simpler to enforce the invariant that one line of primal IR maps to one line of dual IR. While this may require additional fallbacks in the Julia code itself, I hope it will make our lives much easier on the IR side. What do you think?

willtebbutt · 2024-11-26T08:35:47Z

I think this could work.

You could just replace the frule!! calls with a call to a function call_frule!! which would be something like

@inline function call_frule!!(rule::R, fargs::Vararg{Any, N}) where {N}
    return rule(map(x -> x isa Dual ? x : zero_dual(x), fargs)...)
end

The optimisation pass will lower this to what we were thinking about writing out in the IR anyway.

I think the other important kinds of nodes would be largely straightforward to handle.

gdalle · 2024-11-26T08:48:17Z

I think we might need to be slightly more subtle. If an argument to the :call or :invoke expression is a CC.Argument or a CC.SSAValue, we don't wrap it in a Dual because we assume it will already be one, right?

willtebbutt · 2024-11-26T08:54:02Z

Yes. I think my proposed code handles this though, or am I missing something?

gdalle · 2024-11-26T08:58:16Z

In the spirit of higher-order AD, we may encounter Dual inputs that we want to wrap with a second Dual, and Dual inputs that we want to leave as-is. So I think this wrapping needs to be decided from the type of each argument in the IR?

willtebbutt · 2024-11-26T09:06:13Z

Very good point.

So I think this wrapping needs to be decided from the type of each argument in the IR?

Agreed. Specifically, I think we need to distinguish between literals / QuoteNodes / GlobalRefs, and Argument / SSAValues?

gdalle · 2024-11-26T09:08:32Z

I still need to dig into the different node types we might encounter (and I still don't understand QuoteNodes) but yeah, Argument and SSAValue don't need to be wrapped.
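
To make the distinction above concrete, here is a minimal sketch of deciding the wrapping from the kind of IR node rather than from the run-time value. The names `Dual`, `zero_dual` and `dual_arg` are simplified stand-ins written for this illustration and are not Mooncake's definitions; the `nothing` tangent is just a placeholder for "no tangent".

```julia
# Minimal stand-in for a dual number type (assumption, not Mooncake's Dual).
struct Dual{P,T}
    primal::P
    tangent::T
end

# Hypothetical zero-tangent constructors covering only the cases used here.
zero_dual(x::Float64) = Dual(x, 0.0)
zero_dual(x) = Dual(x, nothing)  # placeholder tangent for everything else

# Arguments and SSA values refer to results that are already dual at run time,
# so the reference is passed through unchanged.
dual_arg(x::Union{Core.Argument,Core.SSAValue}) = x
# A QuoteNode wraps a constant: unwrap it and attach a zero tangent.
dual_arg(x::QuoteNode) = zero_dual(x.value)
# A GlobalRef names a binding; this sketch assumes it is a defined constant.
dual_arg(x::GlobalRef) = zero_dual(getfield(x.mod, x.name))
# Anything else is treated as a literal constant.
dual_arg(x) = zero_dual(x)

dual_arg(Core.SSAValue(3))        # left as a reference
dual_arg(2.0)                     # literal, wrapped with a zero tangent
dual_arg(GlobalRef(Base, :sin))   # global constant, wrapped
```

Because the decision depends only on the node type, a literal that happens to be a `Dual` (the higher-order case) is wrapped a second time, whereas an `x isa Dual` check on the value would leave it unwrapped.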

willtebbutt · 2024-11-27T12:36:48Z

I was reviewing the design docs and realised that, sadly, the "one line of primal IR maps to one line of dual IR" won't work for Core.GotoIfNot nodes. See https://compintell.github.io/Mooncake.jl/previews/PR386/developer_documentation/forwards_mode_design/#Statement-Transformation .

gdalle · 2024-11-27T13:05:47Z

I think that's okay, the main trouble is adding new lines which insert new variables because it requires manual renumbering. A GoTo should be much simpler.

willtebbutt · 2024-11-27T13:15:46Z

Were the difficulties around renumbering etc not resolved by not compact!ing until the end? I feel like I might be missing something.

gdalle · 2024-11-27T13:21:01Z

No they weren't. I experimented with compact! in various places and I was struggling a lot, so I asked Frames for advice. She agreed that insertion should usually be avoided.
If we have to insert something for GoTo, I think it will still be easier because we're not defining a new SSAValue so we don't have to adapt future statements that refer to it.

willtebbutt · 2024-11-27T13:26:48Z

Ah, right, but we do need to insert a new SSAValue. Suppose that the GotoIfNot of interest is

GotoIfNot(%5, #3)

i.e. jump to block 3 if not %5. In the forwards-mode IR this would become

%new_ssa = Expr(:call, primal, %5)
GotoIfNot(%new_ssa, #3)

Does this not cause the same kind of problems?
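
As a source-level picture of what that extra statement does, here is a hand-written dual version of a small branching function. It is only a sketch: `Dual`, `primal` and the `*_dual` helpers are hypothetical stand-ins defined here, not Mooncake's rules.

```julia
# Minimal stand-in for a dual number type (assumption, not Mooncake's Dual).
struct Dual{P,T}
    primal::P
    tangent::T
end
primal(d::Dual) = d.primal

# Hand-written dual versions of the operations used by
# f(x) = x > 1 ? 2x : 3 + x (hypothetical rules for this sketch).
gt_dual(x::Dual, c::Real) = Dual(x.primal > c, nothing)
mul_dual(c::Real, x::Dual) = Dual(c * x.primal, c * x.tangent)
add_dual(c::Real, x::Dual) = Dual(c + x.primal, x.tangent)

function f_dual(x::Dual)
    cond = gt_dual(x, 1)     # plays the role of %5: a Dual wrapping a Bool
    new_cond = primal(cond)  # plays the role of %new_ssa: the inserted statement
    if new_cond              # the branch can only test a plain Bool
        return mul_dual(2, x)
    else
        return add_dual(3, x)
    end
end

f_dual(Dual(2.0, 1.0))  # takes the 2x branch
f_dual(Dual(0.5, 1.0))  # takes the 3 + x branch
```

The branch condition is the only consumer of the new value, which is why the inserted statement is used in exactly one place, directly after it is defined.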

gdalle · 2024-11-27T13:37:38Z

Oh yes you're probably right. Although it might be slightly less of a hassle because the new SSA is only used in one spot, right after. I'll take a look

gdalle · 2024-11-27T13:38:14Z

Do you know what I should do about expressions of type :code_coverage_effect? I assume they're inserted automatically and they're alone on their lines?

willtebbutt · 2024-11-27T14:07:43Z

Yup -- I just strip them out of the IR entirely in reverse-mode. See https://github.com/compintell/Mooncake.jl/blob/0f37c079bd1ae064e7b84696eed4a1f7eb763f1f/src/interpreter/s2s_reverse_mode_ad.jl#L728

The way to remove an instruction from an IRCode is just to replace the instruction with nothing.
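
A rough sketch of that replacement, using only `Core.Compiler` internals rather than Mooncake helpers. The helper name `blank_statements!` is made up for this example, and the `:stmt` field name assumes a recent Julia version (the field may have a different name on older releases). Compiler internals are not a stable API, so treat this as illustrative.

```julia
const CC = Core.Compiler

# Replace every statement matching `pred` with `nothing`, in place.
# `blank_statements!` is a hypothetical helper, not part of Mooncake.
function blank_statements!(pred, ir::CC.IRCode)
    for k in 1:CC.length(ir.stmts)
        inst = CC.getindex(ir, CC.SSAValue(k))
        if pred(CC.getindex(inst, :stmt))
            CC.setindex!(inst, nothing, :stmt)
        end
    end
    return ir
end

f(x) = x > 1 ? 2x : 3 + x
ir = Base.code_ircode(f, (Float64,))[1][1]

# Blank out coverage statements. Unless Julia was started with code coverage
# enabled there are none, and the IR is left unchanged.
blank_statements!(stmt -> Meta.isexpr(stmt, :code_coverage_effect), ir)
```

Since the statement is overwritten rather than deleted, the numbering of all other SSA values is unaffected.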

gdalle · 2024-11-27T14:21:12Z

I think this works for GotoIfNot:

1. make all the insertions necessary
2. compact! once to make sure they are applied
3. shift the conditions of all GotoIfNot nodes to refer to the node right before them (where we get the primal value of the condition)

MWE (requires this branch of Mooncake):

const CC = Core.Compiler
using Mooncake
using MistyClosures

f(x) = x > 1 ? 2x : 3 + x
ir = Base.code_ircode(f, (Float64,))[1][1]
initial_ir = copy(ir)
# Insert a new statement just before statement 3 (the GotoIfNot).
get_primal_inst = CC.NewInstruction(Expr(:call, +, 1, 2), Any)  # placeholder for get_primal
CC.insert_node!(ir, CC.SSAValue(3), get_primal_inst, false)
# Compact once so that the insertion is applied and the statements are renumbered.
ir = CC.compact!(ir)
# Point the condition of each GotoIfNot at the statement right before it.
for k in 1:length(ir.stmts)
    inst = ir[CC.SSAValue(k)][:stmt]
    if inst isa Core.GotoIfNot
        Mooncake.replace_call!(ir, CC.SSAValue(k), Core.GotoIfNot(CC.SSAValue(k - 1), inst.dest))
    end
end
ir

julia> initial_ir
5 1 ─ %1 = Base.lt_float(1.0, _2)::Bool                                                                                 │╻╷╷ >
  │   %2 = Base.or_int(%1, false)::Bool                                                                                 ││╻   <
  └──      goto #3 if not %2                                                                                            │   
  2 ─ %4 = Base.mul_float(2.0, _2)::Float64                                                                             ││╻   *
  └──      return %4                                                                                                    │   
  3 ─ %6 = Base.add_float(3.0, _2)::Float64                                                                             ││╻   +
  └──      return %6                                                                                                    │   
                                                                                                                            

julia> ir
5 1 ─ %1 = Base.lt_float(1.0, _2)::Bool                                                                                 │╻╷╷ >
  │        Base.or_int(%1, false)::Bool                                                                                 ││╻   <
  │   %3 = (+)(1, 2)::Any                                                                                               │   
  └──      goto #3 if not %3                                                                                            │   
  2 ─ %5 = Base.mul_float(2.0, _2)::Float64                                                                             ││╻   *
  └──      return %5                                                                                                    │   
  3 ─ %7 = Base.add_float(3.0, _2)::Float64                                                                             ││╻   +
  └──      return %7

willtebbutt · 2025-03-31T21:25:15Z

Just requires implementing forwards-mode for FunctionWrappers, then will be ready for review.

edit: also for MistyClosures, and do test this works by computing some Hessian-vector products!

yebai · 2025-04-01T11:09:51Z

ext/MooncakeSpecialFunctionsExt.jl

-@from_rrule DefaultCtx Tuple{typeof(cosint),IEEEFloat}
-@from_rrule DefaultCtx Tuple{typeof(ellipk),IEEEFloat}
-@from_rrule DefaultCtx Tuple{typeof(ellipe),IEEEFloat}
+@from_chain_rule DefaultCtx Tuple{typeof(airyai),IEEEFloat}


A relatively minor comment: from_chainrules is more precise than from_chain_rule. The former clarifies that we are importing a rule from ChainRules, while the latter misled me, since I thought it referred to the generic chain rule terminology.

Suggested change

@from_chain_rule DefaultCtx Tuple{typeof(airyai),IEEEFloat}

@from_chainrules DefaultCtx Tuple{typeof(airyai),IEEEFloat}

Ah, interesting. I have no strong view either way, so I'm happy to change it if you think from_chainrules is clearer.

willtebbutt · 2025-04-01T15:40:08Z

@gdalle is there going to be an easy way to test Hessian-vector products using DI before we have sorted out the various things in ADTypes / DI that are needed to make forwards-mode Mooncake work via DI? I'm just wondering whether to try to include Hessian-vector products computed via DI as test cases in this PR, or to punt that to a later date.

gdalle · 2025-04-01T16:36:06Z

Yeah no it's not gonna be completely straightforward. But doing the ADTypes changes is a matter of minutes

codecov-commenter · 2025-05-11T11:33:16Z

Codecov Report

Attention: Patch coverage is 89.65996% with 149 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/rrules/misty_closures.jl	0.00%	56 Missing ⚠️
src/interpreter/s2s_forward_mode_ad.jl	88.77%	22 Missing ⚠️
src/test_utils.jl	86.77%	16 Missing ⚠️
src/rrules/foreigncall.jl	75.75%	8 Missing ⚠️
src/rrules/memory.jl	87.69%	8 Missing ⚠️
src/utils.jl	75.00%	7 Missing ⚠️
src/rrules/twice_precision.jl	92.10%	6 Missing ⚠️
src/rrules/tasks.jl	64.28%	5 Missing ⚠️
src/dual.jl	85.71%	3 Missing ⚠️
src/interpreter/ir_utils.jl	90.32%	3 Missing ⚠️
... and 8 more

gdalle · 2025-05-11T14:59:16Z

Hey @willtebbutt! Anything I can do to help bring this over the finish line?

willtebbutt · 2025-05-12T08:03:49Z

I'm going to do a pass over the remaining items (listed at the top) tonight, and will report back. Probably the most helpful thing really will be reviewing -- it's such a large PR that it will be hard for anyone who hasn't been involved in its development.

gdalle · 2025-05-12T08:07:15Z

In particular, should we tie together the definition of is_primitive for forwards-mode and reverse-mode, or permit something to be primitive in just one?

I think it would make sense to keep the two separate, for the case where defining rules is not strictly necessary but useful to enhance performance. For example, someone may want to define a reverse rule for an optimization solver to avoid backpropagating through every iteration, but decide that the forward-mode behavior is good enough.

willtebbutt · 2025-05-12T08:10:30Z

I think it would make sense to keep the two separate, for the case where defining rules is not strictly necessary but useful to enhance performance. For example, someone may want to define a reverse rule for an optimization solver to avoid backpropagating through every iteration, but decide that the forward-mode behavior is good enough.

I completely agree. I'm going to edit the todo item to reflect this.
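
One way the separation could look is a mode argument on `is_primitive`, so that a signature can be primitive in one mode only. This is a sketch under assumptions: the `Mode` types, the two-argument `is_primitive` and the toy solver `newton_sqrt` are invented for this illustration and do not reflect Mooncake's actual interface.

```julia
# Hypothetical mode tags (assumption for this sketch).
abstract type Mode end
struct ForwardMode <: Mode end
struct ReverseMode <: Mode end

# By default nothing is primitive: AD differentiates through the function body.
is_primitive(::Type{<:Mode}, ::Type{<:Tuple}) = false

# Toy iterative solver standing in for an optimisation routine:
# Newton iterations for the square root of `a`.
function newton_sqrt(a::Float64)
    x = a
    for _ in 1:20
        x = (x + a / x) / 2
    end
    return x
end

# Declare the solver primitive in reverse mode only, so that a hand-written
# reverse rule can avoid backpropagating through every iteration. Forward mode
# keeps the default and differentiates through the loop.
is_primitive(::Type{ReverseMode}, ::Type{<:Tuple{typeof(newton_sqrt),Float64}}) = true

sig = Tuple{typeof(newton_sqrt),Float64}
is_primitive(ReverseMode, sig)  # true
is_primitive(ForwardMode, sig)  # false
```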

gdalle · 2025-05-20T07:27:47Z

Gentle bump on this one :) @willtebbutt do you need an ADTypes object / some DI infrastructure for HVPs?

willtebbutt · 2025-05-20T09:16:38Z

Apologies @gdalle -- I've made some progress locally on this, but have yet to push changes. I don't think I need an ADType object yet. My feeling is that it's probably best to do this as part of some follow up work. My approach in this PR is to:

1. check that forwards mode can differentiate simple MistyClosures, and then
2. check that forwards mode can differentiate the specific MistyClosures produced by reverse-mode AD.

I think I'm most of the way with 1, but getting 2 to work will be the real test.

gdalle · 2025-05-20T09:34:05Z

Is it such a big deal if nested differentiation is only part of a later push? Adding forward mode in addition to reverse mode (not necessarily on top of it) would already be useful for the community.
Unless you think that getting nested differentiation to work will require significant changes to the design, which may lead to breaking the user-facing aspects?

willtebbutt · 2025-05-20T11:11:53Z

To my mind, much of the utility of forwards-mode derives from its ability to be combined with reverse-mode to compute HVPs. Usually I'd be in favour of incrementally adding stuff over the course of a few PRs, but this is such an important feature that I'd rather not merge any forwards-mode stuff until we can apply it over reverse-mode.

gdalle · 2025-05-20T12:26:22Z

To my mind, much of the utility of forwards-mode derives from its ability to be combined with reverse-mode to compute HVPs.

That's because you're an optimization kind of person, but you need to look further ;)
For instance, SciML mostly cares about efficient forward-mode (sparse) Jacobians inside OrdinaryDiffEq and friends. It's sometimes hard to get Enzyme to work there, so the current options are mostly ForwardDiff and FiniteDiff, mediated through DifferentiationInterface. A forward mode in Mooncake would be an interesting fourth option in such cases.

willtebbutt · 2025-05-20T12:30:12Z

That's a fair point. Okay, my proposal is this: if I've not managed to get a basic forwards-over-reverse example working by the end of the week, we punt it to a subsequent PR. I agree that it would be very nice to get this merged sooner rather than later.

gdalle · 2025-05-20T12:38:25Z

Sounds good! Let me know when this is review-ready

MasonProtter · 2025-05-30T10:15:16Z

Just chiming in that I ended up here in this thread today specifically because I had a SciML OrdinaryDiffEq use-case where I wanted Forward-Mode AD, but ForwardDiff.jl would be difficult to use, and Enzyme.jl complained too much.

Forward-over-reverse is of course always nice, but not the only usecase for forward mode!

Exciting to see progress being made.

yebai · 2025-05-30T21:46:39Z

It would be great to make a fresh push to finish off this PR, @willtebbutt. Besides the forward-over-reverse functionality, there are a few other issues to address for a robust design and codebase, as listed above.

willtebbutt · 2025-06-04T19:33:47Z

test/rrules/misty_closures.jl

+    # Construct a callable which performs reverse-mode, and apply forwards-mode over it.
+    rule = Mooncake.build_rrule(Tuple{typeof(quadratic), Float64})
+    TestUtils.test_rule(
+        StableRNG(123), low_level_gradient, rule, quadratic, 5.0;
+        interface_only=false,
+        is_primitive=false,
+        perf_flag=:none,
+        unsafe_perturb=true,
+        forward=true,
+    )
+
+    # Manually test that this correctly computes the second derivative.
+    frule = Mooncake.build_frule(
+        Mooncake.get_interpreter(),
+        Tuple{typeof(low_level_gradient), typeof(rule), typeof(quadratic), Float64}
+    )
+    result = frule(
+        zero_dual(low_level_gradient),
+        zero_dual(rule),
+        zero_dual(quadratic),
+        Mooncake.Dual(5.0, 1.0),
+    )
+    @test tangent(result) == 2.0
+end


I'm pleased to say that this works -- we can successfully compute the second derivative using forwards-over-reverse.
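
For readers unfamiliar with the idea, the arithmetic being tested can be reproduced with a toy, self-contained sketch: a hand-written reverse pass is evaluated on a dual number, and the tangent of the result is the second derivative. Everything here is an assumption made for illustration: the tiny `Dual` type, the choice `quadratic(x) = x * x` (consistent with the expected tangent of 2.0 in the test), and the helpers `rrule_quadratic` and `gradient`. None of it is Mooncake's API.

```julia
# Minimal stand-in for a dual number type (assumption, not Mooncake's Dual).
struct Dual{P,T}
    primal::P
    tangent::T
end

# Just enough dual arithmetic for this example.
Base.:*(a::Dual, b::Dual) =
    Dual(a.primal * b.primal, a.primal * b.tangent + a.tangent * b.primal)
Base.:*(a::Real, b::Dual) = Dual(a * b.primal, a * b.tangent)
Base.:+(a::Dual, b::Dual) = Dual(a.primal + b.primal, a.tangent + b.tangent)

# Assumed definition of the function under test.
quadratic(x) = x * x

# Hand-written reverse pass: returns the value and a pullback.
function rrule_quadratic(x)
    y = quadratic(x)
    pullback(dy) = dy * x + dy * x  # contribution from each factor of x * x
    return y, pullback
end

# Reverse mode: seed the pullback with 1 to obtain the first derivative.
gradient(x) = rrule_quadratic(x)[2](1.0)

gradient(5.0)             # first derivative at 5.0
gradient(Dual(5.0, 1.0))  # forwards-over-reverse: the tangent is the second derivative
```

The second call pushes a unit tangent through the whole reverse pass, so its primal holds the first derivative and its tangent holds the second.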

Awesome! Should I get started on adding AutoForwardMooncake to ADTypes?

That way we could run DI tests with this branch

That might actually be good at this point -- it would be good to have forward-mode + forward-over-reverse tests with DI before releasing, as that's how users will interact with it anyway.

Can you add interface functions to this PR for DI?

value_and_pushforward!!

prepare_pushforward_cache

gentle bump on this :)

gdalle added 3 commits November 24, 2024 11:36

Start forward mode prototype

be316ff

First working autodiff

deac913

Docstring

9c96c8d

willtebbutt reviewed Nov 24, 2024

gdalle and others added 8 commits November 24, 2024 18:18

Apply suggestions from code review

136aff6

Co-authored-by: Will Tebbutt <[email protected]> Signed-off-by: Guillaume Dalle <[email protected]>

Moving files around

f65cc53

Primitives already known

053a8bb

Merge branch 'main' into gd/forward

6d8ec04

Keep pushing forward (pun intended)

a3107a8

Still buggy, don't touch

2836ac8

Keep instruction mapping one to one

09d63bd

Use replace_call

fa679eb

gdalle mentioned this pull request Nov 27, 2024

Forwards-Mode Design Docs #386

Merged

Initial forwards-mode timings

48b61ec

yebai reviewed Apr 1, 2025

willtebbutt mentioned this pull request Apr 1, 2025

Forwards-Mode Rules for nnlib and friends #542

Open

Merge in main

3d9f9bf

willtebbutt added 3 commits May 26, 2025 19:22

Constrain JuliaInterpreter

05d3c65

Basic MistyClosure support

df0d2d7

Merge in main

ed912eb

yebai assigned willtebbutt May 28, 2025

willtebbutt added 3 commits June 4, 2025 20:27

Do not use MistyClosure internals inside reverse-mode

b9c5f7e

Forwards-over-reverse mwe

6990348

Remove overly strict performance check

941e171

willtebbutt reviewed Jun 4, 2025

gdalle mentioned this pull request Jun 5, 2025

feat: Add forward mode Mooncake SciML/ADTypes.jl#110

Draft

5 tasks
