Conversation
```julia
v, pb = _rrule(sc, argv, paramsv, _returns_scalar(sc))
return v, Δ -> begin
    _Δ = Base.tail(pb(collect(Δ)))
    _Δ = Base.tail(pb(collect(Δ)))
```
If you comment this line out, you get the correct gradient. `deepcopy`ing everything didn't fix it, so I am not sure what's going on.
So call it once, it works. Call it twice, it breaks.
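A minimal illustration of what that means, using the names from the diff above (`Δ` being whatever cotangent the caller passes in):

```julia
v, pb = _rrule(sc, argv, paramsv, _returns_scalar(sc))
g1 = Base.tail(pb(collect(Δ)))  # first call: correct gradient
g2 = Base.tail(pb(collect(Δ)))  # second call: wrong, consistent with
                                # internal pointer state having advanced
```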
`deepcopy` isn't likely to help with pointers.
Perhaps the `_rrule` function should return a callable struct which stores the initial pointer address and starts from there every time it's called.
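A minimal sketch of that idea, with all names (`ResettingPullback`, `reset_pointer!`) hypothetical rather than SimpleChains API:

```julia
# Illustrative only: wrap the pullback together with the pointer/offset it
# started from, and restore that state before every call so repeated calls
# don't read from memory the previous call advanced past.
struct ResettingPullback{P,T}
    pb::P             # the underlying pullback closure
    initial_ptr::T    # pointer/offset captured when the pullback was built
end

function (rp::ResettingPullback)(Δ)
    reset_pointer!(rp.pb, rp.initial_ptr)  # hypothetical reset hook
    return rp.pb(Δ)
end
```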
I would suggest taking this approach: that is, in Julia versions without package extensions, LoopVectorization.jl loads the extension files. I think it'd make sense to give
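For reference, the usual pre-extensions fallback looks something like this (a sketch assuming Requires.jl; the extension file name and path are illustrative):

```julia
# In the package's top-level module: on Julia versions without package
# extensions (< 1.9), load the extension code via Requires.jl instead.
@static if !isdefined(Base, :get_extension)
    using Requires
    function __init__()
        # the UUID should match ReverseDiff's Project.toml
        @require ReverseDiff = "37e2e3b7-166d-5795-8a7a-e32c996b4267" begin
            include("../ext/SimpleChainsReverseDiffExt.jl")
        end
    end
end
```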
Yes, I will do that as soon as it works well.
```julia
    offset += 1
else
    l = length(y)
    v[offset + 1 : offset + l] = vec(y)
```
Note that `vec` allocates if `!isa(y, AbstractVector)`.
This looks performance sensitive. Could it use `@inbounds`?
But also, why? This seems internal. We should probably only be `rrule`ing relatively high-level SimpleChains calls, which hide things like `params` getting called?
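One allocation-free alternative (a sketch; it assumes linear-index `copyto!` is acceptable for the `y` types that reach this branch):

```julia
# copyto! with explicit offsets avoids the wrapper array that vec(y)
# creates for non-vector y, and checks bounds once up front.
l = length(y)
copyto!(v, offset + 1, y, 1, l)
```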
It was called in the log prior calculation.
This PR implements ReverseDiff support, which uncovered a spooky bug. Here is a reproducer that gives a wrong gradient. I will explain the spooky part, and how to get the correct gradient, in a comment.
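The reproducer snippet itself isn't preserved in this capture; as a hedged sketch, such a reproducer would take roughly this shape (model, dimensions, and loss are illustrative, not the original):

```julia
using SimpleChains, ReverseDiff

# Illustrative model: 4 inputs -> 8 hidden (tanh) -> 1 output.
sc = SimpleChain(static(4), TurboDense(tanh, 8), TurboDense(identity, 1))
x  = rand(Float32, 4)
p  = SimpleChains.init_params(sc)

loss(θ) = sum(abs2, sc(x, θ))
g = ReverseDiff.gradient(loss, p)  # wrong when the pullback runs twice internally
```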