You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We have a pretty complex we app that makes heavy use of fibers that run in parallel. Lately I stumbled upon the following scenario:
fiber1 runs Control.Parallel.parOneOf (which is based on Effect.Aff._parAffAlt)
The argument of Control.Parallel.parOneOf contains an array of 6 fibers that run in parallel
Before any of the 6 child fibers complete, fiber1 is killed
Expected behavior
All 6 child fibers are killed
Actual behavior
5 of the 6 fibers are killed, one fiber continues running
Additional context
each of the child fiber contains quite a complex sub-tree of fibers which use bracket and habe non-tirivial finilizers.
To Reproduce
The case I describe happens in a complex integration test, which is impractical to run on your side. I tried to reduce the test to something consice, but when I reduce the complexity of the child fibers to something like delay (Milliseconds 10.0) the system works as expected. So the problem happens only when the child fibers use non-trivial finilizers and brackets.
Why I can be sure that the problem lies in _parAffAlt and not in our code? Because when I replaced the library Effect.Aff._parAffAlt function with my own implementation (see below) the system started working as expected.
Question
The unit tests in test/Test/Main.purs use very small fiber trees which mostly contain only the delay function. Those unit test
pass without triggering this bug.
Would it be possible to extend the test cases with large fiber trees built by Test.QuickCheck?
Does any body else encountered a similar hard to reproduce behaviour?
altPar
::foralla
. Affa->Affa->Affa
altPar a1 a2 = supervise do
f1Ref <- liftEffect $ Ref.new Nothing
f2Ref <- liftEffect $ Ref.new Nothing
f1 <- suspendAff do
r1 <- attempt a1
case r1 ofRight _ ->do
mf2 <- liftEffect $ Ref.read f2Ref
for_ mf2 \f2 ->do
killFiber (error "CancellationException1") f2
Left _ -> pure unit
pure r1
f2 <- suspendAff do
r2 <- attempt a2
case r2 ofRight _ ->do
mf1 <- liftEffect $ Ref.read f1Ref
for_ mf1 \f1 ->do
killFiber (error "CancellationException2") f1
Left _ -> pure unit
pure r2
liftEffect $ Ref.write (Just f1) f1Ref
liftEffect $ Ref.write (Just f2) f2Ref
fr1 <- forkAff (joinFiber f1)
fr2 <- forkAff (joinFiber f2)
r1 <- catchError
(joinFiber fr1)
(\err -> pure (Left err))
r2 <- catchError
(joinFiber fr2)
(\err -> pure (Left err))
caseTuple r1 r2 ofTuple (Right a) _ -> pure a
Tuple (Left _) (Right a) -> pure a
Tuple (Left ce) (Left err) | message ce == "CancellationException2"-> throwError err
Tuple (Left err) (Left _) -> throwError err
The text was updated successfully, but these errors were encountered:
I don't think I've ever experienced this and known it, at least. Can you give me an idea of what kind of work is being done?
How are you observing that it isn't canceled?
Is it all truly asynchronous, or some synchronous code, too?
Same for cancelers?
Is the initial cancelation in the same tick of the event loop?
Any idea what the size and shape of the tree is (like is it relatively balanced or is it full right or left associated)?
Is it only ParAff alt node, or also apply/map nodes?
Would it be possible to extend the test cases with large fiber trees built by Test.QuickCheck?
I don't know. If you have thoughts on how to do this, and are able to reproduce your issue with it, then I could see adding it. It would be nice if tests didn't take an unbounded amount of time :)
We have a pretty complex we app that makes heavy use of fibers that run in parallel. Lately I stumbled upon the following scenario:
fiber1
runsControl.Parallel.parOneOf
(which is based onEffect.Aff._parAffAlt
)Control.Parallel.parOneOf
contains an array of 6 fibers that run in parallelfiber1
is killedExpected behavior
All 6 child fibers are killed
Actual behavior
5 of the 6 fibers are killed, one fiber continues running
Additional context
each of the child fiber contains quite a complex sub-tree of fibers which use
bracket
and habe non-tirivial finilizers.To Reproduce
The case I describe happens in a complex integration test, which is impractical to run on your side. I tried to reduce the test to something consice, but when I reduce the complexity of the child fibers to something like
delay (Milliseconds 10.0)
the system works as expected. So the problem happens only when the child fibers use non-trivial finilizers and brackets.Why I can be sure that the problem lies in
_parAffAlt
and not in our code? Because when I replaced the libraryEffect.Aff._parAffAlt
function with my own implementation (see below) the system started working as expected.Question
The unit tests in
test/Test/Main.purs
use very small fiber trees which mostly contain only thedelay
function. Those unit testpass without triggering this bug.
Would it be possible to extend the test cases with large fiber trees built by
Test.QuickCheck
?Does any body else encountered a similar hard to reproduce behaviour?
The text was updated successfully, but these errors were encountered: