Simplify wait_complete function by LiaCastaneda · Pull Request #19937 · apache/datafusion

LiaCastaneda · 2026-01-22T12:37:49Z

Which issue does this PR close?

Rationale for this change

The current v52 signature pub async fn wait_complete(self: &Arc<Self>) (introduced in #19546) is a bit unergonomic. The method requires &Arc<DynamicFilterPhysicalExpr>, but when working with Arc<dyn PhysicalExpr>, downcasting only gives you &DynamicFilterPhysicalExpr. Since you can't convert &DynamicFilterPhysicalExpr to Arc<DynamicFilterPhysicalExpr>, the method becomes impossible to call.

The &Arc<Self> param was used to check is_used() via Arc strong count, but this was overly defensive.

What changes are included in this PR?

Changed DynamicFilterPhysicalExpr::wait_complete signature from pub async fn wait_complete(self: &Arc<Self>) to pub async fn wait_complete(&self).
Removed the is_used() check from wait_complete() - this method, like wait_update(), should only be called on filters that have consumers. If the caller doesn't know whether the filter has consumers, they should call is_used() first to avoid waiting indefinitely. This approach avoids complex signatures and dependencies between the APIs methods.

Are these changes tested?

Yes, existing tests cover this functionality, I removed the "mock" consumer from test_hash_join_marks_filter_complete_empty_build_side and test_hash_join_marks_filter_complete since the fix in #19734 makes is_used check the outer struct strong_count as well.

Are there any user-facing changes?

The signature of wait_complete changed.

adriangb · 2026-01-22T13:35:05Z

datafusion/physical-expr/src/expressions/dynamic_filters.rs

+    /// # Note
+    ///
+    /// This method should only be called on filters that have consumers. If you don't
+    /// know whether the filter is being used, call [`Self::is_used`] first to avoid
+    /// waiting indefinitely.


I think there's a bit of nuance here. This is not because of the DynamicFilterPhysicalExpr API itself, it's only because of how HashJoinExec is implemented.

Under normal operation it would be a consumer calling wait_complete and hence it knows that a consumer exists because it is a consumer. In other words, under normal operation wait_completed is only called by consumers and thus is_used would always be true.

Or put another way, the only way this could go wrong is e.g. in a test or if HashJoinExec itself called wait_complete. By definition if more than 1 thing has a reference to the dynamic filter, there is a consumer. If there is only 1 reference, it must be the one HashJoinExec has (outside of tests). So it would have to be HashJoinExec that is calling wait_complete() right?

So the scenario described here seems more like a programming error or misuse of the APIs, not something that could happen under normal operation of a bug free usage of these APIs, right? In other words: if I was implementing this I could probably put the is_used() check behind #[cfg(debug_assertions)] or something to catch a programming error on my end, but it wouldn't really make sense to have that check at runtime in production, right?

Under normal operation it would be a consumer calling wait_complete and hence it knows that a consumer exists because it is a consumer. In other words, under normal operation wait_completed is only called by consumers and thus is_used would always be true.

🤔 I agree -- when I implemented this API, I had in mind that it would be used mainly in custom probe nodes of a HashJoinExec, so yes, ideally wait_complete is always called by consumers (in those cases it's impossible for wait_complete to wait indefinitely).

However, I remember when I added is_used in HashJoinExec::execute(), I saw in the tests we were waiting indefinitely (but this was mainly because before, is_used only checked the inner struct, which is not the case anymore), which is why I then added is_used inside wait_complete.

I included this note mainly thinking about a scenario where some third-party node has a reference to the DynamicFilter of the HashJoin but doesn't know if it has a consumer or not. However, in that case, the third-party node would hold a reference and is_used would return true, then filters would be computed and wait_complete would return successfully.

So yes, if this happens, it would be a programming error. I will remove the comment.

Or just make it clear that this scenario would only result from a programming error 😄

adriangb · 2026-01-22T15:11:32Z

datafusion/physical-expr/src/expressions/dynamic_filters.rs

+    /// In the unlikely scenario where this method waits indefinitely, it indicates
+    /// a programming error where `wait_update()` is being called without any consumers
+    /// holding a reference to the filter.


I'd say something like:

Producers (e.g.) HashJoinExec will never update the expression or mark it as completed if there are no consumers as an optimization to avoid extra work. If you call this method on a dynamic filter created by such a producer and there are no consumers registered this method would wait indefinitely. This should not happen under normal operation because if you have a reference to this structure that means you are a consumer and hence the producer will update the filter. If you do run into this scenario it would indicate a programming error either in your producer or in DataFusion if the producer is a built in node; please report the bug or open an issue explaining your use case.

adriangb · 2026-01-27T17:22:07Z

I plan to merge this once CI passes. Thank you @LiaCastaneda !

## Which issue does this PR close? ## Rationale for this change The current v52 signature `pub async fn wait_complete(self: &Arc<Self>)` (introduced in apache#19546) is a bit unergonomic. The method requires `&Arc<DynamicFilterPhysicalExpr>`, but when working with `Arc<dyn PhysicalExpr>`, downcasting only gives you `&DynamicFilterPhysicalExpr`. Since you can't convert `&DynamicFilterPhysicalExpr` to `Arc<DynamicFilterPhysicalExpr>`, the method becomes impossible to call. The `&Arc<Self>` param was used to check` is_used()` via Arc strong count, but this was overly defensive. ## What changes are included in this PR? - Changed `DynamicFilterPhysicalExpr::wait_complete` signature from `pub async fn wait_complete(self: &Arc<Self>)` to `pub async fn wait_complete(&self)`. - Removed the `is_used()` check from `wait_complete()` - this method, like `wait_update()`, should only be called on filters that have consumers. If the caller doesn't know whether the filter has consumers, they should call `is_used()` first to avoid waiting indefinitely. This approach avoids complex signatures and dependencies between the APIs methods. ## Are these changes tested? Yes, existing tests cover this functionality, I removed the "mock" consumer from `test_hash_join_marks_filter_complete_empty_build_side` and `test_hash_join_marks_filter_complete` since the fix in apache#19734 makes is_used check the outer struct `strong_count` as well. ## Are there any user-facing changes? The signature of `wait_complete` changed. (cherry picked from commit bef1368)

* Fix dynamic filter is_used function (apache#19734) ## Which issue does this PR close?  - Closes apache#19715. ## Rationale for this change The:is_used() API incorrectly returned false for custom `DataSource` implementations that didn't call reassign_expr_columns() -> with_new_children() . This caused `HashJoinExec` to skip computing dynamic filters even when they were actually being used. ## What changes are included in this PR? Updated is_used() to check both outer and inner Arc counts ## Are these changes tested? Functionality is covered by existing test `test_hashjoin_dynamic_filter_pushdown_is_used`. I was not sure if to add a repro since it would require adding a custom `DataSource`, the current tests in datafusion/core/tests/physical_optimizer/filter_pushdown/mod.rs use `FileScanConfig` ## Are there any user-facing changes? no (cherry picked from commit 278950a) * Simplify wait_complete function (apache#19937) ## Which issue does this PR close? ## Rationale for this change The current v52 signature `pub async fn wait_complete(self: &Arc<Self>)` (introduced in apache#19546) is a bit unergonomic. The method requires `&Arc<DynamicFilterPhysicalExpr>`, but when working with `Arc<dyn PhysicalExpr>`, downcasting only gives you `&DynamicFilterPhysicalExpr`. Since you can't convert `&DynamicFilterPhysicalExpr` to `Arc<DynamicFilterPhysicalExpr>`, the method becomes impossible to call. The `&Arc<Self>` param was used to check` is_used()` via Arc strong count, but this was overly defensive. ## What changes are included in this PR? - Changed `DynamicFilterPhysicalExpr::wait_complete` signature from `pub async fn wait_complete(self: &Arc<Self>)` to `pub async fn wait_complete(&self)`. - Removed the `is_used()` check from `wait_complete()` - this method, like `wait_update()`, should only be called on filters that have consumers. If the caller doesn't know whether the filter has consumers, they should call `is_used()` first to avoid waiting indefinitely. This approach avoids complex signatures and dependencies between the APIs methods. ## Are these changes tested? Yes, existing tests cover this functionality, I removed the "mock" consumer from `test_hash_join_marks_filter_complete_empty_build_side` and `test_hash_join_marks_filter_complete` since the fix in apache#19734 makes is_used check the outer struct `strong_count` as well. ## Are there any user-facing changes? The signature of `wait_complete` changed. (cherry picked from commit bef1368)

github-actions bot added physical-expr Changes to the physical-expr crates physical-plan Changes to the physical-plan crate labels Jan 22, 2026

LiaCastaneda mentioned this pull request Jan 22, 2026

Compute Dynamic Filters only when a consumer supports them #19546

Merged

LiaCastaneda marked this pull request as ready for review January 22, 2026 13:08

adriangb reviewed Jan 22, 2026

View reviewed changes

LiaCastaneda force-pushed the lia/return-wait-complete-to-old-signature branch from e2d9bfb to abc077a Compare January 22, 2026 17:26

LiaCastaneda added 5 commits January 27, 2026 11:21

simplify wait_complete function

85b9b7c

Add comment on usage

3f91426

Add comment on usage in wait_update as well

7f35bd9

Remove comment

1d95ac3

Add comment on error

8fa17e0

adriangb force-pushed the lia/return-wait-complete-to-old-signature branch from abc077a to 8fa17e0 Compare January 27, 2026 17:21

adriangb approved these changes Jan 27, 2026

View reviewed changes

adriangb added this pull request to the merge queue Jan 27, 2026

Merged via the queue into apache:main with commit bef1368 Jan 27, 2026
32 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify wait_complete function#19937

Simplify wait_complete function#19937
adriangb merged 5 commits intoapache:mainfrom
LiaCastaneda:lia/return-wait-complete-to-old-signature

LiaCastaneda commented Jan 22, 2026 •

edited

Loading

Uh oh!

adriangb Jan 22, 2026

Uh oh!

LiaCastaneda Jan 22, 2026

Uh oh!

adriangb Jan 22, 2026

Uh oh!

adriangb Jan 22, 2026 •

edited

Loading

Uh oh!

adriangb commented Jan 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

LiaCastaneda commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

adriangb Jan 22, 2026

Choose a reason for hiding this comment

Uh oh!

LiaCastaneda Jan 22, 2026

Choose a reason for hiding this comment

Uh oh!

adriangb Jan 22, 2026

Choose a reason for hiding this comment

Uh oh!

adriangb Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

adriangb commented Jan 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

LiaCastaneda commented Jan 22, 2026 •

edited

Loading

adriangb Jan 22, 2026 •

edited

Loading