Preserving sort on `UnionExec` inputs instead of introducing a suboptimal top-level sort

### Is your feature request related to a problem or challenge?

When you have a `UNION` over mostly sorted inputs and explicitly add sorts to the unsorted ones, the `enforce_sorting` optimizer removes those targeted sorts and moves the sort to the top level instead.

Here's an excerpt of the verbose explain, as you can generate from the failing test in the [reproducer PR](https://github.com/apache/datafusion/pull/18352) (full explain: [gist](https://gist.github.com/rgehan/b632f5e106d60240876fa15df0d007e4)):

```
+-----------------------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| plan_type                               | plan                                                                                                                                                                     |
+-----------------------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------+                                                                                                                                                                                                                                                                          |
| logical_plan                            | Aggregate: groupBy=[[id]], aggr=[[]]                                                                                                                                     |
|                                         |   Union                                                                                                                                                                  |
|                                         |     TableScan: sorted projection=[id]                                                                                                                                    |
|                                         |     Sort: unsorted.id ASC NULLS LAST                                                                                                                                     |
|                                         |       TableScan: unsorted projection=[id]                                                                                                                                |
...
| initial_physical_plan                   | AggregateExec: mode=Final, gby=[id@0 as id], aggr=[], ordering_mode=Sorted                                                                                               |
|                                         |   AggregateExec: mode=Partial, gby=[id@0 as id], aggr=[], ordering_mode=Sorted                                                                                           |
|                                         |     UnionExec                                                                                                                                                            |
|                                         |       DataSourceExec: file_groups={1 group: [[{testdata}/alltypes_tiny_pages.parquet]]}, projection=[id], output_ordering=[id@0 ASC NULLS LAST], file_type=parquet       |
|                                         |       SortExec: expr=[id@0 ASC NULLS LAST], preserve_partitioning=[false]                                                                                                |
|                                         |         DataSourceExec: file_groups={1 group: [[{testdata}/alltypes_tiny_pages.parquet]]}, projection=[id], file_type=parquet                                            |
...
| physical_plan after EnforceDistribution | OutputRequirementExec: order_by=[], dist_by=Unspecified                                                                                                                  |
|                                         |   AggregateExec: mode=Final, gby=[id@0 as id], aggr=[], ordering_mode=Sorted                                                                                             |
|                                         |     SortExec: expr=[id@0 ASC NULLS LAST], preserve_partitioning=[false]                                                                                                  |
|                                         |       CoalescePartitionsExec                                                                                                                                             |
|                                         |         AggregateExec: mode=Partial, gby=[id@0 as id], aggr=[], ordering_mode=Sorted                                                                                     |
|                                         |           UnionExec                                                                                                                                                      |
|                                         |             DataSourceExec: file_groups={1 group: [[{testdata}/alltypes_tiny_pages.parquet]]}, projection=[id], output_ordering=[id@0 ASC NULLS LAST], file_type=parquet |
|                                         |             SortExec: expr=[id@0 ASC NULLS LAST], preserve_partitioning=[false]                                                                                          |
|                                         |               DataSourceExec: file_groups={1 group: [[{testdata}/alltypes_tiny_pages.parquet]]}, projection=[id], file_type=parquet                                      |
|                                         |                                                                                                                                                                          |
| physical_plan after EnforceSorting      | OutputRequirementExec: order_by=[], dist_by=Unspecified                                                                                                                  |
|                                         |   AggregateExec: mode=Final, gby=[id@0 as id], aggr=[], ordering_mode=Sorted                                                                                             |
|                                         |     SortPreservingMergeExec: [id@0 ASC NULLS LAST]                                                                                                                       |
|                                         |       SortExec: expr=[id@0 ASC NULLS LAST], preserve_partitioning=[true]                                                                                                 |
|                                         |         AggregateExec: mode=Partial, gby=[id@0 as id], aggr=[]                                                                                                           |
|                                         |           UnionExec                                                                                                                                                      |
|                                         |             DataSourceExec: file_groups={1 group: [[{testdata}/alltypes_tiny_pages.parquet]]}, projection=[id], output_ordering=[id@0 ASC NULLS LAST], file_type=parquet |
|                                         |             DataSourceExec: file_groups={1 group: [[{testdata}/alltypes_tiny_pages.parquet]]}, projection=[id], file_type=parquet                                        |
...
+-----------------------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

This re-sorts all data instead of just the unsorted partition, which prevents usage of streaming operators (e.g. `SortPreservingMergeExec`), increases memory usage / spilling significantly.

This turns what should be a small parallel sort into a memory-intensive / spilling sort of the entire dataset.


### Describe the solution you'd like

Sorts below a `UnionExec` should be preferred over a top-level sort.

In #9867, @NGA-TRAN proposed explicitly implementing `required_input_ordering` in `UnionExec`, which seems to fix the reproducer tests I added in #18352. It however breaks other unit tests.

### Describe alternatives you've considered

- Pre-sorting all data, before feeding it to `datafusion`
- Implementing a custom sort operator that wouldn't get optimized out

While these are viable workarounds, they are not ideal, and I believe `datafusion` should be able to handle this case.


### Additional context

Reproducer tests in PR #18352.

Related to issue #9898 and its corresponding PR #9867.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Preserving sort on `UnionExec` inputs instead of introducing a suboptimal top-level sort #18380

Is your feature request related to a problem or challenge?

Describe the solution you'd like

Describe alternatives you've considered

Additional context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Preserving sort on UnionExec inputs instead of introducing a suboptimal top-level sort #18380

Description

Is your feature request related to a problem or challenge?

Describe the solution you'd like

Describe alternatives you've considered

Additional context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Preserving sort on `UnionExec` inputs instead of introducing a suboptimal top-level sort #18380