
Conversation


@pkolaczk pkolaczk commented Oct 29, 2025

CNDB-15485: Fix ResultRetriever key comparison to prevent dupes in result set (#2024)

(cherry picked from commit ada025c)

Copy of #2023, but targeting the `cndb-main-release-202510` branch.

https://github.com/riptano/cndb/issues/15485

This PR fixes a bug introduced to this branch via #1884. The bug only impacts SAI file format `aa` when the index file was produced via compaction, which is why the modified test simply adds coverage that compacts the table and hits the bug.

The bug happens when an iterator produces the same partition across two different batch fetches from storage. These keys were not collapsed by the `key.equals(lastKey)` logic because compacted indexes use a row id per row instead of per partition, and the logic in `PrimaryKeyWithSource` considers rows with different row ids to be distinct. However, when we went to materialize a batch from storage, we hit this code:

```java
ClusteringIndexFilter clusteringIndexFilter = command.clusteringIndexFilter(firstKey.partitionKey());
if (cfs.metadata().comparator.size() == 0 || firstKey.hasEmptyClustering())
{
    return clusteringIndexFilter;
}
else
{
    nextClusterings.clear();
    for (PrimaryKey key : keys)
        nextClusterings.add(key.clustering());
    return new ClusteringIndexNamesFilter(nextClusterings, clusteringIndexFilter.isReversed());
}
```

which returned `clusteringIndexFilter` for `aa` because those indexes do not have the clustering information. Therefore, each batch fetched the whole partition (which was subsequently filtered to the proper results), producing a multiplier effect where we saw as many duplicates as there were batches.
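For illustration only, here is a minimal, self-contained sketch of the failure mode. The class and field names below are hypothetical simplifications rather than the real `PrimaryKeyWithSource` API: when equality is keyed on the per-row source row id, a partition that reappears in a later batch is not collapsed.

```java
import java.util.Objects;

// Hypothetical, simplified stand-in for a source-aware key: equality is driven by the
// per-row source row id, so the same partition seen again in a later batch fetch
// (under a different row id) is NOT collapsed by a key.equals(lastKey) check.
final class KeyWithSourceSketch
{
    final String partitionKey; // simplified; the real code uses a DecoratedKey
    final long sourceRowId;    // compacted aa indexes assign a row id per row, not per partition

    KeyWithSourceSketch(String partitionKey, long sourceRowId)
    {
        this.partitionKey = partitionKey;
        this.sourceRowId = sourceRowId;
    }

    @Override
    public boolean equals(Object o)
    {
        return o instanceof KeyWithSourceSketch
               && ((KeyWithSourceSketch) o).sourceRowId == sourceRowId;
    }

    @Override
    public int hashCode()
    {
        return Objects.hash(sourceRowId);
    }

    public static void main(String[] args)
    {
        KeyWithSourceSketch lastKey = new KeyWithSourceSketch("pk1", 10);
        KeyWithSourceSketch key = new KeyWithSourceSketch("pk1", 11);
        // Same partition, different row ids -> not treated as a duplicate, so the whole
        // partition is fetched again and its rows are emitted once per batch.
        System.out.println(key.equals(lastKey)); // prints: false
    }
}
```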

This fix works by comparing partition keys and clustering keys directly, which is a return to the old comparison logic from before #1884. There was actually a discussion about this in the PR to `main`, but unfortunately we missed this case: #1883 (comment).
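As a rough illustration of the restored comparison (again with hypothetical, simplified names rather than the real key classes), deduplication now keys on the logical primary key, i.e. partition key plus clustering:

```java
// Hypothetical sketch of the restored comparison: a key that reappears in a later batch
// compares equal as long as its partition key and clustering match, regardless of which
// sstable row id produced it, so the duplicate is dropped.
final class LogicalKeySketch
{
    final String partitionKey; // simplified; the real code uses a DecoratedKey
    final String clustering;   // simplified; the real code uses a Clustering<?>

    LogicalKeySketch(String partitionKey, String clustering)
    {
        this.partitionKey = partitionKey;
        this.clustering = clustering;
    }

    /** True when both keys refer to the same row, ignoring which source produced them. */
    boolean sameLogicalKey(LogicalKeySketch other)
    {
        return partitionKey.equals(other.partitionKey) && clustering.equals(other.clustering);
    }

    public static void main(String[] args)
    {
        LogicalKeySketch lastKey = new LogicalKeySketch("pk1", "c1");
        LogicalKeySketch key = new LogicalKeySketch("pk1", "c1");
        System.out.println(key.sameLogicalKey(lastKey)); // prints: true -> duplicate collapsed
    }
}
```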

A more proper long-term fix might be to remove the logic of creating a `PrimaryKeyWithSource` for `aa` indexes. However, I preferred this approach because it is essentially a revert rather than a fix-forward solution.

@github-actions

Checklist before you submit for review

  • This PR adheres to the Definition of Done
  • Make sure there is a PR in the CNDB project updating the Converged Cassandra version
  • Use NoSpamLogger for log lines that may appear frequently in the logs
  • Verify test results on Butler
  • Test coverage for new/modified code is > 80%
  • Proper code formatting
  • Proper title for each commit starting with the project-issue number, like CNDB-1234
  • Each commit has a meaningful description
  • Each commit is not very long and contains related changes
  • Renames, moves and reformatting are in distinct commits
  • All new files should contain the DataStax copyright header instead of the Apache License one


@eolivelli eolivelli left a comment


LGTM


@cassci-bot

❌ Build ds-cassandra-pr-gate/PR-2092 rejected by Butler


1 regression found
See build details here


Found 1 new test failures

| Test | Explanation | Runs | Upstream |
| --- | --- | --- | --- |
| o.a.c.concurrent.StageTimeMeasurementTest.executionAndQueueTimeAreCountedOnSubmitWithResult (compression) | NEW | 🔴🔵 | 0 / 0 |

No known test failures found

@pkolaczk pkolaczk merged commit 9e7fddc into cndb-main-release-202510 Oct 30, 2025
488 of 493 checks passed
@pkolaczk pkolaczk deleted the c15485-oct-release branch October 30, 2025 12:35
