CNDB-15608 simplify ceiling row id from primary key #2142

k-rus · 2025-12-01T09:33:34Z

What is the issue

PartitionAwarePrimaryKeyMap implements an overcomplicated ceiling method that calls exactRowIdOrInvertedCeiling.
Part of https://github.com/riptano/cndb/issues/15608

What does this PR fix and why was it fixed

Simplifies PartitionAwarePrimaryKeyMap.ceiling to use the corresponding correct method from the reader directly.
This can be seen as a follow-up to https://github.com/datastax/cassandra/pull/1096/files#diff-c5011580ab9b0d99d9e504570c4cccb152221d3dbe62c8a956e83fce9070b380

CNDB's PR: https://github.com/riptano/cndb/pull/16154

PartitionAwarePrimaryKeyMap implements an unused exactRowIdOrInvertedCeiling method and an overcomplicated ceiling method.

github-actions · 2025-12-01T09:33:50Z

sonarqubecloud · 2025-12-01T12:51:45Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
100.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

cassci-bot · 2025-12-01T12:59:11Z

❌ Build ds-cassandra-pr-gate/PR-2142 rejected by Butler

4 regressions found
See build details here

Found 4 new test failures

Test	Explanation	Runs	Upstream
o.a.c.distributed.test.FullRepairCoordinatorTimeoutTest.prepareRPCTimeout[DATACENTER_AWARE/true]	REGRESSION	🔵🔴	0 / 19
o.a.c.distributed.test.repair.ConcurrentValidationRequestsTest.testConcurrentValidations	REGRESSION	🔴🔵	0 / 19
o.a.c.index.sai.cql.VectorCompaction100dTest.testZeroOrOneToManyCompaction[dc true]	NEW	🔴⚪	0 / 19
o.a.c.index.sai.cql.VectorSiftSmallTest.testSiftSmall[dc false]	REGRESSION	🔴⚪	0 / 19

Found 2 known test failures

k-rus · 2025-12-01T14:46:04Z

4 regressions found
See build details here

Found 4 new test failures

Test	Explanation	Runs	Upstream
o.a.c.distributed.test.FullRepairCoordinatorTimeoutTest.prepareRPCTimeout[DATACENTER_AWARE/true]	REGRESSION	🔵🔴	0 / 19
o.a.c.distributed.test.repair.ConcurrentValidationRequestsTest.testConcurrentValidations	REGRESSION	🔴🔵	0 / 19
o.a.c.index.sai.cql.VectorCompaction100dTest.testZeroOrOneToManyCompaction[dc true]	NEW	🔴⚪	0 / 19
o.a.c.index.sai.cql.VectorSiftSmallTest.testSiftSmall[dc false]

Test failures are unrelated to the PR.

k-rus · 2025-12-01T14:52:07Z

CNDB's PR: https://github.com/riptano/cndb/pull/16154

michaeljmarshall

LGTM

The implementation is identical, AFAICT. The one minor detail is that neither indexOf nor ceilingRowId actually return negative values beyond the Long.MIN_VALUE or the -1 for when lastIndex >= valueCount. This does not break the logic of ceiling, but does lead to a pessimization that should be fixed. I will create a subsequent issue to track that work.

    @Override
    public long ceilingRowId(long targetValue)
    {
        // already out of range
        if (lastIndex >= valueCount)
            return -1;

        long rowId = findBlockRowId(targetValue);
        lastIndex = rowId >= 0 ? rowId : -rowId - 1;
        return lastIndex >= valueCount ? -1 : lastIndex;
    }

    @Override
    public long indexOf(long targetValue)
    {
        // already out of range
        if (lastIndex >= valueCount)
            return Long.MIN_VALUE;

        long rowId = findBlockRowId(targetValue);
        lastIndex = rowId >= 0 ? rowId : -rowId - 1;
        return rowId >= valueCount ? Long.MIN_VALUE : rowId;
    }

k-rus · 2025-12-01T18:17:20Z

The implementation is identical, AFAICT. The one minor detail is that neither indexOf nor ceilingRowId actually return negative values beyond the Long.MIN_VALUE or the -1 for when lastIndex >= valueCount. This does not break the logic of ceiling, but does lead to a pessimization that should be fixed. I will create a subsequent issue to track that work.

@michaeljmarshall
There is another difference, which is why I mentioned it:
ceilingRowId compares lastIndex >= valueCount, while indexOf compares rowId >= valueCount before returning the actual value or the corresponding negative value. I guess subtracting 1 in lastIndex = rowId >= 0 ? rowId : -rowId - 1 guarantees that lastIndex does not exceed valueCount when rowId is negative.

michaeljmarshall · 2025-12-01T18:50:14Z

In re-reviewing the logic, I was mistaken about the misused methods and the pessimization. Everything looks correct to me.

ceilingRowId compares lastIndex >= valueCount, while indexOf compares rowId >= valueCount before returning the actual value or the corresponding negative value. I guess subtracting 1 in lastIndex = rowId >= 0 ? rowId : -rowId - 1 guarantees that lastIndex does not exceed valueCount when rowId is negative.

I'm not sure I follow your point about exceeding valueCount. As far as I can tell, this logic:

        long rowId = findBlockRowId(targetValue);
        lastIndex = rowId >= 0 ? rowId : -rowId - 1;
        return rowId >= valueCount ? Long.MIN_VALUE : rowId;

works by getting either the exact row match back or -(low + 1) from the binary search when no match is found. The -rowId - 1 undoes that -(low + 1) logic and gets the next value. When we check rowId >= valueCount, I think it would work fine if we also had lastIndex >= valueCount since in the negative case, the caller needs to convert the inverted ceiling to a value and then discovers it is out of scope anyway.
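The -(low + 1) encoding described above is the same convention that java.util.Arrays.binarySearch uses, so it can be illustrated with a minimal sketch. The class name InvertedCeilingDemo and the sample values are hypothetical, and the sketch assumes findBlockRowId follows the same convention; it is not the reader's actual code.

```java
import java.util.Arrays;

// Hypothetical demo class; it only illustrates the -(low + 1) encoding.
final class InvertedCeilingDemo
{
    // Mirrors: lastIndex = rowId >= 0 ? rowId : -rowId - 1
    // An exact match is returned as is; a negative result is turned back
    // into the insertion point, i.e. the index of the ceiling value.
    static long decode(long rowId)
    {
        return rowId >= 0 ? rowId : -rowId - 1;
    }

    public static void main(String[] args)
    {
        long[] values = { 10L, 20L, 30L };

        // Exact match: the index itself.
        long exact = Arrays.binarySearch(values, 20L);
        // No match, insertion point 2: encoded as -(2 + 1).
        long missing = Arrays.binarySearch(values, 25L);
        // No match, insertion point 3 == values.length: encoded as -(3 + 1).
        long pastEnd = Arrays.binarySearch(values, 35L);

        System.out.println(exact + " -> " + decode(exact));     // 1 -> 1
        System.out.println(missing + " -> " + decode(missing)); // -3 -> 2
        System.out.println(pastEnd + " -> " + decode(pastEnd)); // -4 -> 3
    }
}
```

In the last case the decoded index equals the number of values, which is the out-of-range condition the comment refers to.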

k-rus · 2025-12-01T18:56:53Z

The -rowId - 1 undoes that -(low + 1) logic and gets the next value.

This is for lastIndex.

When we check rowId >= valueCount, I think it would work fine if we also had lastIndex >= valueCount since in the negative case, the caller needs to convert the inverted ceiling to a value and then discovers it is out of scope anyway.

Why would it be out of scope? lastIndex is a non-negative value because of the -(low + 1) inversion, and it might not go out of scope the way rowId >= valueCount would for a negative rowId.

cc @michaeljmarshall
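The boundary case under discussion can be reproduced with a small model. This is a sketch under stated assumptions: ModelReader is a hypothetical stand-in, its findBlockRowId simply delegates to Arrays.binarySearch over the whole array, and the real reader's block search may differ. The two public methods copy the bodies quoted earlier in the thread.

```java
import java.util.Arrays;

// Hypothetical stand-in for the reader quoted in the review; not the real class.
final class ModelReader
{
    private final long[] values;    // sorted ascending
    private final long valueCount;
    private long lastIndex = 0;

    ModelReader(long... sortedValues)
    {
        this.values = sortedValues;
        this.valueCount = sortedValues.length;
    }

    // Assumption: exact index on a match, -(insertionPoint + 1) otherwise.
    private long findBlockRowId(long targetValue)
    {
        return Arrays.binarySearch(values, targetValue);
    }

    long ceilingRowId(long targetValue)
    {
        if (lastIndex >= valueCount)
            return -1;
        long rowId = findBlockRowId(targetValue);
        lastIndex = rowId >= 0 ? rowId : -rowId - 1;
        return lastIndex >= valueCount ? -1 : lastIndex;
    }

    long indexOf(long targetValue)
    {
        if (lastIndex >= valueCount)
            return Long.MIN_VALUE;
        long rowId = findBlockRowId(targetValue);
        lastIndex = rowId >= 0 ? rowId : -rowId - 1;
        return rowId >= valueCount ? Long.MIN_VALUE : rowId;
    }

    public static void main(String[] args)
    {
        // Target beyond the last value: insertion point is 3 == valueCount.
        System.out.println(new ModelReader(10L, 20L, 30L).ceilingRowId(35L)); // -1

        // indexOf returns the inverted ceiling; a negative rowId can never
        // satisfy rowId >= valueCount, so that check does not fire here.
        ModelReader reader = new ModelReader(10L, 20L, 30L);
        long inverted = reader.indexOf(35L);
        System.out.println(inverted);      // -4
        System.out.println(-inverted - 1); // 3, which the caller sees as out of range

        // lastIndex is now 3, so the early-exit guard handles later calls.
        System.out.println(reader.indexOf(20L) == Long.MIN_VALUE); // true
    }
}
```

In this model ceilingRowId reports the out-of-range case directly as -1, while indexOf hands back -4 and leaves it to the caller to decode 3 and compare it with valueCount. Whether rowId >= valueCount can ever be true in the real reader depends on what findBlockRowId returns there, which this sketch does not establish.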

PartitionAwarePrimaryKeyMap implements an overcomplicated `ceiling` method that calls `exactRowIdOrInvertedCeiling`. This commit simplifies PartitionAwarePrimaryKeyMap.ceiling to use the corresponding correct method from the reader directly. This can be seen as a follow-up to https://github.com/datastax/cassandra/pull/1096/files#diff-c5011580ab9b0d99d9e504570c4cccb152221d3dbe62c8a956e83fce9070b380

CNDB-15608 simplify ceiling row id from primary key

7ac85ff

PartitionAwarePrimaryKeyMap implements unused exactRowIdOrInvertedCeiling and overcomplicated ceiling method.

Rollback unsupported as it's used from v2

de5869f

k-rus changed the title ~~[WIP] CNDB-15608 simplify ceiling row id from primary key~~ CNDB-15608 simplify ceiling row id from primary key Dec 1, 2025

k-rus requested review from jkni and michaeljmarshall December 1, 2025 14:48

jkni approved these changes Dec 1, 2025


michaeljmarshall approved these changes Dec 1, 2025


k-rus merged commit 42ae0f3 into main Dec 1, 2025
486 of 496 checks passed

k-rus deleted the rf-15608-simplify-v1-ceiling branch December 1, 2025 18:49

5 participants
