Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CNDB-12290 CC5 fix to use TrieMemtable as default in MemtableParams #1481

Open
wants to merge 1,289 commits into
base: main-5.0
Choose a base branch
from

Conversation

djatnieks
Copy link

@djatnieks djatnieks commented Dec 31, 2024

What is the issue

Update CC 5.0 to use TrieMemtable by default in MemtableParams to align with main branch.

What does this PR fix and why was it fixed

Updates MemtableParams to use DefaultMemtableFactory, which uses TrieMemtable, instead of using SkipTableMemtableFactory.

Checklist before you submit for review

  • Make sure there is a PR in the CNDB project updating the Converged Cassandra version
  • Use NoSpamLogger for log lines that may appear frequently in the logs
  • Verify test results on Butler
  • Test coverage for new/modified code is > 80%
  • Proper code formatting
  • Proper title for each commit staring with the project-issue number, like CNDB-1234
  • Each commit has a meaningful description
  • Each commit is not very long and contains related changes
  • Renames, moves and reformatting are in distinct commits

Sorry, something went wrong.

blambov and others added 30 commits August 23, 2024 16:47
…t queries (#1245)

Fixes a race condition when a filter-then-sort query gets intertwined with a deletion (or upsert) and a flush, only with DC/v5 format. The fix is simply considering that the postings might be empty after reading a vector.
…a.util to be able to find the expected Striped.custom method.
…fault for the initial build task and always mark the index queryable in a callback; in CC 5.0, directly calling markIndexBuilt leads to deadlock through a nested load request to LoadingMap.blockingLoadIfAbsent.
Update expected output to match CC change to limit expr strings to 9 characters in RowFilter.SimpleExpression.toString from commit "Use vector index node count estimates when planning TopK queries (#1164)". With this change, the result of toCqlString(query) may not be valid CQL due to truncation.
…ace with DatabaseDescriptor.createAllDirectories.

Fixes flaky test failures observed in AbstractReadQueryToCQLStringTest and ClearSnapshotTest.
When pre-filtering, sorting happens on already filtered rows,
so the topK limit must not be divided by the selectivity.

Fixes riptano/cndb#10542
This gets us to the IndexNotAvailableException
This commit fixes a minor bug in SAI cost estimation code that caused
the cost of intersections to be sometimes overestimated.
The estimation code didn't account for the fact that we may reach
the end of the iterator before finding a matching key.
* update ANN costs and scale disk accesses by cache hit rate
* Integrate brute-force estimation with Plan
* ANN costs better match the query execution
* include rawExpectedNodesVisited in trace when surprised
* LinearFit handles duplicate X values instead of returning NaN
* split sort into ANN_SORT_OPEN_COST and ANN_SORT_KEY_COST
* incorporate rerankK in estimation
* remove DISK_ACCESS_COST, I don't think it accurately models the difference between an sai search (multiple random-access requests to disk) and fetching a neighbor list (single request)
* update statistics on the unrestricted search path
* use actual degree in cost estimate instead of hardcoded default
* delete SelectiveIntersectionTest, it no longer adds value wrt PlanTest, and high SAI_OPEN_COST means that it's not actually correct to use index scans on the tiny table involved
* rename FilterSortOrder constants
CNDB-9441: fix SAI rebuild order to complete transaction first
* remove CassandraOnDiskHnsw
* move CloseableReranker to upper level
* remove JVectorLuceneOnDiskGraph and VectorSupplier
* relocated DiskBinarySearch and remove the rest of the sai.disk.v2.hnsw package
* containsUnitVectors is no longer Optional
Includes SortedIterator and TopKSelector, as well as a comparator-accepting
LucenePriorityQueue.
* Vector: Release GraphSearcherAccessManager on exception

* Use injection to force test failure
…tations (#1228) (#1256)

CNDB-7178: Keep track of failed segments when applying mutations

We don't want to delete failed segments if we've failed to apply mutation.
Rather provide hooks so we can act on failed mutation.
…oup of SAI expressions (#1246)

NEQ is unhandled in this switch, leading to a NPE under certain query conditions where an Expression is created and no operation is set for this expression.
…ng index build (#1258)

- use writeTimeReadFileChannelFor() for recalculateChecksum on appending index segment
Introduce a new SAI index option, equals_behaviour_when_analyzed, to decide the
behaviour of the equals (=) operator on columns with an analyzed index.
Possible values are:
* MATCH: The = operator will behave the same as the : operator. A client
  warning will be emitted on SELECT recommending the user to use  the : operator
  instead. This is the default value for backward compatibility.
* UNSUPPORTED: The = operator will be rejected on analyzed columns. This will
  be the default in the future.
jkni and others added 18 commits December 17, 2024 18:32
Unbounded queue length at the native transport stage can caused large
backlogs of requests that the system processes, even though clients may
no longer expect a response.

This PR implements a limited backport of CNDB-11070, introducing the
notion of a native request timeout that can shed messages with excessive
queue times at the NTR stage as well as async read/write stages, if
enabled. Cross-node message timeouts are also now respected earlier in
the mutation verb handler.

This is a fairly straightforward cherry-pick of
#1393 targeting main instead
of cc-main-migration-release.
This reverts commit c06c94c.

It seems the removal of `Index#postProcessor` by CNDB-11762 broke some
tests in CNDB's `MultiNodeBillingTest`. Unfortunately that patch [was
merged before creating the PR bumping the CC version used by
CNDB](#1422 (comment)).
[The CNDB PR](riptano/cndb#12076) was created
after that merging but it was superseded by other CC version bumps.

So I'm adding this reversal so we can investigate how the removal of
`Index#postProcessor` affects those tests.
This patch replaces null values of `deterministic`, `monotonic` and
`monotonic_on` columns in `system_schema.functions` and
`system_schema.aggregates` with negative defaults. These defaults will
be addressed if/once DB-672 gets ported to CC.
There are two mechanisms of detecting that the cluster is in the upgrade
state and the minimum version. Both are slightly different, and both are
not pluggable which means that CNDB doesn't work properly with them.

Those mechanisms are implemented in `Gossiper`. Although we do not use
`Gossiper` in CNDB, there are classes like `ColumnFilter` which go to
`Gossiper` to check the upgrade state.

So far, using that stuff in CDNB was a bit unpredictable, some of them
reported the cluster is upgraded and in the current version, the other
did not.

This turned out to be a problem, especially for the `ColumnFilter`
because when we upgrade DSE --> CC, CC assumes that the newest filter
version should be used, which is not correctly deserialized and
interpreted by DSE.

The fix is not small, but it probably simplifies stuff a bit.

First of all, two mechanism are merged into one. Moreover, we added
pluggability of it so that we can provide the appropriate implementation
in CNDB coordinators and writers, which is based on ETCD.
Part of riptano/cndb#12139

Moves constant shard count outside looping shards to reduce confusion.
…with DurationSpec type and 'native_transport_timeout_in_ms' as convertible old name with Long type; add some tests.
…MemtableIndexTest and TrieMemtableIndexAllocationsHeapBuffersTest from main branch.
…strictions (#1449)

Closes riptano/cndb#12139

This PR adds a test of row count of a SAI plan in the presence of
restrictions. Currently it tests queries with inequality, equality and
half-ranges on different SAI column types and with or without
histograms.
…g VIntOutOfRangeException to the catch block of SSTableIdentityIterator.exhaust method.
…pactionProgress to return the operation type from the first entry in the shared progress; needed in cases that a CompactionTask type is changed after creation.
…opriate (#1469)

Fixes riptano/cndb#12239

We found the System.nanoTime was using significant cpu cost, but because
the timeout is high enough, we can accept the inaccuracy.

- [ ] Make sure there is a PR in the CNDB project updating the Converged
Cassandra version
- [ ] Use `NoSpamLogger` for log lines that may appear frequently in the
logs
- [ ] Verify test results on Butler
- [ ] Test coverage for new/modified code is > 80%
- [ ] Proper code formatting
- [ ] Proper title for each commit staring with the project-issue
number, like CNDB-1234
- [ ] Each commit has a meaningful description
- [ ] Each commit is not very long and contains related changes
- [ ] Renames, moves and reformatting are in distinct commits
Fixes regression in jvector 3.0.4 when compacting PQVectors larger than 2GB
### What is the issue
SimpleClientPerfTest has been failing in CI since changes from
CNDB-10759

### What does this PR fix and why was it fixed
This change in `SimpleClientPerfTest`, updates the anonymous class
`Message.Codec<QueryMessage>` to override the correct method, `public
CompletableFuture<Response> maybeExecuteAsync` from `QueryMessage`,
whose signature was changed as part of CNDB-10759.

### Checklist before you submit for review
- [ ] Make sure there is a PR in the CNDB project updating the Converged
Cassandra version
- [ ] Use `NoSpamLogger` for log lines that may appear frequently in the
logs
- [ ] Verify test results on Butler
- [ ] Test coverage for new/modified code is > 80%
- [ ] Proper code formatting
- [ ] Proper title for each commit staring with the project-issue
number, like CNDB-1234
- [ ] Each commit has a meaningful description
- [ ] Each commit is not very long and contains related changes
- [ ] Renames, moves and reformatting are in distinct commits
…ing for async batchlog removal (#1485)

The test asserts that the batchlog is removed immediately after the
write completes, but removal of the batchlog is async and can be
delayed, particularly in resource-limited environments like CI.
The test generates partition index accesses by reusing the same key, and if the key cache is enabled, the test will fail for bigtable profiles because the key will be in the key cache.
…by filtering queries (#1484)

Queries creating fake index contexts each create their own context,
which can then race on metric registration (as the metrics have the same
patterns). This can cause a query to fail. These metrics are superfluous, 
we can skip creating them entirely.
…ate.accumulatedDataSize; it only worked to fix SensorsWriteTest.testMultipleRowsMutationWithClusteringKey for SkipListMemtable and may not be necessary if the default memtable becomes TrieMemtable. Revisit SensorsWriteTest later if necessary.
Move static class TrieMemtable.Factory to TrieMemtableFactory class;
Use suggested TriePartitionUpdate.unsharedHeapSize implementation;
Use InMemoryTrie.shortLived in TrieToDotTest and TrieToMermaidTest;
Add specific versions aa, ca, and da to RowIndexTest;
Add addMemoryUsageTo in SkipListMemtable and TrieMemtable
Add TrieMemtable.switchOut
Base automatically changed from CNDB-12154 to main-5.0 January 17, 2025 23:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet