fix: 🚑 make it so `is_in_bucket` is consistent across same file key records #598

TDemeco · 2025-12-05T20:45:08Z

This PR fixes a bug where the is_in_bucket field of files in the indexer DB wasn't being populated correctly when creating new file records for a file that was already previously stored by the MSP.

We allow new storage requests for file keys that have already been fulfilled to allow the user to add redundancy to them (in the form of more BSPs), but since the MSP already has the file from the previous storage request, it can accept the new one with an inclusion proof instead of a non-inclusion proof. This makes it so the runtime doesn't update its bucket root (since it shouldn't, as the bucket already had the file from before), which means the MutationsApplied event is never emitted, and this is the event that the indexer detects to update the is_in_bucket field of a file record.
This made it so the new file records in the indexer DB created by these subsequent storage requests permanently had their is_in_bucket field set to false, and so this could create inconsistencies which could lead to the indexer trying to delete a file record that still has an MSP association, failing to import the block and stalling.

The fix is twofold:

First, correct the mistake in the current records found in the DB, by executing a migration that checks for file records that have both is_in_bucket set to false and a sibling record (i.e. another file record for the same file key) that has its is_in_bucket set to true (I believe it would have been enough to check the oldest sibling record, but just in case we check all of them), and setting the is_in_bucket field for those files to true.
Then the more permanent fix is that now when the indexer creates a new file record, instead of defaulting the is_in_bucket field of the new record to false, it checks if any sibling record has its is_in_bucket field set to true and if so, creates the new record with the field set to true as well.

Note

As a comment, I left a function to check MSP associations to file records in the indexer. This is not currently used, but I believe we should add that check before every deletion once we are up and running in an environment where the indexer stalling would end up being critical and we rather keep it working while we work on the fix of whatever caused the inconsistency. Fixing inconsistent information could prove hard though, so we might end up deciding not to use this function at all and maintain the indexer's current behaviour.

⚠️ Breaking Changes ⚠️

Short description

There's a new migration for the indexer DB that must be executed for all existing DBs.
Who is affected
- Indexer node runners since they'll have to run the new migration.
Suggested code changes

No code changes required.

…ecords

HermanObst · 2025-12-05T21:07:43Z

Starting 👀

client/indexer-db/migrations/.diesel_lock

HermanObst · 2025-12-05T21:24:25Z

...nt/indexer-db/migrations/2025-12-05-191030_normalize_is_in_bucket_across_file_records/up.sql

+-- If ANY file record with a given file_key has is_in_bucket=true, then ALL
+-- records for that file_key should have is_in_bucket=true. This is because


So the only way this can happen is if the SR is already accepted by the MSP and the user send other one to increase?

Yes, as that's the only way to have more than one file record for the same file key.

client/indexer-service/src/handler.rs

HermanObst · 2025-12-05T21:34:16Z

client/indexer-service/src/handler.rs

                        let block_hash_bytes = block_hash.as_bytes().to_vec();
                        let tx_hash_bytes = evm_tx_hash.map(|h| h.as_bytes().to_vec());

+                        // Check if this file key is already present in the bucket of the MSP


So when the user ask for more redundancy the MSP needs to "re-accept" it, right?

Yes, otherwise the new storage request will expire without an MSP response, so it will be considered rejected.

HermanObst

LGTM.
For tracking reasons, it would be cool to be bit more specific with the current bug.
e.g: exactly what was happening with the files with the flag set on false, and why these where triggering a failure in the block import and thus the stall.

Not for this PR, but it would be high priority that we add a test that replicates this specific case, that way we can't re introduce the bug later.
@santikaplan

TDemeco · 2025-12-05T21:54:47Z

LGTM. For tracking reasons, it would be cool to be bit more specific with the current bug. e.g: exactly what was happening with the files with the flag set on false, and why these where triggering a failure in the block import and thus the stall.

It's a bit convoluted and hard to explain in detail so it wouldn't add much context, but simplifying it a bit the issue we faced was:
The indexer checks after each deletion done by the fisherman whether a file record of that file has any remaining association (BSP or MSP) and tries to delete the file record from the DB if there are none. Since the is_in_bucket for a file was incorrectly set to false even though it had an MSP association, the indexer thought that it was safe to delete it from the DB, tried, but the DB constraint of not allowing a file record deletion if it has any remaining associations made it so the delete returned an error, to which the indexer reacted by erroring out from the block indexing function and retrying for the same block, indefinitely.

fix: 🚑 make it so is_in_bucket is consistent across same file key r…

2bc19e8

…ecords

TDemeco requested a review from snowmead December 5, 2025 20:45

test: ✅ add missing is_in_bucket parameter in backend postgres tests

c59803e

TDemeco requested a review from HermanObst December 5, 2025 20:54

HermanObst reviewed Dec 5, 2025

View reviewed changes

client/indexer-db/migrations/.diesel_lock Outdated Show resolved Hide resolved

snowmead approved these changes Dec 5, 2025

View reviewed changes

HermanObst reviewed Dec 5, 2025

View reviewed changes

client/indexer-service/src/handler.rs Outdated Show resolved Hide resolved

HermanObst reviewed Dec 5, 2025

View reviewed changes

fix: 🩹 amend review

243bb5f

HermanObst approved these changes Dec 5, 2025

View reviewed changes

TDemeco merged commit 0a076dc into main Dec 5, 2025
43 checks passed

TDemeco deleted the fix/is-in-bucket-consistency branch December 5, 2025 22:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: 🚑 make it so `is_in_bucket` is consistent across same file key records #598

fix: 🚑 make it so `is_in_bucket` is consistent across same file key records #598

TDemeco commented Dec 5, 2025 •

edited

Loading

Uh oh!

HermanObst commented Dec 5, 2025

Uh oh!

Uh oh!

HermanObst Dec 5, 2025

Uh oh!

TDemeco Dec 5, 2025

Uh oh!

Uh oh!

HermanObst Dec 5, 2025

Uh oh!

TDemeco Dec 5, 2025

Uh oh!

HermanObst left a comment •

edited

Loading

Uh oh!

TDemeco commented Dec 5, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		-- If ANY file record with a given file_key has is_in_bucket=true, then ALL
		-- records for that file_key should have is_in_bucket=true. This is because

fix: 🚑 make it so is_in_bucket is consistent across same file key records #598

fix: 🚑 make it so is_in_bucket is consistent across same file key records #598

Conversation

TDemeco commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Note

⚠️ Breaking Changes ⚠️

Uh oh!

HermanObst commented Dec 5, 2025

Uh oh!

Uh oh!

HermanObst Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

TDemeco Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

HermanObst Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

TDemeco Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

HermanObst left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

TDemeco commented Dec 5, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fix: 🚑 make it so `is_in_bucket` is consistent across same file key records #598

fix: 🚑 make it so `is_in_bucket` is consistent across same file key records #598

TDemeco commented Dec 5, 2025 •

edited

Loading

HermanObst left a comment •

edited

Loading