Improve BFT resilience of BlockBuffer stores #8196

jordanschalm · 2025-11-26T22:02:49Z

This PR refactors the PendingBlockBuffer used by compliance engines to store disconnected blocks.

Consolidates Proposal types using generics (similar to Block)
Consolidates PendingBlockBuffer using generics, so we have one type-safe implementation
Changes the backend of PendingBlockBuffer to use LevelledForest instead of bespoke backend
- We remove the DropForParentID method, because LevelledForest has no parallel of this (it's a very minor memory optimization, and has the potential to drop unprocessed blocks in a concurrent environment)
Adds a ActiveViewRangeSize to the PendingBlockBuffer, which puts an upper limit on what views we will accept when adding blocks.
- The compliance engine does already have a similar mechanism (SkipNewProposalsThreshold), however including it at the PendingBlockBuffer layer as well is inexpensive and provides a more isolated guarantee that the memory usage is bounded, so I was inclined to include it here as well.

See #8170 for detail.

NOTE: Some of these changes (primarily the test suite) were done with AI (Cursor CLI - Composer 1). I marked commits which were substantially authored by AI with an ai: prefix (just so you are aware -- I have reviewed them 😃).

Summary by CodeRabbit

Bug Fixes
- Finalization and proposal processing now surface and propagate processing errors.
New Features / Refactor
- Replaced pending-block cache with a view-aware, forest-backed generic buffer; enforces active-range limits and reduces redundant parent requests.
- Unified proposal types and added richer proposal validation and handling paths.
Tests
- Added comprehensive buffer tests and updated mocks to match new buffer and proposal APIs.
Chores
- Added a typed BeyondActiveRange error for out-of-range proposals.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

this should allow removing some type overhead in the BlockBuffer types

github-actions · 2025-11-26T22:03:01Z

Dependency Review

✅ No vulnerabilities or license issues or OpenSSF Scorecard issues found.

Scanned Files

None

codecov-commenter · 2025-11-26T22:07:03Z

Codecov Report

❌ Patch coverage is 30.51471% with 189 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
module/mock/generic_pending_block_buffer.go	0.00%	70 Missing ⚠️
module/mock/pending_block_buffer.go	0.00%	24 Missing ⚠️
module/mock/pending_cluster_block_buffer.go	0.00%	24 Missing ⚠️
module/mock/buffered_proposal.go	0.00%	16 Missing ⚠️
engine/collection/compliance/core.go	22.22%	10 Missing and 4 partials ⚠️
engine/consensus/compliance/core.go	62.16%	10 Missing and 4 partials ⚠️
module/mempool/errors.go	0.00%	10 Missing ⚠️
module/buffer/pending_blocks.go	91.66%	5 Missing ⚠️
engine/collection/epochmgr/factories/compliance.go	0.00%	4 Missing ⚠️
engine/collection/compliance/engine.go	0.00%	1 Missing and 1 partial ⚠️
... and 4 more

📢 Thoughts on this report? Let us know!

remove some ai-generated tests and adjust implementation and docs

it already exists in the compliance engine as well, but including in mempool provides stronger localized guarantee of behaviour

module/buffer.go

durkmurder · 2025-12-01T16:45:03Z

module/buffer/pending_blocks.go

+//
+//structwrite:immutable
+type proposalVertex[T module.BufferedProposal] struct {
+	proposal flow.Slashable[T]


How about caching the header to avoid overhead of calling the ProposalHeader() and additional functions in each call?

I have an idea to avoid this entirely by switching around the generic types used - will open a new PR.

durkmurder

🏅 This is great, I like it a lot, a clean implementation and a very useful concept.

tim-barry

Looks good. Thanks for improving the tests.

tim-barry · 2025-12-09T23:15:40Z

I opened #8248 as a potential alternate solution to Yurii's comment about caching ProposalHeader.

…experiment Refactor generic type parameters in BlockBuffer

jordanschalm · 2025-12-13T00:19:00Z

engine/collection/compliance/core.go

 	}

-	// drop all the children that should have been processed now
-	c.pending.DropForParent(blockID)


Removing this line is causing the integration test failure. We assume that c.pending only includes blocks that have not been processed and added to our local state. However, with the change in this PR, we retain blocks until their view is finalized.

Specific problematic case:

There are blocks like A<-B<-C.

We receive B first and cache it in buffer

We receive A and process it. Now A and B are in our local state

Critically, B is also still in the block buffer

We receive C

We first check to see if its parent B is in the block buffer

flow-go/engine/consensus/compliance/core.go

Line 203 in e813f5c

_, found := c.pending.ByID(header.ParentID)

B is in the block buffer. We interpret this as meaning that B does not yet connect to the finalized state, store C in the buffer, and continue.

Now we will never process C

…8170-ln-perm-block-buffer

coderabbitai · 2025-12-22T01:40:23Z

📝 Walkthrough

Walkthrough

Refactors pending-block buffering to a generic, forest-backed implementation with view-range enforcement and error-returning APIs; makes proposals generic; updates buffer construction sites to pass finalized-view and active-range; surfaces buffer and finalization errors in compliance engines; updates mocks and tests to new semantics.

Changes

Cohort / File(s)	Summary
Generic Pending Block Buffer `module/buffer.go`, `module/buffer/pending_blocks.go`, `module/buffer/pending_blocks_test.go`	Introduce `GenericPendingBlockBuffer[T]` / `GenericPendingBlocks[T]` backed by `forest.LevelledForest`; `Add()` and `PruneByView()` return `error`; constructors accept `finalizedView` and `activeViewRangeSize`; add comprehensive tests.
Removed backend & cluster-specific impl `module/buffer/backend.go` (deleted), `module/buffer/backend_test.go` (deleted), `module/buffer/pending_cluster_blocks.go` (deleted)	Remove old backend implementation, its tests, and the prior `PendingClusterBlocks` concrete implementation (replaced by generic alias).
Proposal / Block generics `model/flow/block.go`, `model/cluster/block.go`	Add `GenericProposal[T]`; make `Proposal = GenericProposal[Payload]`; expose `UntrustedProposal` alias; move `ProposalHeader` to generic receiver; add constructors (`NewProposal`, `NewRootProposal`).
Mempool Errors `module/mempool/errors.go`	Add `BeyondActiveRangeError` type, constructor, `Unwrap()`/`Error()` and `IsBeyondActiveRangeError()` predicate.
Compliance engine logic & error propagation `engine/consensus/compliance/core.go`, `engine/consensus/compliance/core_test.go`, `engine/consensus/compliance/engine.go`, `engine/collection/compliance/core.go`, `engine/collection/compliance/core_test.go`, `engine/collection/compliance/engine.go`	`ProcessFinalizedBlock` now returns `error` and callers propagate it; `OnBlockProposal` now validates proposals, distinguishes BeyondActiveRange vs other buffer errors, caches or requests parents conditionally, and removes explicit `DropForParent` cleanup; tests and mocks updated.
Factories & init sites `engine/collection/epochmgr/factories/compliance.go`, `cmd/consensus/main.go`, `consensus/integration/nodes_test.go`, `module/metrics/example/verification/main.go`	Update pending-blocks construction to pass `finalizedView` and `activeViewRangeSize` (threshold) from cluster state or config.
Mocks & generated test helpers `module/mock/pending_block_buffer.go`, `module/mock/pending_cluster_block_buffer.go`, `module/mock/generic_pending_block_buffer.go`, `module/mock/buffered_proposal.go`	Regenerate/adjust mocks to generic proposal types; `Add`/`PruneByView` now return `error`; update return types to `GenericProposal[...]`; remove `DropForParent` mocks; add generic mocks.

Sequence Diagram(s)

sequenceDiagram
  participant Node
  participant Core as ComplianceCore
  participant Validator
  participant Pending as PendingBlocks
  participant Hot as HotStuff

  Note over Node,Core: New OnBlockProposal flow
  Node->>Core: Submit SignedProposal
  Core->>Validator: ValidateProposal(signedProposal)
  alt invalid
    Validator-->>Core: validation error
    Core->>Node: notify violation / update vote-aggregator
  else valid
    Validator-->>Core: valid
    Core->>Pending: Add(proposal)
    alt BeyondActiveRangeError
      Pending-->>Core: BeyondActiveRangeError
      Core->>Node: log & ignore (no error propagation)
    else buffer error
      Pending-->>Core: error
      Core->>Node: return/fail processing (error)
    else added
      Pending-->>Core: ok
      Core->>Hot: forward proposal for HotStuff processing
    end
  end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related issues

[LN Decentralization E] BlockBuffer BFT improvements #8170 — Adds finalized-view-aware PendingBlocks constructor and view-range pruning (matches this PR's buffer constructor and pruning behavior).
[LN Decentralization E] Improve proposal validation in consensus compliance engine #8171 — Moves proposal validation into compliance engine before buffer insertion and adapts to new buffer semantics (matches validation-before-insert changes).
[LN Decentralization E] Improve proposal validation in cluster compliance engine #8172 — Adds engine-side handling for rejected proposals and distinct handling for beyond-active-range vs other errors (matches enhanced error handling and violation notifications).

Suggested reviewers

peterargue
m-Peter

Poem

I hopped through forest nodes with cheer,
Swapped flat lists for branches far and near,
Generics snug, views kept tight,
Errors caught, proposals right,
A rabbit nods — the cache is clear 🥕

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 53.33% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'Improve BFT resilience of BlockBuffer stores' is specific and directly related to the main objective of the PR: refactoring the PendingBlockBuffer to enhance Byzantine Fault Tolerance resilience through generics consolidation and LevelledForest implementation.

✨ Finishing touches

📝 Generate docstrings

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 3

🧹 Nitpick comments (4)

module/mempool/errors.go (1)
67-71: Consider naming consistency with existing constructors.

The existing constructors in this file use the f suffix (NewUnknownExecutionResultErrorf, NewBelowPrunedThresholdErrorf) to indicate they accept format arguments. For consistency, consider renaming to NewBeyondActiveRangeErrorf.
🔎 Proposed fix
-func NewBeyondActiveRangeError(msg string, args ...interface{}) error {
+func NewBeyondActiveRangeErrorf(msg string, args ...interface{}) error {
 	return BeyondActiveRangeError{
 		err: fmt.Errorf(msg, args...),
 	}
 }
module/buffer/pending_blocks.go (1)

55-60: Incomplete TODO requires follow-up.

The documentation explains how ActiveViewRangeSize limits tree depth but notes that tree width (conflicting blocks from byzantine proposers) is not bounded. The TODO is left incomplete. Consider tracking this in an issue if it's outside the scope of this PR.

Would you like me to open an issue to track the implementation of width-limiting for the pending blocks buffer?

module/buffer/pending_blocks_test.go (2)

231-273: Consider making the pruning test more deterministic.

The test uses rand.Intn to select blocks and the prune view, which could theoretically lead to non-deterministic test behavior. While it's unlikely to cause flakiness since any random selection should exercise the pruning logic correctly, consider using a fixed seed or deterministic selection for reproducibility.

Additionally, lines 247-252 may skip loop iterations due to the i%2 == 1 condition combined with the early continue at line 243, resulting in fewer than N blocks being added. This is fine for the test's purpose but worth noting.

358-381: Concurrent test only covers Add operations.

The test verifies concurrent Add operations are safe, which is valuable. However, consider also testing concurrent reads (ByID, ByParentID) mixed with writes to fully exercise the concurrency guarantees documented in the interface ("Safe for concurrent use").

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c0258ee and 5108c97.

📒 Files selected for processing (23)

cmd/consensus/main.go
consensus/integration/nodes_test.go
engine/collection/compliance/core.go
engine/collection/compliance/core_test.go
engine/collection/compliance/engine.go
engine/collection/epochmgr/factories/compliance.go
engine/consensus/compliance/core.go
engine/consensus/compliance/core_test.go
engine/consensus/compliance/engine.go
model/cluster/block.go
model/flow/block.go
module/buffer.go
module/buffer/backend.go
module/buffer/backend_test.go
module/buffer/pending_blocks.go
module/buffer/pending_blocks_test.go
module/buffer/pending_cluster_blocks.go
module/mempool/errors.go
module/metrics/example/verification/main.go
module/mock/buffered_proposal.go
module/mock/generic_pending_block_buffer.go
module/mock/pending_block_buffer.go
module/mock/pending_cluster_block_buffer.go

💤 Files with no reviewable changes (3)

module/buffer/backend.go
module/buffer/pending_cluster_blocks.go
module/buffer/backend_test.go

🧰 Additional context used

🧬 Code graph analysis (14)

engine/collection/epochmgr/factories/compliance.go (2)

module/buffer/pending_blocks.go (1)

NewPendingClusterBlocks (84-90)

fvm/evm/offchain/query/view.go (1)

View (23-31)

engine/collection/compliance/core_test.go (1)

module/buffer/pending_blocks.go (1)

NewPendingClusterBlocks (84-90)

module/mempool/errors.go (1)

fvm/errors/errors.go (1)

As (45-47)

module/mock/generic_pending_block_buffer.go (3)

model/flow/block.go (2)

HashablePayload (14-16)

GenericProposal (168-173)

module/buffer.go (1)

GenericPendingBlockBuffer (14-36)

utils/unittest/fixtures/identifier.go (1)

Identifier (8-8)

module/buffer.go (2)

model/flow/block.go (2)

HashablePayload (14-16)

GenericProposal (168-173)

module/mock/generic_pending_block_buffer.go (1)

GenericPendingBlockBuffer (11-13)

module/buffer/pending_blocks_test.go (9)

module/buffer/pending_blocks.go (2)

GenericPendingBlocks (63-67)

NewPendingBlocks (75-82)

utils/unittest/fixtures/payload.go (1)

Payload (8-8)

utils/unittest/block.go (1)

BlockFixture (14-21)

utils/unittest/fixtures.go (5)

AsSlashable (309-315)

ProposalFromBlock (282-287)

BlockWithParentFixture (398-400)

BlockHeaderFixture (580-600)

IdentifierFixture (1151-1155)

utils/unittest/fixtures/header.go (1)

Header (10-10)

utils/unittest/fixtures/util.go (2)

NoError (6-10)

Assert (13-17)

engine/enqueue.go (1)

Message (12-15)

module/mempool/errors.go (2)

IsBeyondActiveRangeError (82-85)

IsBelowPrunedThresholdError (55-58)

utils/unittest/fixtures/identifier.go (1)

Identifier (8-8)

module/mock/pending_block_buffer.go (4)

module/buffer.go (1)

PendingBlockBuffer (39-39)

model/flow/block.go (1)

GenericProposal (168-173)

utils/unittest/fixtures/payload.go (1)

Payload (8-8)

utils/unittest/fixtures/identifier.go (1)

Identifier (8-8)

engine/consensus/compliance/core_test.go (1)

module/buffer/pending_blocks.go (1)

NewPendingBlocks (75-82)

cmd/consensus/main.go (2)

module/buffer/pending_blocks.go (1)

NewPendingBlocks (75-82)

fvm/evm/offchain/query/view.go (1)

View (23-31)

model/cluster/block.go (1)

model/flow/block.go (1)

GenericProposal (168-173)

module/mock/pending_cluster_block_buffer.go (4)

module/buffer.go (1)

PendingClusterBlockBuffer (42-42)

model/flow/block.go (1)

GenericProposal (168-173)

utils/unittest/fixtures/payload.go (1)

Payload (8-8)

utils/unittest/fixtures/identifier.go (1)

Identifier (8-8)

consensus/integration/nodes_test.go (1)

module/buffer/pending_blocks.go (1)

NewPendingBlocks (75-82)

module/metrics/example/verification/main.go (1)

module/buffer/pending_blocks.go (1)

NewPendingBlocks (75-82)

engine/collection/compliance/core.go (1)

module/metrics/labels.go (1)

ResourceClusterProposal (78-78)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (37)

GitHub Check: Lint (./insecure/)
GitHub Check: Lint (./integration/)
GitHub Check: Lint (./)
GitHub Check: Unit Tests Insecure (insecure/integration/functional/test/gossipsub/rpc_inspector)
GitHub Check: Unit Tests Insecure (insecure/integration/functional/test/gossipsub/scoring)
GitHub Check: Unit Tests Insecure (insecure)
GitHub Check: Unit Tests (network/p2p/scoring)
GitHub Check: Unit Tests (others)
GitHub Check: Unit Tests (cmd)
GitHub Check: Unit Tests (engine)
GitHub Check: Unit Tests (network/p2p/connection)
GitHub Check: Unit Tests (network)
GitHub Check: Unit Tests (network/test/cohort2)
GitHub Check: Unit Tests (engine/verification)
GitHub Check: Unit Tests (network/p2p)
GitHub Check: Unit Tests (network/p2p/node)
GitHub Check: Unit Tests (engine/execution/ingestion)
GitHub Check: Unit Tests (engine/common)
GitHub Check: Unit Tests (engine/consensus)
GitHub Check: Unit Tests (network/test/cohort1)
GitHub Check: Unit Tests (engine/collection)
GitHub Check: Unit Tests (module)
GitHub Check: Unit Tests (network/alsp)
GitHub Check: Unit Tests (engine/execution)
GitHub Check: Unit Tests (engine/access)
GitHub Check: Unit Tests (fvm)
GitHub Check: Unit Tests (ledger)
GitHub Check: Unit Tests (module/dkg)
GitHub Check: Unit Tests (engine/execution/computation)
GitHub Check: Unit Tests (storage)
GitHub Check: Unit Tests (consensus)
GitHub Check: Unit Tests (admin)
GitHub Check: Unit Tests (utils)
GitHub Check: Unit Tests (state)
GitHub Check: Integration Tests Others (integration)
GitHub Check: Docker Build
GitHub Check: analyze-code (go)

🔇 Additional comments (36)

module/mempool/errors.go (1)

59-85: LGTM!

The BeyondActiveRangeError implementation follows the established pattern in this file and correctly implements error chaining with Unwrap(). The predicate function uses errors.As appropriately for error type detection.

module/mock/buffered_proposal.go (1)

1-47: LGTM!

Auto-generated mock file following standard mockery conventions.

model/flow/block.go (2)

166-178: LGTM!

The introduction of GenericProposal[T HashablePayload] with Proposal as a type alias is a clean approach for consolidating proposal types while maintaining backward compatibility. The struct correctly contains the block and proposer signature data.

230-234: LGTM!

The ProposalHeader method correctly operates on the generic type, enabling it to work with any payload type that satisfies HashablePayload.

model/cluster/block.go (2)

82-85: LGTM!

The cluster Proposal type alias to flow.GenericProposal[Payload] correctly leverages the generic infrastructure while maintaining cluster-specific payload handling.

99-135: LGTM!

The constructors correctly validate cluster-specific blocks via NewBlock/NewRootBlock and enforce appropriate signature constraints (non-empty for proposals, empty for root proposals).

module/buffer/pending_blocks.go (4)

13-41: LGTM!

The proposalVertex adapter correctly implements the forest.Vertex interface, caching the block ID for efficient lookups and mapping view to level.

92-114: LGTM!

The Add method correctly validates the block view against the active range and returns an appropriate BeyondActiveRangeError. The check correctly handles the activeViewRangeSize == 0 case as unlimited.

116-146: LGTM!

The ByID and ByParentID methods correctly retrieve proposals from the forest with proper locking. The type assertion from forest.Vertex to proposalVertex[T] is safe given the controlled insertion via Add.

148-170: LGTM!

The PruneByView method correctly adjusts for the exclusive nature of PruneUpToLevel. The highestPrunedView helper correctly inverts the offset applied during construction.

engine/consensus/compliance/engine.go (1)

183-186: LGTM: Error propagation properly added.

The change from fire-and-forget to error-aware finalized block processing is a critical reliability improvement. Errors that were previously silently ignored will now be surfaced and handled appropriately.

consensus/integration/nodes_test.go (1)

520-521: LGTM: Parameterized cache initialization is correct.

The pending blocks cache is now properly initialized with the finalized view and skip threshold, aligning with the new generic buffer API that enforces view-range bounds.

engine/collection/epochmgr/factories/compliance.go (1)

69-73: LGTM: Proper initialization with error handling.

The cache is correctly initialized with the finalized view and skip threshold from configuration. The error handling for the finalized header retrieval ensures failures are properly propagated.

engine/collection/compliance/engine.go (1)

169-171: LGTM: Error propagation properly added.

Consistent with the consensus compliance engine, this change ensures that finalized block processing errors are properly surfaced rather than silently ignored.

engine/collection/compliance/core_test.go (2)

127-127: LGTM: Mock expectations updated correctly.

The mock setup properly reflects the new API:

Add no longer requires a boolean return value configuration (now returns error)

PruneByView correctly returns nil error

Also applies to: 147-147

545-545: LGTM: Real buffer instantiation is correct.

The test correctly instantiates the real buffer with the finalized view from the test head and an appropriate active range size for testing.

cmd/consensus/main.go (1)

808-808: LGTM: Production buffer initialization is correct.

The pending blocks cache is properly initialized with:

The last finalized header's view to establish the baseline

The skip threshold from compliance configuration to bound the active view range

This ensures the buffer enforces view-range constraints during block proposal processing.

engine/consensus/compliance/core_test.go (2)

203-203: LGTM: Mock expectations updated correctly.

The mock setup properly reflects the new API, consistent with the collection compliance tests:

Add no longer requires a boolean return value configuration

PruneByView correctly returns nil error

Also applies to: 223-223

587-587: LGTM: Real buffer instantiation is correct.

The test correctly instantiates the real buffer with the test head's view and an appropriate active range size for testing.

engine/collection/compliance/core.go (4)

96-99: LGTM!

Good addition of error handling for ProcessFinalizedBlock during initialization. This ensures that if pruning fails, the error is properly propagated rather than silently ignored.

199-207: LGTM!

Appropriate handling of BeyondActiveRangeError - logging and returning nil is the correct behavior since blocks beyond the active range should be silently dropped without causing failures. The comment explaining the threshold flexibility is helpful.

221-229: LGTM!

Consistent error handling pattern with the previous Add call site. The BeyondActiveRangeError is correctly treated as a non-fatal condition.

382-394: LGTM!

The signature change from void to error-returning is appropriate. Propagating errors from PruneByView ensures that any unexpected errors during pruning are surfaced to callers rather than silently ignored.

engine/consensus/compliance/core.go (3)

103-106: LGTM!

Consistent error handling for ProcessFinalizedBlock during initialization, matching the collection compliance engine.

248-265: Improved logic for parent request handling.

The restructured logic now:

Adds the block to pending buffer first

Only requests the parent if it's not already in the pending buffer

This reduces duplicate parent requests since if the parent is pending, we've already requested it or will process it soon. The order change is correct.

436-448: LGTM!

Consistent with the collection compliance engine changes - error propagation from PruneByView is now properly handled.

module/buffer.go (2)

8-36: Well-designed generic interface.

The generic GenericPendingBlockBuffer[T flow.HashablePayload] interface is cleanly designed with:

Clear documentation for error conditions

Proper use of type constraints

Consistent method signatures

The error documentation for Add and PruneByView is helpful for callers.

38-42: LGTM!

The type aliases PendingBlockBuffer and PendingClusterBlockBuffer maintain backward compatibility while leveraging the new generic implementation.

module/buffer/pending_blocks_test.go (5)

1-13: LGTM!

Good test setup with appropriate imports. The test file follows standard Go testing patterns using testify/suite.

42-133: Good coverage of Add behavior and edge cases.

Tests comprehensively cover:

Basic add and retrieval

Duplicate handling

Below finalized view (no-op)

Active view range boundary enforcement

Zero active range (no limit)

This aligns well with the interface contract.

156-229: Thorough ByID and ByParentID testing.

Good coverage of:

Existing vs non-existent block lookup

Multiple children per parent

Direct children only (not grandchildren)

275-337: Good multi-step pruning tests.

The tests verify:

Error handling for pruning below finalized view

Sequential pruning behavior

Parent-child relationship integrity after partial pruning

383-421: LGTM!

Good edge case coverage for empty buffer operations and add-after-prune behavior.

module/mock/generic_pending_block_buffer.go (1)

1-139: Auto-generated mock - no review needed.

This file is generated by mockery and correctly implements the GenericPendingBlockBuffer[T] interface. The generic type parameter support and standard mock patterns look correct.

module/mock/pending_block_buffer.go (1)

1-139: Auto-generated mock correctly reflects interface changes.

This file is auto-generated by mockery and correctly implements the updated PendingBlockBuffer interface with:

Generic proposal types (flow.GenericProposal[flow.Payload])

Error-returning Add and PruneByView methods

Removal of DropForParent (as noted in PR objectives)

The mock implementation properly handles error return values and type assertions for the new generic types.

module/mock/pending_cluster_block_buffer.go (1)

1-141: Auto-generated mock correctly reflects interface changes.

This file is auto-generated by mockery and correctly implements the updated PendingClusterBlockBuffer interface with:

Generic proposal types (flow.GenericProposal[cluster.Payload])

Error-returning Add and PruneByView methods

Removal of DropForParent

The implementation is consistent with the parallel mock in pending_block_buffer.go, using the appropriate cluster.Payload type parameter.

coderabbitai · 2025-12-22T01:43:57Z

engine/consensus/compliance/core.go

+	// In general, there are a variety of attacks that a byzantine proposer might attempt. Conceptually,
+	// there are two classes:
+	//	 (I) Protocol-level attacks by either sending individually invalid blocks, or by equivocating with
+	//       by sending a pair of conflicting blocks (individually valid and/or invalid) for the same view.
+	//	(II) Resource exhaustion attacks by sending a large number of individually invalid or valid blocks.
+	// Category (I) will be eventually detected. Attacks of category (II), typically contain elements of (I).
+	// This is because the protocol is purposefully designed such that there is few degrees of freedom available
+	// to a byzantine proposer attempting to mount a resource exhaustion attack (type II), unless the proposer
+	// violates protocol rules i.e. provides evidence of their wrongdoing (type I).
+	// However, we have to make sure that the nodes survive an attack of category (II), and stays responsive.
+	// If the node crashes, the node will lose the evidence of the attack, and the byzantine proposer
+	// will have succeeded in their goal of mounting a denial-of-service attack without being held accountable.
+	//
+	// The general requirements for a BFT system are:
+	// 1. withstand attacks (don't crash and remain responsive)
+	// 2. detect attacks and collect evidence for slashing challenges
+	// 3. suppress the attack by slashing and ejecting the offending node(s)
+	//
+	// The primary responsibility of compliance engine is to protect the business logic from attacks of
+	// category (II) and to collect evidence for attacks of category (I) for blocks that are _individually_
+	// invalid. The compliance engine may detect some portion of equivocation attacks (type I), in order
+	// to better protect itself from resource exhaustion attacks (type II). Though, the primary responsibility
+	// for detecting equivocation is with the hotstuff layer. The reason is that, in case of equivocation with
+	// multiple valid blocks, the compliance engine can't know which block might get certified and potentially
+	// finalized. So it can't reject _valid_ equivocating blocks outright, as that might lead to liveness issues.
+	//
+	// The compliance engine must be resilient to the following classes of resource exhaustion attacks:
+	//  1. A byzantine proposers might attempt to create blocks at many different future views. Mitigations:
+	//     • Only proposals whose proposer is the valid leader for the respective view should pass the compliance
+	//     engine. Block that are not proposed by a valid leader are outright reject and we create a slashing
+	//     challenge against the proposer. This filtering should be done by the compliance engine. Such blocks
+	//     should never reach the higher-level business logic.
+	//     • A byzantine proposer might attempt to create blocks for a large number of different future views,
+	//     for which it is valid leader. This is mitigated by dropping blocks that are too far ahead of the
+	//     locally finalized view. The threshold is configured via `SkipNewProposalsThreshold` parameter.
+	//     This does not lead to a slashing challenge, as we can't reliably detect without investing significant
+	//     CPU resources validating the QC, whether the proposer is violating protocol rules by making up an
+	//     invalid QC / TC. Valid blocks will eventually be retrieved via sync again, once the local finalized
+	//     view catches up, even if they were dropped at first.
+	//  2. A byzantine proposers might spam us with many different _valid_ blocks for the same view, for which
+	//     it is the leader. This is particularly dangerous for
+	//
+


⚠️ Potential issue | 🟡 Minor

Good documentation on BFT resilience.

This comment block provides valuable context about attack categories and the compliance engine's role in mitigating resource exhaustion attacks. However, the comment appears to end abruptly at line 159 with an incomplete sentence fragment about "particularly dangerous for".

🔎 Consider completing the sentence

The comment at lines 156-159 ends with:

// 2. A byzantine proposers might spam us with many different _valid_ blocks for the same view, for which // it is the leader. This is particularly dangerous for //

This appears to be incomplete. Consider finishing the explanation or removing the trailing fragment.

🤖 Prompt for AI Agents

In engine/consensus/compliance/core.go around lines 117-159, the comment block ends with an incomplete fragment at lines 156-159 ("This is particularly dangerous for"); fix it by either completing the sentence to state the intended risk (e.g., "This is particularly dangerous for CPU and memory exhaustion, increased validation latency, and potential liveness/finality stalls when the leader floods the node with competing valid proposals for the same view.") or remove the trailing fragment entirely if it is redundant; ensure the chosen edit keeps the surrounding formatting and punctuation consistent with the existing comment style.

module/buffer/pending_blocks.go

coderabbitai · 2025-12-22T01:43:58Z

module/metrics/example/verification/main.go


 		// creates consensus cache for follower engine, and registers size method of backend for metrics
-		pendingBlocks := buffer.NewPendingBlocks()
+		pendingBlocks := buffer.NewPendingBlocks(0, 100_000)


⚠️ Potential issue | 🟡 Minor

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash # First, check if the file exists and examine lines around 111 and 176 cat -n module/metrics/example/verification/main.go | sed -n '105,120p' echo "---" cat -n module/metrics/example/verification/main.go | sed -n '170,185p'

Repository: onflow/flow-go

Length of output: 1348

🏁 Script executed:

#!/bin/bash # Search for NewPendingBlocks usage in the codebase to understand typical patterns echo "=== NewPendingBlocks usage patterns ===" rg 'NewPendingBlocks' --type=go -B 2 -A 2 echo "" echo "=== Error handling for pendingBlocks.Add() ===" rg '\.Add\(flow\.Slashable' --type=go -B 2 -A 2 | head -50 echo "" echo "=== Check what buffer.PendingBlocks signature is ===" fd 'pending.*\.go' | xargs grep -l 'NewPendingBlocks'

Repository: onflow/flow-go

Length of output: 5072

🏁 Script executed:

#!/bin/bash # Examine the PendingBlocks implementation and Add() method echo "=== PendingBlocks.Add() method signature and implementation ===" rg -A 20 'func \(pb \*PendingBlocks\) Add' module/buffer/pending_blocks.go echo "" echo "=== Check tests to see how Add() errors are handled ===" rg -B 5 -A 5 '\.Add\(' module/buffer/pending_blocks_test.go | head -60 echo "" echo "=== Check all production usages of Add() ===" rg '\.Add\(flow\.Slashable' --type=go -B 3 -A 3 --exclude-dir=test

Repository: onflow/flow-go

Length of output: 2519

🏁 Script executed:

#!/bin/bash # Get the Add() method signature more carefully echo "=== PendingBlocks.Add() method signature ===" grep -n "func.*Add.*flow.Slashable" module/buffer/pending_blocks.go echo "" sed -n '100,200p' module/buffer/pending_blocks.go | head -60 echo "" echo "=== All production usages of pendingBlocks.Add (not in test files) ===" rg 'pendingBlocks\.Add|proposals\.Add' --type=go | grep -v test | grep -v example echo "" echo "=== Check how compliance engine uses pending blocks ===" rg -A 10 'pending\.Add' engine/consensus/compliance/ --type=go | head -40

Repository: onflow/flow-go

Length of output: 3419

Handle error from Add() call.

Line 176: The error from pendingBlocks.Add() is currently ignored with _. While this is demo code, the Add() method can return BeyondActiveRangeError when a block exceeds the active view range. Even in probabilistic test scenarios, consider at least logging errors to surface potential issues.

Also applies to: 176-180

🤖 Prompt for AI Agents

module/metrics/example/verification/main.go around lines 111 and 176-180: the return value from pendingBlocks.Add(...) is ignored — capture the error, check if err != nil and log it with context (for example using log.Printf or similar project logger) and optionally handle BeyondActiveRangeError specially if it is expected (e.g., increment a counter or skip) otherwise surface or return the error; ensure you do not discard the error silently.

…8170-ln-perm-block-buffer

coderabbitai

Actionable comments posted: 1

Fix all issues with AI Agents 🤖

In @module/buffer/pending_blocks.go:
- Around line 43-67: The TODO in the GenericPendingBlocks documentation is
unfinished: clarify how the buffer limits unbounded memory growth from width
attacks or remove the placeholder; update the comment for GenericPendingBlocks
to state whether mitigation beyond activeViewRangeSize exists (e.g., per-parent
child caps, global entry limits, eviction policy, validation filtering, or
relying solely on LevelledForest pruning) and describe the implemented strategy
(referencing activeViewRangeSize and forest/LevelledForest) so readers know how
conflicting-block explosion is handled or that no extra width-limiting mechanism
is present.

🧹 Nitpick comments (2)

module/buffer/pending_blocks.go (2)
117-127: Consider safe type assertion.

Line 126 uses an unchecked type assertion vertex.(proposalVertex[T]) that could panic if the vertex type is unexpected. While this should never occur in correct usage, defensive coding would use the comma-ok idiom to handle unexpected types gracefully.
🔎 Proposed defensive type assertion
 func (b *GenericPendingBlocks[T]) ByID(blockID flow.Identifier) (flow.Slashable[*flow.GenericProposal[T]], bool) {
 	b.lock.Lock()
 	defer b.lock.Unlock()
 	vertex, ok := b.forest.GetVertex(blockID)
 	if !ok {
 		return flow.Slashable[*flow.GenericProposal[T]]{}, false
 	}
-	return vertex.(proposalVertex[T]).proposal, true
+	pv, ok := vertex.(proposalVertex[T])
+	if !ok {
+		// This should never happen; indicates a serious bug
+		return flow.Slashable[*flow.GenericProposal[T]]{}, false
+	}
+	return pv.proposal, true
 }
129-147: Consider safe type assertion in iterator loop.

Line 143 uses an unchecked type assertion vertex.(proposalVertex[T]) within the iterator loop. Similar to ByID, this could panic if an unexpected vertex type is encountered. Consider using the comma-ok idiom to handle this defensively, or at minimum, add a comment explaining why this assertion is safe.
🔎 Proposed defensive type assertion
 	children := make([]flow.Slashable[*flow.GenericProposal[T]], 0, n)
 	iterator := b.forest.GetChildren(parentID)
 	for iterator.HasNext() {
 		vertex := iterator.NextVertex()
-		children = append(children, vertex.(proposalVertex[T]).proposal)
+		pv, ok := vertex.(proposalVertex[T])
+		if !ok {
+			// This should never happen; indicates a serious bug
+			continue
+		}
+		children = append(children, pv.proposal)
 	}
 
 	return children, true

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 5108c97 and b001e9b.

📒 Files selected for processing (1)

module/buffer/pending_blocks.go

🧰 Additional context used

📓 Path-based instructions (2)

**/*.go

📄 CodeRabbit inference engine (.cursor/rules/coding_conventions.mdc)

Follow Go coding conventions as documented in @docs/agents/CodingConventions.md

Follow Go coding standards and conventions as documented in @docs/agents/GoDocs.md

**/*.go: Follow the existing module structure in /module/, /engine/, /model/ and use dependency injection patterns for component composition
Implement proper interfaces before concrete types
Follow Go naming conventions and the project's coding style defined in /docs/CodingConventions.md
Use mock generators: run make generate-mocks after interface changes
All inputs must be considered potentially byzantine; error classification is context-dependent and no code path is safe unless explicitly proven and documented
Use comprehensive error wrapping for debugging; avoid fmt.Errorf, use irrecoverable package for exceptions
NEVER log and continue on best effort basis; ALWAYS explicitly handle errors
Uses golangci-lint with custom configurations (.golangci.yml) and custom linters for Flow-specific conventions (struct write checking)

Files:

module/buffer/pending_blocks.go

{module,engine,cmd}/**/*.go

📄 CodeRabbit inference engine (AGENTS.md)

All major processing components must implement the Component interface from /module/component/component.go to ensure consistent lifecycle management and graceful shutdown patterns

Files:

module/buffer/pending_blocks.go

🧬 Code graph analysis (1)

module/buffer/pending_blocks.go (3)

model/flow/block.go (2)

HashablePayload (14-16)

GenericProposal (168-173)

module/forest/leveled_forest.go (2)

LevelledForest (24-29)

NewLevelledForest (51-57)

module/mempool/errors.go (1)

NewBeyondActiveRangeError (67-71)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (37)

GitHub Check: Lint (./integration/)
GitHub Check: Lint (./)
GitHub Check: Lint (./insecure/)
GitHub Check: Integration Tests Others (integration)
GitHub Check: Unit Tests Insecure (insecure/integration/functional/test/gossipsub/rpc_inspector)
GitHub Check: Unit Tests Insecure (insecure/integration/functional/test/gossipsub/scoring)
GitHub Check: Unit Tests Insecure (insecure)
GitHub Check: Unit Tests (module/dkg)
GitHub Check: Unit Tests (network/p2p/node)
GitHub Check: Unit Tests (network/p2p/connection)
GitHub Check: Unit Tests (network/p2p/scoring)
GitHub Check: Unit Tests (others)
GitHub Check: Unit Tests (module)
GitHub Check: Unit Tests (network/test/cohort1)
GitHub Check: Unit Tests (network)
GitHub Check: Unit Tests (network/p2p)
GitHub Check: Unit Tests (engine/collection)
GitHub Check: Unit Tests (network/test/cohort2)
GitHub Check: Unit Tests (engine/verification)
GitHub Check: Unit Tests (network/alsp)
GitHub Check: Unit Tests (engine/common)
GitHub Check: Unit Tests (cmd)
GitHub Check: Unit Tests (engine/execution)
GitHub Check: Unit Tests (engine/access)
GitHub Check: Unit Tests (state)
GitHub Check: Unit Tests (engine/execution/ingestion)
GitHub Check: Unit Tests (utils)
GitHub Check: Unit Tests (engine/execution/computation)
GitHub Check: Unit Tests (storage)
GitHub Check: Unit Tests (engine)
GitHub Check: Unit Tests (admin)
GitHub Check: Unit Tests (fvm)
GitHub Check: Unit Tests (consensus)
GitHub Check: Unit Tests (engine/consensus)
GitHub Check: Unit Tests (ledger)
GitHub Check: Docker Build
GitHub Check: analyze-code (go)

🔇 Additional comments (7)

module/buffer/pending_blocks.go (7)

13-26: LGTM: Clean vertex adapter with cached ID.

The proposalVertex wrapper efficiently adapts generic proposals to the forest model, caching the block ID to avoid repeated computation.

69-73: LGTM: Type aliases and interface checks.

The type aliases provide a clean API, and the compile-time interface compliance checks ensure type safety.

75-91: LGTM: Constructors now consistent.

Both constructors correctly initialize the forest with finalizedView + 1, and the comments clearly explain that LevelledForest.LowestLevel is inclusive.

149-157: LGTM: Correct pruning with clear offset documentation.

The method correctly adjusts the view parameter (+1) when calling PruneUpToLevel, and the comment clearly explains the inclusive/exclusive boundary semantics.

159-164: LGTM: Size reporting.

The method correctly reports the buffer size with appropriate thread safety.

166-171: LGTM: Helper with clear locking requirement.

The method correctly calculates the finalized view, and the CAUTION comment appropriately warns callers about the locking requirement.

93-115: The duplicate handling in LevelledForest.AddVertex correctly aligns with the documented behavior in Add(). The forest's AddVertex method already checks for duplicate vertices at lines 194-195 (if !f.isEmptyContainer(container) { return }) and only increments the size counter for new vertices. The code is working as intended, and no changes are required.

module/buffer/pending_blocks.go

coderabbitai

Actionable comments posted: 1

Fix all issues with AI Agents 🤖

In @engine/consensus/compliance/core_test.go:
- Line 203: The test mock for PendingBlockBuffer is out of date: Update the
cs.pending mock expectation to match the new PendingBlockBuffer.Add signature
(single argument `proposal`) and specify the returned error value; for example
replace the current cs.pending.On("Add", mock.Anything, mock.Anything) with a
single-arg expectation and chain a .Return(...) that returns an error or nil as
appropriate (e.g., .Return(nil)) so the test handles the Add error return
correctly and matches the interface.

♻️ Duplicate comments (1)

engine/consensus/compliance/core.go (1)

156-159: Incomplete documentation sentence.

The comment at lines 156-159 ends with an incomplete fragment: "This is particularly dangerous for" without completing the thought. This was previously flagged.

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 3326e63 and cb267a5.

📒 Files selected for processing (2)

engine/consensus/compliance/core.go
engine/consensus/compliance/core_test.go

🧰 Additional context used

📓 Path-based instructions (5)

**/*.go

📄 CodeRabbit inference engine (.cursor/rules/coding_conventions.mdc)

Follow Go coding conventions as documented in @docs/agents/CodingConventions.md

Follow Go coding standards and conventions as documented in @docs/agents/GoDocs.md

**/*.go: Follow the existing module structure in /module/, /engine/, /model/ and use dependency injection patterns for component composition
Implement proper interfaces before concrete types
Follow Go naming conventions and the project's coding style defined in /docs/CodingConventions.md
Use mock generators: run make generate-mocks after interface changes
All inputs must be considered potentially byzantine; error classification is context-dependent and no code path is safe unless explicitly proven and documented
Use comprehensive error wrapping for debugging; avoid fmt.Errorf, use irrecoverable package for exceptions
NEVER log and continue on best effort basis; ALWAYS explicitly handle errors
Uses golangci-lint with custom configurations (.golangci.yml) and custom linters for Flow-specific conventions (struct write checking)

Files:

engine/consensus/compliance/core.go
engine/consensus/compliance/core_test.go

{module,engine,cmd}/**/*.go

📄 CodeRabbit inference engine (AGENTS.md)

All major processing components must implement the Component interface from /module/component/component.go to ensure consistent lifecycle management and graceful shutdown patterns

Files:

engine/consensus/compliance/core.go
engine/consensus/compliance/core_test.go

{crypto,fvm,ledger,access,engine}/**/*.go

📄 CodeRabbit inference engine (AGENTS.md)

Cryptographic operations require careful handling; refer to crypto library documentation for proper implementation

Files:

engine/consensus/compliance/core.go
engine/consensus/compliance/core_test.go

{network,engine,consensus}/**/*.go

📄 CodeRabbit inference engine (AGENTS.md)

Network messages must be authenticated and validated

Files:

engine/consensus/compliance/core.go
engine/consensus/compliance/core_test.go

**/*_test.go

📄 CodeRabbit inference engine (AGENTS.md)

**/*_test.go: Unit tests should be co-located with the code they test
Follow the existing pattern of *_test.go files for test naming
Use fixtures for realistic test data as defined in /utils/unittest/

Files:

engine/consensus/compliance/core_test.go

🧠 Learnings (1)

📚 Learning: 2025-12-23T00:28:41.005Z

Learnt from: CR
Repo: onflow/flow-go PR: 0
File: AGENTS.md:0-0
Timestamp: 2025-12-23T00:28:41.005Z
Learning: Applies to **/*.go : All inputs must be considered potentially byzantine; error classification is context-dependent and no code path is safe unless explicitly proven and documented

Applied to files:

engine/consensus/compliance/core.go

🧬 Code graph analysis (1)

engine/consensus/compliance/core_test.go (1)

module/buffer/pending_blocks.go (1)

NewPendingBlocks (78-85)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (37)

GitHub Check: Lint (./)
GitHub Check: Lint (./insecure/)
GitHub Check: Lint (./integration/)
GitHub Check: Unit Tests (engine/common)
GitHub Check: Unit Tests (network/test/cohort2)
GitHub Check: Unit Tests (network)
GitHub Check: Unit Tests (network/test/cohort1)
GitHub Check: Unit Tests (engine/verification)
GitHub Check: Unit Tests (network/p2p)
GitHub Check: Unit Tests (engine/access)
GitHub Check: Unit Tests (others)
GitHub Check: Unit Tests (utils)
GitHub Check: Unit Tests (network/p2p/connection)
GitHub Check: Unit Tests (network/alsp)
GitHub Check: Unit Tests (admin)
GitHub Check: Unit Tests (engine/consensus)
GitHub Check: Unit Tests (network/p2p/node)
GitHub Check: Unit Tests (engine/execution/computation)
GitHub Check: Unit Tests (engine)
GitHub Check: Unit Tests (module/dkg)
GitHub Check: Unit Tests (storage)
GitHub Check: Unit Tests (module)
GitHub Check: Unit Tests (network/p2p/scoring)
GitHub Check: Unit Tests (engine/execution/ingestion)
GitHub Check: Unit Tests (fvm)
GitHub Check: Unit Tests (engine/execution)
GitHub Check: Unit Tests (consensus)
GitHub Check: Unit Tests (cmd)
GitHub Check: Unit Tests (state)
GitHub Check: Unit Tests (engine/collection)
GitHub Check: Unit Tests (ledger)
GitHub Check: Unit Tests Insecure (insecure)
GitHub Check: Unit Tests Insecure (insecure/integration/functional/test/gossipsub/scoring)
GitHub Check: Unit Tests Insecure (insecure/integration/functional/test/gossipsub/rpc_inspector)
GitHub Check: Integration Tests Others (integration)
GitHub Check: Docker Build
GitHub Check: analyze-code (go)

🔇 Additional comments (10)

engine/consensus/compliance/core_test.go (4)

223-223: LGTM!

The PruneByView mock correctly returns nil to match the new error-returning signature.

586-586: LGTM!

The real buffer instantiation correctly uses the new parameterized constructor with finalizedView and activeViewRangeSize, aligning with the updated NewPendingBlocks signature.

593-606: LGTM!

The per-proposal validation expectations correctly reflect the behavioral change where proposal validation now occurs before caching. Each proposal has a properly scoped .Once() expectation set immediately before processing.

637-642: LGTM!

The validation expectation for missingProposal is correctly added before processing, completing the validation setup for all proposals in the test scenario.

engine/consensus/compliance/core.go (6)

103-106: LGTM!

Proper error handling for the newly error-returning ProcessFinalizedBlock during initialization. This ensures any pruning failures are surfaced early.

235-272: Good security improvement: validation before caching.

Moving proposal validation before the caching step is a sound BFT resilience improvement. This prevents invalid proposals from consuming cache space and ensures only validated proposals enter the pending buffer.

The error handling is comprehensive:

Invalid proposals trigger slashing notifications and VoteAggregator updates

Unknown epoch errors are gracefully handled (assuming malicious far-future blocks)

Unexpected errors are properly propagated

287-296: LGTM!

The error handling for pending.Add correctly distinguishes between BeyondActiveRangeError (expected, logged at debug level) and unexpected errors (propagated with context). The defensive comment about potentially differing thresholds is helpful.

424-426: LGTM!

The hotstuffProposal is appropriately created upfront and used consistently for error wrapping in NewInvalidProposalErrorf and for VoteAggregator.AddBlock.

454-467: LGTM!

The refactored ProcessFinalizedBlock correctly:

Propagates errors from PruneByView

Updates finalized counters only after successful pruning

Always records mempool metrics

This aligns with the PR's goal of surfacing buffer errors in compliance engines.

299-303: LGTM!

The parent request logic correctly checks if the parent is already cached before issuing a sync request, preventing redundant network requests.

coderabbitai · 2026-01-05T15:42:08Z

engine/consensus/compliance/core_test.go

 	// set up pending module mock
 	cs.pending = &module.PendingBlockBuffer{}
-	cs.pending.On("Add", mock.Anything, mock.Anything).Return(true)
+	cs.pending.On("Add", mock.Anything, mock.Anything)


⚠️ Potential issue | 🟡 Minor

Missing mock return value for Add method.

The Add method now returns an error according to the new PendingBlockBuffer interface, but the mock expectation does not specify a return value. This could cause unexpected test behavior or panics if the code checks the error.

🔎 Proposed fix

- cs.pending.On("Add", mock.Anything, mock.Anything) + cs.pending.On("Add", mock.Anything).Return(nil)

Note: The method signature also appears to have changed to accept a single argument (proposal) rather than two arguments. Verify against the interface definition.

🤖 Prompt for AI Agents

In @engine/consensus/compliance/core_test.go around line 203, The test mock for PendingBlockBuffer is out of date: Update the cs.pending mock expectation to match the new PendingBlockBuffer.Add signature (single argument `proposal`) and specify the returned error value; for example replace the current cs.pending.On("Add", mock.Anything, mock.Anything) with a single-arg expectation and chain a .Return(...) that returns an error or nil as appropriate (e.g., .Return(nil)) so the test handles the Add error return correctly and matches the interface.

…8170-ln-perm-block-buffer

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

module/buffer/pending_blocks_test.go (1)
38-48: Consider adding a safety check to the view selection loop.

The loop at lines 40-45 increments the view until an unused one is found. While unlikely in practice, this could theoretically loop indefinitely if a large number of views are already taken. Consider adding a maximum iteration count as a safeguard.
🔎 Suggested enhancement
 func (suite *PendingBlocksSuite) blockWithParent(parent *flow.Header, usedViews map[uint64]struct{}) flow.Slashable[*flow.Proposal] {
 	block := unittest.BlockWithParentFixture(parent)
+	maxAttempts := 1000
 	for {
+		maxAttempts--
+		if maxAttempts == 0 {
+			suite.FailNow("unable to find unused view after 1000 attempts")
+		}
 		if _, hasForbiddenView := usedViews[block.View]; !hasForbiddenView {
 			break
 		}
 		block.View++
 	}
 	usedViews[block.View] = struct{}{} // add the block's view to `usedViews` to prevent future re-usage
 	return unittest.AsSlashable(unittest.ProposalFromBlock(block))
 }

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between cb267a5 and ad038a3.

📒 Files selected for processing (2)

module/buffer/pending_blocks.go
module/buffer/pending_blocks_test.go

🧰 Additional context used

📓 Path-based instructions (3)

**/*.go

📄 CodeRabbit inference engine (.cursor/rules/coding_conventions.mdc)

Follow Go coding conventions as documented in @docs/agents/CodingConventions.md

Follow Go coding standards and conventions as documented in @docs/agents/GoDocs.md

**/*.go: Follow the existing module structure in /module/, /engine/, /model/ and use dependency injection patterns for component composition
Implement proper interfaces before concrete types
Follow Go naming conventions and the project's coding style defined in /docs/CodingConventions.md
Use mock generators: run make generate-mocks after interface changes
All inputs must be considered potentially byzantine; error classification is context-dependent and no code path is safe unless explicitly proven and documented
Use comprehensive error wrapping for debugging; avoid fmt.Errorf, use irrecoverable package for exceptions
NEVER log and continue on best effort basis; ALWAYS explicitly handle errors
Uses golangci-lint with custom configurations (.golangci.yml) and custom linters for Flow-specific conventions (struct write checking)

Files:

module/buffer/pending_blocks.go
module/buffer/pending_blocks_test.go

{module,engine,cmd}/**/*.go

📄 CodeRabbit inference engine (AGENTS.md)

All major processing components must implement the Component interface from /module/component/component.go to ensure consistent lifecycle management and graceful shutdown patterns

Files:

module/buffer/pending_blocks.go
module/buffer/pending_blocks_test.go

**/*_test.go

📄 CodeRabbit inference engine (AGENTS.md)

**/*_test.go: Unit tests should be co-located with the code they test
Follow the existing pattern of *_test.go files for test naming
Use fixtures for realistic test data as defined in /utils/unittest/

Files:

module/buffer/pending_blocks_test.go

🧠 Learnings (1)

📚 Learning: 2025-12-23T00:28:41.005Z

Learnt from: CR
Repo: onflow/flow-go PR: 0
File: AGENTS.md:0-0
Timestamp: 2025-12-23T00:28:41.005Z
Learning: Applies to **/*_test.go : Use fixtures for realistic test data as defined in `/utils/unittest/`

Applied to files:

module/buffer/pending_blocks_test.go

🧬 Code graph analysis (2)

module/buffer/pending_blocks.go (3)

model/flow/block.go (2)

HashablePayload (14-16)

GenericProposal (168-173)

module/forest/leveled_forest.go (2)

LevelledForest (24-29)

NewLevelledForest (51-57)

module/mempool/errors.go (1)

NewBeyondActiveRangeError (67-71)

module/buffer/pending_blocks_test.go (3)

module/buffer/pending_blocks.go (2)

GenericPendingBlocks (66-70)

NewPendingBlocks (78-85)

utils/unittest/fixtures.go (4)

AsSlashable (309-315)

ProposalFromBlock (282-287)

BlockHeaderFixture (580-600)

IdentifierFixture (1151-1155)

module/mempool/errors.go (2)

IsBeyondActiveRangeError (82-85)

IsBelowPrunedThresholdError (55-58)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (37)

GitHub Check: Lint (./insecure/)
GitHub Check: Lint (./)
GitHub Check: Lint (./integration/)
GitHub Check: Integration Tests Others (integration)
GitHub Check: Unit Tests Insecure (insecure)
GitHub Check: Unit Tests Insecure (insecure/integration/functional/test/gossipsub/scoring)
GitHub Check: Unit Tests Insecure (insecure/integration/functional/test/gossipsub/rpc_inspector)
GitHub Check: Unit Tests (module)
GitHub Check: Unit Tests (module/dkg)
GitHub Check: Unit Tests (consensus)
GitHub Check: Unit Tests (network/test/cohort1)
GitHub Check: Unit Tests (engine/execution)
GitHub Check: Unit Tests (engine/common)
GitHub Check: Unit Tests (network)
GitHub Check: Unit Tests (network/p2p/connection)
GitHub Check: Unit Tests (network/p2p/node)
GitHub Check: Unit Tests (engine/execution/computation)
GitHub Check: Unit Tests (engine/execution/ingestion)
GitHub Check: Unit Tests (network/p2p)
GitHub Check: Unit Tests (state)
GitHub Check: Unit Tests (engine/verification)
GitHub Check: Unit Tests (network/test/cohort2)
GitHub Check: Unit Tests (network/alsp)
GitHub Check: Unit Tests (network/p2p/scoring)
GitHub Check: Unit Tests (engine/collection)
GitHub Check: Unit Tests (others)
GitHub Check: Unit Tests (utils)
GitHub Check: Unit Tests (cmd)
GitHub Check: Unit Tests (ledger)
GitHub Check: Unit Tests (engine/consensus)
GitHub Check: Unit Tests (fvm)
GitHub Check: Unit Tests (storage)
GitHub Check: Unit Tests (admin)
GitHub Check: Unit Tests (engine)
GitHub Check: Unit Tests (engine/access)
GitHub Check: Docker Build
GitHub Check: analyze-code (go)

🔇 Additional comments (12)

module/buffer/pending_blocks_test.go (5)

24-28: Excellent test setup using the appropriate fixtures.

The test suite properly initializes with finalized view 0 and no range limit, allowing individual tests to configure their own constraints. Good use of unittest fixtures as per coding guidelines.

100-120: Well-designed boundary test for active view range enforcement.

This test correctly verifies that blocks exceeding finalizedView + activeViewRangeSize are rejected with a BeyondActiveRangeError, and that the buffer remains empty. The use of mempool.IsBeyondActiveRangeError properly validates the error type.

241-284: Robust pruning test with randomized block structure.

The test builds a realistic tree structure with 10% unrelated blocks and 90% children of existing blocks, then verifies correct pruning behavior. The randomized approach provides good coverage of different tree topologies.

369-392: Good concurrent access testing.

The test verifies thread-safety by having 10 goroutines concurrently add blocks. The comment correctly notes that correctness depends on block() not returning duplicates (which it doesn't, thanks to unittest.BlockFixture() generating unique fixtures).

343-343: The max built-in function is fully supported. The project requires Go 1.25.0 (as specified in go.mod), which is well above the minimum version (Go 1.21) when max was introduced. No action needed.

module/buffer/pending_blocks.go (7)

13-41: Clean implementation of forest.Vertex interface.

The proposalVertex[T] wrapper correctly implements the forest.Vertex interface by providing VertexID(), Level(), and Parent() methods. The immutable struct directive and caching of the block ID in the constructor are good practices for performance.

43-70: Excellent documentation of memory exhaustion mitigation strategy.

The documentation clearly explains how the buffer prevents unbounded memory growth:

activeViewRangeSize limits depth (view range)

One proposal per view limits width (conflicting blocks at same view)

Reliance on sync engine for liveness if wrong proposal is cached

This addresses the security concerns around byzantine proposers mounting memory exhaustion attacks.

78-94: Constructors correctly handle LevelledForest's inclusive boundary.

Both constructors now consistently use finalizedView + 1 for the forest's lowestLevel, with comments explaining that LowestLevel is inclusive. This matches the inverse calculation in highestPrunedView() at line 176 (LowestLevel - 1). The symmetry confirms correct handling of the boundary semantics.

96-122: Add method correctly enforces view range and prevents width attacks.

The implementation properly:

Locks for thread safety

Checks activeViewRangeSize boundary (when > 0) and returns appropriate error

Enforces one-proposal-per-view by checking GetNumberOfVerticesAtLevel(blockView) == 0

Silently ignores duplicates and below-finalized blocks (no-op behavior as documented)

The boundary check at line 109 correctly accepts blocks at finalizedView + activeViewRangeSize (inclusive) and rejects beyond.

135-153: ByParentID efficiently retrieves children using forest iterator.

The method pre-allocates the slice with the correct capacity (GetNumberOfChildren) and uses the iterator pattern to collect children. This is efficient and correctly returns (nil, false) when no children exist.

155-163: PruneByView correctly handles forest's exclusive boundary.

The comment at line 161 correctly notes that PruneUpToLevel is exclusive, so view + 1 is passed. This ensures blocks at the specified view are pruned (inclusive pruning from caller's perspective). The error from the forest (likely BelowPrunedThresholdError) is properly propagated.

172-177: Helper method correctly calculates highest pruned view.

The calculation forest.LowestLevel - 1 is correct since LowestLevel represents the lowest unpruned view. The caution comment properly warns that callers must hold the lock, which all callers do.

make Proposal type generic

e6b4e8c

this should allow removing some type overhead in the BlockBuffer types

jordanschalm added 20 commits November 27, 2025 11:22

make Buffer definition generic to both proposal types

880557b

use leveled forest for block buffer

09eb86c

add docs, todos

46000ac

ai: remove bool return from pending blocks

d54e974

ai: remove DropByParent method

9530e1e

update mocks

7235f7a

add lower view boundary

e3acbf0

update comments

56be6f0

add mutex

760434b

check retained level error

99b181f

add missing return

3ab8dd5

update buffer

a4529e8

update mocks

b281bc0

fix tests for signature change

5dbfe91

bug: forest can get child count for nonextnt node

39f010a

ai: add tests for pending blocks suite

49a196b

adjust tests

b3b1460

remove some ai-generated tests and adjust implementation and docs

remove old buffer backend

beec64c

ai: add threshold also to mempool

e813f5c

it already exists in the compliance engine as well, but including in mempool provides stronger localized guarantee of behaviour

ai: address lint errors

e1bed68

jordanschalm requested review from durkmurder and tim-barry November 28, 2025 21:37

jordanschalm marked this pull request as ready for review November 28, 2025 21:40

jordanschalm requested a review from a team as a code owner November 28, 2025 21:40

jordanschalm commented Nov 28, 2025

View reviewed changes

module/buffer.go Outdated Show resolved Hide resolved

jordanschalm added 2 commits November 28, 2025 14:49

Apply suggestion from @jordanschalm

718a820

Merge branch 'master' into jord/8170-ln-perm-block-buffer

846a5f8

durkmurder reviewed Dec 1, 2025

View reviewed changes

durkmurder approved these changes Dec 1, 2025

View reviewed changes

Merge branch 'master' into jord/8170-ln-perm-block-buffer

292a175

tim-barry approved these changes Dec 9, 2025

View reviewed changes

Use HashablePayload as base generic type parameter

acccf49

jordanschalm added 2 commits December 11, 2025 13:52

Merge pull request #8248 from onflow/tim/8196-refactor-generic-types-…

7f89681

…experiment Refactor generic type parameters in BlockBuffer

Merge branch 'master' into jord/8170-ln-perm-block-buffer

e21fa82

jordanschalm commented Dec 13, 2025

View reviewed changes

Changed how compliance engine treats blocks in the cache. Updated godoc

37c9569

durkmurder self-assigned this Dec 18, 2025

durkmurder and others added 2 commits December 18, 2025 17:16

Merge branch 'master' of https://github.com/onflow/flow-go into jord/…

8fb4c54

…8170-ln-perm-block-buffer

documentation extensions

5108c97

coderabbitai bot reviewed Dec 22, 2025

View reviewed changes

durkmurder added 4 commits December 22, 2025 19:33

Merge branch 'master' into jord/8170-ln-perm-block-buffer

d350dbe

Fixed initialization for PendingClusterBlocks

df61acd

Merge branch 'master' of https://github.com/onflow/flow-go into jord/…

5279647

…8170-ln-perm-block-buffer

Updated godoc

b001e9b

coderabbitai bot reviewed Jan 5, 2026

View reviewed changes

module/buffer/pending_blocks.go Show resolved Hide resolved

durkmurder added 2 commits January 5, 2026 16:55

Updated pending blocks buffer to store one proposal per view

3326e63

Moved validation of proposal before adding it to the cache

cb267a5

coderabbitai bot reviewed Jan 5, 2026

View reviewed changes

durkmurder added 2 commits January 6, 2026 16:17

Fixed tests. Updated godoc

cffc38d

Merge branch 'master' of https://github.com/onflow/flow-go into jord/…

ad038a3

…8170-ln-perm-block-buffer

coderabbitai bot reviewed Jan 6, 2026

View reviewed changes

Improve BFT resilience of BlockBuffer stores #8196

Are you sure you want to change the base?

Improve BFT resilience of BlockBuffer stores #8196

Uh oh!

Conversation

jordanschalm commented Nov 26, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

github-actions bot commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Dependency Review

Scanned Files

Uh oh!

codecov-commenter commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

durkmurder Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

tim-barry Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

durkmurder left a comment

Choose a reason for hiding this comment

Uh oh!

tim-barry left a comment

Choose a reason for hiding this comment

Uh oh!

tim-barry commented Dec 9, 2025

Uh oh!

jordanschalm Dec 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Specific problematic case:

Uh oh!

durkmurder Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot commented Dec 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related issues

Suggested reviewers

Poem

Pre-merge checks and finishing touches

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Dec 22, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot Dec 22, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Jan 5, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

jordanschalm commented Nov 26, 2025 •

edited by coderabbitai bot

Loading

github-actions bot commented Nov 26, 2025 •

edited

Loading

codecov-commenter commented Nov 26, 2025 •

edited

Loading

jordanschalm Dec 13, 2025 •

edited

Loading

coderabbitai bot commented Dec 22, 2025 •

edited

Loading