v24.2.25
Bug Fixes
- Allow partition balancing to opearte in case when space management was enabled, but local target capacity was unset. by @ztlpn in #26305
- Enable TCP keepalive for cloud storage connections. by @Lazin in #26410
- Fix Redpanda crash if
partition_autobalancing_concurrent_moves
was set to 0. by @ztlpn in #26305 - When Tiered Storage is paused and data is allowed to expire from local storage there will be gaps between last offset in tiered storage and first offset in local storage. If local storage was truncated in the middle of a segment (i.e. time based retention or via trim-prefix/delete records commands) tiered storage might get stuck with the following exception:
Failed to schedule upload: std::runtime_error (ntp {kafka/foo/0}: log offset N is outside the translation range (starting at M > N))
. Fix this by adjusting upload start offset to the first available and valid offset. Although we might have a bit more data in the segment, other information about that data (i.e. offset translation) is gone with prefix truncation. by @nvartolomei in #26064 partition_autobalancing_mode=off
now stops on-demand partition rebalance as well. by @ztlpn in #26305
Improvements
- In AlterPartitionReassignmentsResponse per-partition response REASSIGNMENT_IN_PROGRESS error code is used if a reassignment is requested while Partition Balancer is moving partition replicas. by @bashtanov in #26350
- #26135 Swap out an internal data structure in the
storage
layer to prevent oversized allocations and crashes when a large number ofsegment
s are present in apartition
. by @WillemKauf in #26138 rpk transform
now uses the tinygo v37 to compile golang to Wasm. by @r-vasquez in #26217- rpk debug bundle: improve reliability of debug bundle collection in k8s environments. by @r-vasquez in #26214
- PR #26010 [v24.2.x] create STMs based on original topic cfg by @bashtanov
- PR #26340 [v24.2.x] c/partition_manager: added log entries when partition is being shutdown by @mmaslankaprv
- PR #26344 [v24.2.x] raft/test/leadership_transfer_delay: increase tolerance by @bashtanov
- PR #26345 [v24.2.x] make ntp_callbacks actually support multiple callbacks by @bashtanov
- PR #26354 Revert "[v24.2.x] raft/log_eviction_stm: avoid unnecessary wait on visible offset" by @bharathv
- PR #26420 [v24.2.x] Fix archival STM shutdown race by @bashtanov
- PR #26428 [backport] [v24.2.x] raft/c: warn on struck truncation. by @bharathv
- PR #26442 [v24.2.x]
storage
: callreserve()
instorage::range()
by @WillemKauf
Full Changelog: v24.2.24...v24.2.25