Replies: 2 comments
I think this question should be posted on the Kafka channels (i.e. the Kafka users mailing list), because it does not appear to be related to Strimzi.
Hi, thanks for the reply.
Hi,
I have an unusual situation: a cluster running Kafka 3.5.1 in Strimzi
where 4 of the __consumer_offsets partitions have dropped below min
ISR.
Everything else appears to be working fine.
Upon investigating, I've found that the partition followers appear to be
out of sync with the leader in terms of leader epoch.
For example the leader-epoch-checkpoint file on the leader partition is
0
4
0 0
1 4
4 6
27 10
while the followers are
0
5
0 0
1 4
4 6
5 7
6 9
To me this looks like the followers are 2 elections ahead of the leader,
and I'm not sure how they got into this situation.
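
For anyone who wants to compare the two files programmatically, here is a minimal sketch. It assumes the checkpoint layout is a version line, an entry-count line, and then one "epoch start_offset" pair per line, which matches the counts (4 and 5) in the files above. The helper names parse_checkpoint and first_divergence are made up for illustration and are not part of any Kafka tooling.

```python
from typing import List, Optional, Tuple

# Contents copied from the leader and follower files quoted above.
LEADER = "0\n4\n0 0\n1 4\n4 6\n27 10\n"
FOLLOWER = "0\n5\n0 0\n1 4\n4 6\n5 7\n6 9\n"


def parse_checkpoint(text: str) -> List[Tuple[int, int]]:
    """Parse checkpoint text into (epoch, start_offset) pairs.

    Assumed layout: line 1 = version, line 2 = entry count, rest = entries.
    """
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    count = int(lines[1])
    entries = []
    for ln in lines[2:]:
        epoch, start_offset = ln.split()
        entries.append((int(epoch), int(start_offset)))
    if len(entries) != count:
        raise ValueError(f"expected {count} entries, found {len(entries)}")
    return entries


def first_divergence(a: List[Tuple[int, int]],
                     b: List[Tuple[int, int]]) -> Optional[int]:
    """Index of the first entry where the two epoch histories differ.

    Returns None when the histories are identical.
    """
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    return None if len(a) == len(b) else min(len(a), len(b))


leader = parse_checkpoint(LEADER)
follower = parse_checkpoint(FOLLOWER)
idx = first_divergence(leader, follower)
print("latest leader epoch:", leader[-1][0])
print("latest follower epoch:", follower[-1][0])
print("histories diverge at entry index:", idx)
```

With the files above, the histories agree for the first three entries and diverge at the fourth, where the leader records epoch 27 and the followers record epoch 5.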
I've attempted to force a new leader election via kafka-leader-election,
but it refused for both the PREFERRED and UNCLEAN election types.
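
For reference, a minimal sketch of building the partition list that the election tool accepts through its --path-to-json-file option. The partition numbers below are placeholders, not the actual affected partitions, and the helper name election_json is hypothetical.

```python
import json
from typing import Iterable


def election_json(topic: str, partitions: Iterable[int]) -> str:
    """Build the JSON body listing the topic partitions to elect leaders for."""
    payload = {
        "partitions": [{"topic": topic, "partition": p} for p in partitions]
    }
    return json.dumps(payload, indent=2)


# Placeholder partition numbers, for illustration only.
body = election_json("__consumer_offsets", [3, 11, 19, 42])
print(body)

# Save the output as election.json, then run something like the following,
# with the bootstrap address replaced by your own:
#   kafka-leader-election.sh --bootstrap-server <host:port> \
#       --election-type UNCLEAN --path-to-json-file election.json
```

Limiting the election to the affected partitions keeps an UNCLEAN attempt from touching the rest of the cluster.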
I've also tried a manual partition reassignment to move the leader to another
broker, but it won't do it.
What is even stranger is that if I watch the leader-epoch-checkpoint
file on one of the followers, I can see it constantly changing as it tries
to sort itself out.
I have tried manually removing the follower's partition files on disk in an
attempt to get it to sync from the leader, but it keeps returning to the
inconsistent state.
Restarting the broker that hosts the partition leader doesn't seem to move
leadership either.
The follower keeps logging the following constantly
Any ideas on how to investigate this further, and why it happens?
Thanks
Cristian