Description
I have the following setup: 2 consumers of different topics inside the same consumer group, distributed among 3 nodes and using partition_assignment_strategy=callback_implemented.
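A minimal sketch of such a setup, assuming brod's `brod_group_subscriber_v2` config map. The client name, callback module, group ID, and topic names are placeholders, not taken from the actual deployment:

```erlang
%% Hypothetical names throughout: my_client, my_cb_module, <<"my-group">>.
%% Each of the two consumers would call this with its own topic.
subscriber_config(Topic) ->
  #{ client       => my_client
   , group_id     => <<"my-group">>
   , topics       => [Topic]
   , cb_module    => my_cb_module
     %% Partition assignment is delegated to the callback module.
   , group_config => [{partition_assignment_strategy, callback_implemented}]
   }.
```

Each subscriber would then be started with something like `brod_group_subscriber_v2:start_link(subscriber_config(<<"topic-a">>))`.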
Everything works great but there is 1 thing that worries me. During a shutdown, I can consistently see the following statement printed out on different nodes:
group_subscriber_v2 *group-id* failed to flush commits before termination :timeout
This is logged as an error so I treat it as an abnormal execution.
This seems to be a safety mechanism to prevent the call to the group coordinator from hanging forever:
brod/src/brod_group_subscriber_v2.erl, line 404 in e18151c:
ok = flush_offset_commits(GroupId, Coordinator),
Could it be related to the use of the callback-implemented partition assignment strategy? For example, the original group leader has already shut down, a new one is elected and starts doing preparatory work, and that is when the call to flush offsets comes in.
Are there any logs/other information I could provide to simplify the investigation?