DLX logs "Cannot forward any dead-letter messages from source quorum queue" #12626
-
Potentially related: in some of the clusters, node restarts (with 3-member quorum queues on 5 brokers) don't go through, and the log says that queues would shut down if certain nodes were taken offline.
-
@timbuchwaldt we cannot help given the amount of information provided. We do not guess in this community and certainly won't guess what your topology looks like, roughly what your clients do, and what else may be in the logs.
-
The error message mentions a few possible scenarios. DLX could not do its job, for one reason or another. Maybe the topology was changed concurrently. Maybe a node was restarted and this was one of the effects of what #12412 seeks to address w.r.t. policy application. Dead lettering is not magic: it depends on routing topologies, like any other publisher. And topologies can change during upgrades, in particular when classic queues are involved or policies are changed concurrently with node restarts.
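To make the routing dependency concrete, here is a minimal Python model of how a dead-lettered message resolves to target queues. It is a sketch, not RabbitMQ code: the function name `resolve_dead_letter_targets` and its data shapes are hypothetical, and every named exchange is treated as a direct exchange for simplicity. The one fact it relies on is that the default exchange (`''`) routes to the queue whose name equals the routing key.

```python
from typing import Dict, List, Set, Tuple


def resolve_dead_letter_targets(
    dlx: str,
    routing_key: str,
    available_queues: Set[str],
    bindings: Dict[str, List[Tuple[str, str]]],
) -> List[str]:
    """Return the available queues a dead-lettered message would reach.

    Hypothetical model: `bindings` maps an exchange name to a list of
    (binding_key, queue_name) pairs, and `available_queues` holds the
    queues that currently exist and are reachable.
    """
    if dlx == "":
        # Default exchange: routes to the queue named by the routing key.
        candidates = [routing_key]
    else:
        # Simplifying assumption: exact-match (direct exchange) semantics.
        candidates = [
            queue for key, queue in bindings.get(dlx, []) if key == routing_key
        ]
    # A target that is missing or unavailable cannot receive the message.
    return [queue for queue in candidates if queue in available_queues]
```

With the values from the warning in this report (exchange `''`, routing key `input-pending`), the result is empty whenever the queue `input-pending` is missing or unavailable, which is the situation the warning describes.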
-
Describe the bug
We see the following warning upon broker restart:
2024-10-31 10:43:15.411289+00:00 [warning] <0.3158.0> Cannot forward any dead-letter messages from source quorum queue 'input-pending-retry' in vhost 'my-vhost' with configured dead-letter-exchange exchange '' in vhost 'my-vhost' and configured dead-letter-routing-key 'input-pending'. This can happen either if the dead-letter routing topology is misconfigured (for example no queue bound to dead-letter-exchange or wrong dead-letter-routing-key configured) or if non-mirrored classic queues are bound whose host node is down. Fix this issue to prevent dead-lettered messages from piling up in the source quorum queue. This message will not be logged again.
This occurs as soon as a message is stuck in said queue. When we wipe the policies that configure this behavior, previously dead-lettered messages (which are counted only under total, not under ready or unacked) get dropped, but the problem recurs.
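One way to narrow this down is to confirm that the dead-letter target actually exists after a restart. The sketch below queries the management HTTP API endpoint `GET /api/queues/{vhost}/{name}`. It assumes the management plugin is enabled; the base URL and credentials in the usage note are placeholders, and the helper names `queue_url` and `queue_exists` are made up for this example.

```python
import base64
import urllib.error
import urllib.parse
import urllib.request


def queue_url(base_url: str, vhost: str, queue: str) -> str:
    """Build the management API URL for a single queue."""
    return "{}/api/queues/{}/{}".format(
        base_url.rstrip("/"),
        # Percent-encode fully so that a vhost such as "/" becomes "%2F".
        urllib.parse.quote(vhost, safe=""),
        urllib.parse.quote(queue, safe=""),
    )


def queue_exists(base_url: str, vhost: str, queue: str,
                 user: str, password: str) -> bool:
    """Return True if the queue is known to the cluster, False on HTTP 404."""
    request = urllib.request.Request(queue_url(base_url, vhost, queue))
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    request.add_header("Authorization", "Basic " + token)
    try:
        with urllib.request.urlopen(request, timeout=10) as response:
            return response.status == 200
    except urllib.error.HTTPError as err:
        if err.code == 404:
            return False
        raise  # Authentication and server errors are not "queue missing".
```

For the queue in the warning, a call would look like `queue_exists("http://localhost:15672", "my-vhost", "input-pending", "guest", "guest")`. A `True` result only shows that the queue is declared; it does not prove that its members are online.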
Reproduction steps
I was unable to reproduce this outside of the clusters that show this behaviour. I suspect this is connected to those clusters having lived quite a few days and seen a few upgrades.
Expected behavior
Dead-lettering works across upgrades
Additional context
Clusters of 3 and of 5 nodes show this behavior, all running the current 4.0.3 release.