-
Notifications
You must be signed in to change notification settings - Fork 768
Description
PodDisruptionBudget per node type (master, data, injest, ...)
If we are performing maintenance on a kubernetes cluster (cordon, drain, updates/upgrades, then reboot) an ECK cluster can end up blocking the upgrade for a lengthy period of time while the ECK cluster goes into yellow status while a data node is removed from service but replaced on a separate node. This disruption often causes re-balancing of shards and now the Elastic PDB prevents other ECK cluster pods from being evicted even though they may not have an impact on data loss such as a master pod. If we had a PDB per node type we could mitigate some of the nodes that would be blocked while waiting for the ECK cluster to return to healthy green state.
It also appears that regardless of the underlying data redundancy the cluster is limited to a single disruption at this point.