Disaggregated coordinators with strong consistency via leader election #23692

tdcmeehan · 2024-09-20T20:05:56Z

tdcmeehan
Sep 20, 2024
Maintainer

There are two recent developments that make me think we can improve on disaggregated coordinators architecture.

As background, originally, one of the design goals for disaggregated coordinators was to avoid any additional infrastructure. For example, we didn't want to introduce a dependency on Zookeeper, because this would increase the infrastructure requirements for anyone who wanted to use the feature.

However, there are two recent developments that I think change the picture:

With Amazon S3 announcing support for conditional writes, now all major cloud storage providers have the ability to provide distributed locks: GCS and Azure through compare-and-swap, and S3 with conditional writes. For on-prem installations, Ceph supports conditional PUT, and MinIO also supports conditional writes. So, there's an argument to be made that at this point, it's difficult to not find a storage platform that doesn't provide the ability to let you use conditional locks, and because distributed storage is essentially a requirement for most use cases of Presto, we could implement leader election using distributed storage. Of course, the mechanism to establish the leader would be encapsulated behind a plugin, and potentially other mechanisms like Zookeeper or etcd could be implemented as well.
The PUT API recently introduced could be used by the leader, established by the route above, to distribute query execution management amongst multiple coordinators.

There are still some questions to figure out, such as where to place the discovery server and whether or not we will still need a resource manager. One thought is that the leader takes on the responsibility of the resource manager. Or we could keep that process as is. If the leader takes on the responsibility of the resource manager, then the discovery server would need to redirect nodes to the leader in case of a failover.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Presto

Disaggregated coordinators with strong consistency via leader election #23692

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 0 comments

Select a reply

Presto

Disaggregated coordinators with strong consistency via leader election #23692

tdcmeehan Sep 20, 2024 Maintainer

Replies: 0 comments

tdcmeehan
Sep 20, 2024
Maintainer