WIP lgalloc limiter #32602

antiguru · 2025-05-28T13:52:32Z

Signed-off-by: Moritz Hoffmann [email protected]

Part of MaterializeInc/database-issues/issues/9306

Motivation

Tips for reviewer

Checklist

This PR has adequate test coverage / QA involvement has been duly considered. (trigger-ci for additional test/nightly runs)
This PR has an associated up-to-date design doc, is a design doc (template), or is sufficiently small to not require a design.
If this PR evolves an existing $T ⇔ Proto$T mapping (possibly in a backwards-incompatible way), then it is tagged with a T-proto label.
If this PR will require changes to cloud orchestration or tests, there is a companion cloud PR to account for those changes that is tagged with the release-blocker label (example).
If this PR includes major user-facing behavior changes, I have pinged the relevant PM to schedule a changelog post.

Signed-off-by: Moritz Hoffmann <[email protected]>

bkirwi

(Sorry, started commenting on this before I realized it was still in draft! Submitting since it's already typed out, but feel free to ignore.)

bkirwi · 2025-05-30T14:55:25Z

src/compute/src/lgalloc.rs

+
+            self.tx
+                .send(Update::DiskLimit(Some(disk_limit)))
+                .expect("Sender exists");


Suggested change

-                .expect("Sender exists");
+                .expect("Receiver exists");

(Which is what you have on the first send call, and seems more accurate.)
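The reasoning behind this suggestion can be checked in isolation: a channel `send` only fails once the receiving half is gone, so the `expect` message should name the receiver. This is a minimal sketch that uses `std::sync::mpsc` as a stand-in; the channel type actually used in the PR is not visible in this hunk, and `try_send` is a hypothetical helper.

```rust
use std::sync::mpsc;

/// Hypothetical helper: returns true if sending on the channel succeeds.
fn try_send(tx: &mpsc::Sender<u32>, value: u32) -> bool {
    tx.send(value).is_ok()
}

fn main() {
    let (tx, rx) = mpsc::channel::<u32>();

    // While the receiver is alive, `send` succeeds.
    assert!(try_send(&tx, 1));

    // Once the receiver is dropped, `send` returns an error. The sender
    // still exists at this point, so "Receiver exists" is the invariant
    // that an `expect` on `send` actually relies on.
    drop(rx);
    assert!(!try_send(&tx, 2));
}
```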

bkirwi · 2025-05-30T15:01:16Z

src/compute/src/lgalloc.rs

+    Interval(Duration),
+    DiskLimit(Option<usize>),
+    BurstBudget(usize),
+}


Sending these as two/three separate messages every time the config changes leads to some odd transient states -- for example, we can briefly have the disk-limit re-enabled but be using a stale value for the burst budget. May not matter, but seems easier to reason about if this were a struct and all configs were updated atomically?
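A rough sketch of the struct-based alternative, assuming a plain `std::sync::mpsc` channel. `LimiterConfig` and `latest_config` are hypothetical names, and the field types are copied from the `Update` variants in the hunk above:

```rust
use std::sync::mpsc;
use std::time::Duration;

/// Hypothetical replacement for the per-field `Update` enum: one message
/// carries the complete limiter configuration.
#[derive(Debug, Clone, PartialEq)]
struct LimiterConfig {
    interval: Duration,
    disk_limit: Option<usize>,
    burst_budget: usize,
}

/// Drains all pending configs and returns the most recent one, if any.
/// The receiver only ever observes complete configurations.
fn latest_config(rx: &mpsc::Receiver<LimiterConfig>) -> Option<LimiterConfig> {
    rx.try_iter().last()
}

fn main() {
    let (tx, rx) = mpsc::channel();

    // Two config changes arrive before the limiter wakes up.
    tx.send(LimiterConfig {
        interval: Duration::from_secs(10),
        disk_limit: None,
        burst_budget: 0,
    })
    .expect("Receiver exists");
    tx.send(LimiterConfig {
        interval: Duration::from_secs(10),
        disk_limit: Some(1 << 30),
        burst_budget: 1 << 20,
    })
    .expect("Receiver exists");

    // The limiter applies the newest config as a unit, so the re-enabled
    // disk limit can never be paired with a stale burst budget.
    let config = latest_config(&rx).expect("a config was sent");
    assert_eq!(config.disk_limit, Some(1 << 30));
    assert_eq!(config.burst_budget, 1 << 20);
}
```

With per-field messages the receiver could wake up between two sends and act on a half-applied configuration. A single struct removes that window at the cost of resending unchanged fields.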

teskje · 2025-06-10T13:12:45Z

src/clusterd/src/lib.rs

@@ -354,6 +355,7 @@ async fn run(args: Args) -> Result<(), anyhow::Error> {
        ComputeInstanceContext {
            scratch_directory: args.scratch_directory,
            worker_core_affinity: args.worker_core_affinity,
+            announce_memory_limit: args.announce_memory_limit,


I am confused about this being called announce_memory_limit instead of just memory_limit. Is there a good reason? If not, it's probably too much of a hassle to change the command-line option, but we can change the internal variable names.

teskje · 2025-06-10T13:13:58Z

src/compute-types/src/dyncfgs.rs

+    "Multiplicative bias to lgalloc_limiter_usage_factor.",
+);
+
+/// Bias to the lgalloc limiter usage factor.


Suggested change

-/// Bias to the lgalloc limiter usage factor.
+/// Burst factor to disk limit.

teskje · 2025-06-10T13:14:12Z

src/compute-types/src/dyncfgs.rs

+
+/// Bias to the lgalloc limiter usage factor.
+pub const LGALLOC_LIMITER_BURST_FACTOR: Config<f64> = Config::new(
+    "lgalloc_limiter_BURST_FACTOR",


Suggested change

-    "lgalloc_limiter_BURST_FACTOR",
+    "lgalloc_limiter_burst_factor",

teskje · 2025-06-10T13:24:22Z

src/compute/src/lgalloc.rs

+/// but it might delete previous metrics. If we ever want to change this, we should
+/// remove the shared static mutex and make this function return a handle to the metrics.
+///
+/// This function is async, because it needs to be called from a tokio runtime context.


I'm wondering how much value this pattern adds. You add some noise by needing the async keyword, the allow, and this comment. You get protection against calling this function outside an async (not necessarily tokio!) context, but I think you also have this protection through CI, as spawning a task would immediately panic if no tokio runtime was available.

If we want to use this pattern here, shouldn't Limiter::create_task also be async?

teskje · 2025-06-10T13:28:11Z

src/compute/src/lgalloc.rs

+///
+/// This function is async, because it needs to be called from a tokio runtime context.
+#[allow(clippy::unused_async)]
+pub async fn register_metrics_into(


I think this should be called run_limiter or somesuch. A call to "register metrics" spawning a non-metrics task in the background is surprising. A call to "run limiter" running a limiter that also registers metrics into the provided registry is not.

teskje · 2025-06-10T13:30:14Z

src/compute/src/lgalloc.rs

+        // Get lgalloc stats and obtain the disk utilization from file stats, summed across all
+        // files and size classes. Compare the disk utilization against the configured disk limit,
+        // and if it exceeds the limit, reduce the burst budget by the amount of disk utilization
+        // that exceeds the limit. If the burst budget is exhausted, we will not allow any more disk
+        // access and terminate the process.


Nit: This fits better in the module docstring imo.
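For reference, the accounting described in the quoted comment can be sketched as a pure function. This is an illustration under assumptions, not the PR's code: `Verdict` and `check` are hypothetical names, the budget is reduced by the raw excess on every check (any scaling by the check interval is left out), and "exhausted" is taken to mean that the excess is larger than the remaining budget.

```rust
/// Outcome of one limiter check (hypothetical type).
#[derive(Debug, PartialEq)]
enum Verdict {
    /// Keep running; carries the remaining burst budget.
    Continue(usize),
    /// The burst budget cannot cover the excess; the caller should terminate.
    Terminate,
}

/// One limiter step: compare utilization against the limit and charge any
/// excess to the burst budget.
fn check(disk_utilization: usize, disk_limit: usize, burst_budget: usize) -> Verdict {
    // Utilization at or below the limit costs nothing.
    let excess = disk_utilization.saturating_sub(disk_limit);
    match burst_budget.checked_sub(excess) {
        Some(remaining) => Verdict::Continue(remaining),
        None => Verdict::Terminate,
    }
}

fn main() {
    let disk_limit = 1_000;
    let mut budget = 100;

    // Utilization samples from successive checks (made-up numbers).
    for utilization in [900, 1_050, 1_040, 1_020] {
        match check(utilization, disk_limit, budget) {
            Verdict::Continue(remaining) => {
                budget = remaining;
                println!("utilization {utilization}: budget left {budget}");
            }
            Verdict::Terminate => {
                println!("utilization {utilization}: burst budget exhausted");
                break;
            }
        }
    }
}
```

Keeping the decision separate from `std::process::exit` makes the accounting testable without terminating the test process.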

teskje · 2025-06-10T13:36:53Z

src/compute-types/src/dyncfgs.rs

+/// Bias to the lgalloc limiter usage factor.
+pub const LGALLOC_LIMITER_BURST_FACTOR: Config<f64> = Config::new(
+    "lgalloc_limiter_BURST_FACTOR",
+    0.5,


0.5 seems like a too-small value, assuming it means you get "0.5 x limit byte-seconds" of burst. If a hydration takes 10 minutes, that's less than 0.1% of additional disk you can use.

In any case, we should probably disable bursting by default, and make switching it on a conscious decision. Then we can also decide how much bursting we need based on the use case we want to enable.

antiguru · Jun 10, 2025

Yep, it should be disabled by default.
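The "less than 0.1%" estimate in this thread can be reproduced with a short calculation. This sketch assumes, as the comment does, that the burst budget is `burst_factor x limit` byte-seconds and that the overage is sustained evenly over the whole hydration; `sustained_overage_fraction` is a hypothetical helper.

```rust
/// Fraction of the disk limit that can be used on top of the limit for the
/// whole duration, given a budget of `burst_factor * limit` byte-seconds.
fn sustained_overage_fraction(burst_factor: f64, duration_secs: f64) -> f64 {
    // budget / duration = (burst_factor * limit) / duration bytes of
    // sustained overage; dividing by the limit leaves this ratio.
    burst_factor / duration_secs
}

fn main() {
    // A burst factor of 0.5 spread over a 10 minute (600 s) hydration.
    let fraction = sustained_overage_fraction(0.5, 600.0);
    println!("{:.4}% of the limit", fraction * 100.0);
    assert!(fraction < 0.001); // below 0.1%
}
```

The result is roughly 0.083% of the limit, which matches the estimate above.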

teskje · 2025-06-10T13:43:30Z

src/compute/src/lgalloc.rs

+                        disk_utilization,
+                        disk_limit
+                    );
+                    std::process::exit(1);


We need to check what implications exiting with code 1 has. It might trigger alerts in our monitoring or show the replica as crashed due to an error in the console. Ideally we want this case to look the same as an OOD scenario.

Also, we should log a warning, not an error. Errors will produce noise in Sentry we don't want.

WIP lgalloc limiter

24ab24b

Signed-off-by: Moritz Hoffmann <[email protected]>

bkirwi reviewed May 30, 2025


teskje reviewed Jun 10, 2025

