HDDS-12135. Set RM default deadline to 12 minutes and the datanode offset to 6 minutes #7747

Merged 1 commit on Jan 26, 2025

sodonnel

What changes were proposed in this pull request?

We recently found that delete commands can run for a long time once picked off the queue. With the default 10 minute deadline on SCM and a datanode deadline only 30 seconds shorter, commands that are still running on a datanode can be seen as expired in SCM.

This PR makes the defaults less aggressive: an SCM / RM timeout of 12 minutes and a datanode offset of 6 minutes, which gives a datanode timeout of 6 minutes. That way, commands have longer to be processed before RM resends them.
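To make the arithmetic concrete, here is a minimal sketch of how the two deadlines relate under the old and new defaults. It is illustrative Python, not Ozone code: the constant and function names are hypothetical, and it assumes the datanode deadline is the SCM deadline minus the offset and that a datanode only checks the deadline when it picks a command off the queue.

```python
from datetime import datetime, timedelta, timezone

# Defaults described in this PR. The names are hypothetical, not Ozone identifiers.
OLD_SCM_TIMEOUT = timedelta(minutes=10)
OLD_DATANODE_OFFSET = timedelta(seconds=30)
NEW_SCM_TIMEOUT = timedelta(minutes=12)
NEW_DATANODE_OFFSET = timedelta(minutes=6)


def deadlines(sent_at, scm_timeout, datanode_offset):
    """Return (scm_deadline, datanode_deadline) for a command sent at sent_at.

    Assumption: the datanode deadline is the SCM deadline minus the offset.
    """
    scm_deadline = sent_at + scm_timeout
    datanode_deadline = scm_deadline - datanode_offset
    return scm_deadline, datanode_deadline


def worst_case_run_window(scm_timeout, datanode_offset):
    """Time left before SCM expiry for a command started right at the datanode deadline.

    Assumption: the datanode only checks the deadline when it dequeues a command,
    so a command started just before its datanode deadline keeps running.
    """
    sent_at = datetime(2025, 1, 26, 12, 0, tzinfo=timezone.utc)
    scm_deadline, datanode_deadline = deadlines(sent_at, scm_timeout, datanode_offset)
    return scm_deadline - datanode_deadline


if __name__ == "__main__":
    print("old window:", worst_case_run_window(OLD_SCM_TIMEOUT, OLD_DATANODE_OFFSET))
    print("new window:", worst_case_run_window(NEW_SCM_TIMEOUT, NEW_DATANODE_OFFSET))
```

Under these assumptions, a command that starts at the last moment the datanode will accept it had only 30 seconds to finish before SCM considered it expired; with the new defaults it has 6 minutes.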

With the throttling that RM employs, there should not be a large number of commands on the queue anyway, as the goal of RM is to schedule only the number of commands which can be processed in a heartbeat or two.

Related Jiras: HDDS-12127, HDDS-12115, HDDS-12114

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-12135

How was this patch tested?

Simple config change. No new tests added or modified.

adoroszlai merged commit d7616ec into apache:master on Jan 26, 2025
42 checks passed
@adoroszlai

Thanks @sodonnel for the patch.
