[NBS] Asynchronous device cleanup (deallocation) and synchronous allocation for local disks #2945

ya-ksgamora · 2025-01-29T13:37:23Z

Currently, legacy local disks in Compute Node work as follows:

There is a Local Disks Controller from which the node allocates/deallocates disks. The deallocation operation is always instant - it returns the disk to the controller, which then begins to clean it asynchronously. The allocation operation is blocking: the controller first tries to provide clean disks, and if only non-wiped disks are available, it blocks allocation until they are wiped.

Due to this approach, in the vast majority of cases, instances with local disks are created and deleted instantly, and only in a small number of cases do we get prolonged creation times while waiting for disk wiping.

Local disks over NBS work differently:

Creation is always instant because only clean devices are selected. Deletion is always slow because the disk deletion process (from the deletion operation perspective) includes devices cleaning. At this point, by switching to local disks over NBS, we're degrading the user experience. We would like the Disk Registry to have the same creation/deletion logic for Local disks over NBS as the current Local Disks Controller in Compute Node

ya-ksgamora · 2025-03-20T16:48:44Z

Now we also need to modify the Disk Manager code to prevent it from failing disk creation task during prolonged local disk creation attempts. Specifically, we need to ensure that it does not exceed the limit of retriable errors (currently, the limit is set at 100 errors) and to implement retries with a slower time limit (currently, the limit is set at 10 seconds).

ya-ksgamora added the blockstore Add this label to run only cloud/blockstore build and tests on PR label Jan 29, 2025

ya-ksgamora self-assigned this Jan 29, 2025

ya-ksgamora mentioned this issue Jan 29, 2025

[NBS] Asynchronous disks allocation #2763

Closed

ya-ksgamora linked a pull request Feb 13, 2025 that will close this issue

[NBS] Asynchronous device cleanup (deallocation) and synchronous allocation for local disks #3037

Merged

ya-ksgamora changed the title ~~[NBS] Asynchronous local disks cleanup and allocation of dirty devices for local disks~~ [NBS] Asynchronous devices cleanup and synchronous allocation for local disks Mar 3, 2025

ya-ksgamora changed the title ~~[NBS] Asynchronous devices cleanup and synchronous allocation for local disks~~ [NBS] Asynchronous device cleanup (deallocation) and synchronous allocation for local disks Mar 3, 2025

ya-ksgamora closed this as completed in #3037 Mar 19, 2025

ya-ksgamora reopened this Mar 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NBS] Asynchronous device cleanup (deallocation) and synchronous allocation for local disks #2945

[NBS] Asynchronous device cleanup (deallocation) and synchronous allocation for local disks #2945

ya-ksgamora commented Jan 29, 2025

ya-ksgamora commented Mar 20, 2025 •

edited

Loading

[NBS] Asynchronous device cleanup (deallocation) and synchronous allocation for local disks #2945

[NBS] Asynchronous device cleanup (deallocation) and synchronous allocation for local disks #2945

Comments

ya-ksgamora commented Jan 29, 2025

ya-ksgamora commented Mar 20, 2025 • edited Loading

ya-ksgamora commented Mar 20, 2025 •

edited

Loading