Unable to delete Dynamic Host Volume #26140

Open
@shochdoerfer

Nomad version

v1.10.2 (Nomad servers, Nomad clients, and my local CLI)

Issue

Yesterday, I tried to delete a Dynamic Host Volume in two of our Nomad clusters, and the deletion failed in both.

Nomad CLI returned the following error:
Error deleting volume: Unexpected response code: 500 (could not delete volume 7f8d6ba2-992e-556a-5382-ba976c38febd in use by alloc b102ceb2-8fe3-9e8e-d465-d1c84f36502b)

In the Nomad web UI, the volume's state changed to: unavailable

According to this PR, the Nomad CLI now has a --force flag to force the deletion of a volume, but even that had no effect. I updated both Nomad clusters to version 1.10.2 before trying the force flag.

Reproduction steps

I don't think it's easily reproducible. I have created and deleted various dynamic host volumes over the last few weeks and never ran into an issue.

Expected Result

The dynamic host volume should be deleted, since no job depended on it anymore.

Actual Result

The dynamic host volume is not deleted and is now stuck in an unavailable state.

The only workaround I found was to drain the node, stop the Nomad client, run the garbage collector, and start the client again. Afterwards, I could delete the volume with the same command as before.
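As a rough sketch, the workaround steps above could look like the following. It assumes the client agent runs as a systemd unit named "nomad" and that the commands are run on the affected node. "<node-id>" is a placeholder, and the final delete is shown without the force flag. The script only prints the commands unless dry_run is turned off.

```python
import shlex
import subprocess


def workaround_commands(node_id, volume_id):
    """Return the workaround steps as argv lists, in the order described above."""
    return [
        # 1. Drain the node that hosts the stuck volume.
        ["nomad", "node", "drain", "-enable", "-yes", node_id],
        # 2. Stop the Nomad client agent (assumes a systemd unit named "nomad").
        ["systemctl", "stop", "nomad"],
        # 3. Trigger a garbage collection.
        ["nomad", "system", "gc"],
        # 4. Start the client agent again.
        ["systemctl", "start", "nomad"],
        # 5. Retry the delete that failed before.
        ["nomad", "volume", "delete", "-type", "host", volume_id],
    ]


def run(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for argv in commands:
        print(shlex.join(argv))
        if not dry_run:
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    # "<node-id>" is a placeholder; the volume ID is the one from the error above.
    run(workaround_commands("<node-id>", "7f8d6ba2-992e-556a-5382-ba976c38febd"))
```

A drained node stays ineligible for scheduling until eligibility is re-enabled, so that may be needed as an extra step after the client is back.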

The volume had not been in use for several days. Maybe Nomad somehow thought it still was?

One more odd thing: I could not get Nomad 1.10.2 for my Ubuntu Focal instance. Apt kept saying that my installed 1.10.1 is the latest version. Is Ubuntu Focal no longer supported?
