Description
Nomad version
v1.10.2 (Nomad Master, Nomad Clients, and my local CLI client)
Issue
Yesterday, I tried to delete a Dynamic Host Volume in 2 of our Nomad clusters, and both failed to delete the volume.
Nomad CLI returned the following error:
Error deleting volume: Unexpected response code: 500 (could not delete volume 7f8d6ba2-992e-556a-5382-ba976c38febd in use by alloc b102ceb2-8fe3-9e8e-d465-d1c84f36502b)
In the Nomad Web UI, the volume's state was set to: unavailable
According to this PR, the Nomad CLI now has a --force flag to force the deletion of a volume, but even that had no effect. Before trying the force command, I updated both Nomad clusters to version 1.10.2.
Reproduction steps
I don't think it's easily reproducible. I have created and deleted various dynamic host volumes over the last few weeks and never ran into this issue.
Expected Result
The dynamic host volume should be deleted since no job was depending on it anymore.
Actual Result
The dynamic host volume is not deleted and is now stuck in an unavailable state.
The only way to solve the issue was to drain the node, stop the Nomad client, run the garbage collector, and start the client again. Afterwards, I could delete the volume with the same command as before.
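For anyone hitting the same problem, the workaround above can be sketched as the following sequence of commands. This is only a rough sketch, not an official procedure: it assumes the Nomad client runs as a systemd unit named "nomad", that the stop/start steps are executed on the affected client node, and that the volume is deleted with `nomad volume delete -type host`. The helper names `workaround_commands` and `run_workaround` are hypothetical, and the script only prints the commands unless `dry_run=False` is passed.

```python
import shlex
import subprocess


def workaround_commands(node_id, volume_id):
    """Return the workaround steps as argv lists, in the order described above."""
    return [
        # 1. Drain the node that hosts the stuck volume.
        ["nomad", "node", "drain", "-enable", "-yes", node_id],
        # 2. Stop the Nomad client (assumption: systemd unit named "nomad").
        ["sudo", "systemctl", "stop", "nomad"],
        # 3. Run the garbage collector.
        ["nomad", "system", "gc"],
        # 4. Start the Nomad client again.
        ["sudo", "systemctl", "start", "nomad"],
        # 5. Retry the delete that previously failed with the 500 error.
        ["nomad", "volume", "delete", "-type", "host", volume_id],
    ]


def run_workaround(node_id, volume_id, dry_run=True):
    """Print each step; execute it only when dry_run is False."""
    for argv in workaround_commands(node_id, volume_id):
        print(shlex.join(argv))
        if not dry_run:
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    # Placeholder node ID; the volume ID is the one from the error message.
    run_workaround("<node-id>", "7f8d6ba2-992e-556a-5382-ba976c38febd")
```

The sketch does not cover making the node eligible for scheduling again after the drain, which I did separately.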
The volume had not been in use for several days. Maybe Nomad somehow thought it still was?
Also, oddly, I could not get version 1.10.2 of Nomad for my Ubuntu Focal instance. Apt kept saying that my 1.10.1 version was the latest. Is Ubuntu Focal no longer supported?