-
Notifications
You must be signed in to change notification settings - Fork 23
Open
Labels
HackathonTopics to discuss in the HackathonTopics to discuss in the HackathonImprovementCode ImprovementsCode Improvementspotential problemAvoid foreseeable misuseAvoid foreseeable misuse
Description
Recently, a black hole like situation has occurred on one of our HPC clusters. The automated configuration of HTCondor on the Drone has not worked anymore, due to a full system disk on the remote git server. TARDIS relentlessly tried to boot up new Drones, which end up in a sort of DDoS situation on the remote git server.
Would be nice to implement a mechanism, that stops deploying new Drones if the life time of a Drone is too short or to many Drones are spawned in a defined interval.
Metadata
Metadata
Assignees
Labels
HackathonTopics to discuss in the HackathonTopics to discuss in the HackathonImprovementCode ImprovementsCode Improvementspotential problemAvoid foreseeable misuseAvoid foreseeable misuse