Skip to content

How to ensure that the nvidia-container-toolkit start before anyone else? #1734

@Wennn

Description

@Wennn

Once a node is restarted, the absence of a guaranteed sequence of pod restarts may result in pods that were started before the nvidia-container-toolkit cannot access the GPU devices, including device plugin.

Could you please provide some guidance on how to handle this situation? Is there any config from gpu-operator side to ensure nvidia-container-toolkit gets started at first?

Thanks.

Metadata

Metadata

Assignees

No one assigned

    Labels

    lifecycle/staleDenotes an issue or PR has remained open with no activity and has become stale.questionCategorizes issue or PR as a support question.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions