Skip to content

GPU-related Errors when installing Red Hat OpenShift AI Addon on Red Hat OpenShift Container Platform in IBM Cloud #1709

@Saikiran9824-pronteff

Description

@Saikiran9824-pronteff

Important Note: NVIDIA AI Enterprise customers can get support from NVIDIA Enterprise support. Please open a case here.

Describe the bug
I want to configure RedHat OpenShift AI on IBM Cloud for that I were trying to install OpenShift AI and their pre-requisites( OpenShift Pipelines, Node Feature Discovery, NVIDIA GPU Operator) Operator's but getting NVIDIA operator error after installing.
Error logs:
oc logs gpu-feature-discovery-p2jfj -c toolkit-validation | tail -n 1
2025-09-05T03:46:38.612205612Z waiting for nvidia container stack to be setup

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions