FIPS issue submitting DDPJobDefinition job from the CodeFlare Notebook #357
Comments
Full message:
@KPostOffice created a special While on the FIPS cluster:
Describe the Bug
On a non-FIPS cluster, when you submit the guided-demos/2_basic_jobs DDPJobDefinition mnisttest, the job is scheduled as pending, then switches to running, and then completes.
On a FIPS cluster, I see the following error (I'll post the entire output in a comment below):
Codeflare Stack Component Versions
Please specify the component versions in which you have encountered this bug.
Codeflare SDK:
MCAD: Unknown, integrated into CodeFlare Operator v1.0.0-rc.1
Instascale: Unknown, integrated into CodeFlare Operator v1.0.0-rc.1
Codeflare Operator: v1.0.0-rc.1
Other: OpenShift 4.12.22 with FIPS enabled:
All master and worker nodes report FIPS enabled, for example:
Steps to Reproduce the Bug
Issue with path: /tmp/torchx_workspacel83oit3q
issue.
What Have You Already Tried to Debug the Issue?
I tried it on a non-FIPS cluster and it worked fine. I also tried a second FIPS cluster to make sure it wasn't just a bad cluster.
Expected Behavior
I expected the job to be scheduled, run and complete successfully.
Screenshots, Console Output, Logs, etc.
More detail of the codeflare-notebook error message will be posted below.
Affected Releases
main
Additional Context
Add as applicable and when known:
Enabled with FIPS
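For anyone trying to reproduce this, here is a minimal sketch of one way to confirm from Python (for example, from a notebook cell) whether the underlying Linux kernel reports FIPS mode. It assumes the standard kernel flag file /proc/sys/crypto/fips_enabled is readable from wherever the code runs. The helper name fips_enabled is hypothetical and is not part of the CodeFlare SDK or TorchX.

```python
from pathlib import Path


def fips_enabled(flag_path: str = "/proc/sys/crypto/fips_enabled") -> bool:
    """Return True if the kernel FIPS flag file exists and contains "1".

    Hypothetical helper for illustration; assumes a Linux host that
    exposes the standard FIPS flag under /proc.
    """
    try:
        return Path(flag_path).read_text().strip() == "1"
    except OSError:
        # Flag file is absent or unreadable (e.g. a non-Linux host):
        # treat that as "FIPS not enabled".
        return False


if __name__ == "__main__":
    print("FIPS enabled:", fips_enabled())
```

This only reports the kernel flag; it does not show which call in the job submission path fails under FIPS.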