-
Notifications
You must be signed in to change notification settings - Fork 1.8k
feat: added fix for flaky test #12260
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
Signed-off-by: Sahil Silare <[email protected]>
Hi @sahil9001. Thanks for your PR. I'm waiting for a kubeflow member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
🚫 This command cannot be processed. Only organization members or owners can use the commands. |
Signed-off-by: Sahil Silare <[email protected]>
@sahil9001 thank you for the PR. Can you fix the identation and remove the spaces you added. @akagami-harsh @pschoen-itsc for review |
Signed-off-by: Sahil Silare <[email protected]>
Signed-off-by: Sahil Silare <[email protected]>
Thanks @juliusvonkohout. I have updated the code @pschoen-itsc @akagami-harsh can you check? |
/ok-to-test |
@juliusvonkohout I see the tests are failing on the master from few commits, but passes intermittently. Is that expected? |
seaweedfs-create-admin-user-job.yaml is still referenced in kustomization.yaml |
Signed-off-by: Sahil Silare <[email protected]>
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Signed-off-by: Sahil Silare <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
@sahil9001 Could you please take a look at the intermittent PostStartHookError failures during the SeaweedFS deployment. |
Sure @VaniHaripriya , can you tell me how can I test this locally? Is there any command to replicate those tests locally to check the correctness? |
@sahil9001 You can create a Kind cluster and run the following to mirror what CI does:
|
@VaniHaripriya I am getting this issue while running :
I already have python and pip installed on my system, but still I am facing this issue. |
@sahil9001 you can also install it as shown here to replicate it locally https://github.com/kubeflow/manifests/blob/8cc8dcfcb749bf50b2d525c01f13ec91d14ea258/.github/workflows/full_kubeflow_integration_test.yaml#L70 You can even just raise a dummy PR against kubeflow/manifests to use the testing infrastructure. |
Summary
Fix intermittent SeaweedFS S3 auth race causing artifact upload failures in CI (“Signed request requires setting up SeaweedFS S3 authentication”).
Root Cause
Argo uploaded signed S3 requests before SeaweedFS finished configuring S3 users. Auth setup previously ran in a separate Job, creating a timing window.
Changes
envFrom
to readaccesskey
/secretkey
frommlpipeline-minio-artifact
.lifecycle.postStart
to:mlpipeline
bucket (idempotent).s3.configure -user kubeflow-admin -access_key $accesskey -secret_key $secretkey -actions Admin -apply
.init-seaweedfs
Job (seaweedfs-create-admin-user-job.yaml
).Impact
Testing
Rollout
mlpipeline-minio-artifact
secret.Risks / Mitigations
Checklist: