fix incorrect torch env population #1361

Jeffwan · 2021-08-14T21:45:06Z

Address #1358

PR concludes two changes

Apply strings.ToLower() on ReplicaType before string comparison. This is definitely a bug when we migrate PyTorch to kubelfow/common fashion.

https://github.com/kubeflow/tf-operator/blob/52cddeceba1e31e54a2e34551f486675fafabda2/pkg/controller.v1/pytorch/pytorch.go#L32-L33

Update incorrect logs names. I think @zw0610 may forget to clean them up in last PR add pytorch API and controller #1294

/cc @andreyvelich @zw0610 @kubeflow/wg-training-leads

Jeffwan · 2021-08-14T21:46:41Z

@andreyvelich Please try this image "kubeflow/training-operator:4d4cf6485eb40d4e6e4badb03d341b8b78c2ec92"

andreyvelich

Thanks @Jeffwan!
/lgtm

terrytangyuan

/lgtm

Jeffwan · 2021-08-15T01:13:24Z

Seems cleanRunPolicy test case is flaky. This PR doesn't even touch TensorFlow codes..

/test kubeflow-tf-operator-presubmit

Jeffwan · 2021-08-15T02:33:04Z

/approve

google-oss-robot · 2021-08-15T02:33:11Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: Jeffwan, terrytangyuan

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [Jeffwan]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

fix incorrect torch env population

3bee975

google-oss-robot requested review from jinchihe and terrytangyuan August 14, 2021 21:45

google-oss-robot added the size/S label Aug 14, 2021

andreyvelich reviewed Aug 14, 2021

View reviewed changes

google-oss-robot assigned andreyvelich Aug 14, 2021

google-oss-robot added the lgtm label Aug 14, 2021

terrytangyuan approved these changes Aug 15, 2021

View reviewed changes

google-oss-robot added the approved label Aug 15, 2021

google-oss-robot merged commit fb013e4 into kubeflow:master Aug 15, 2021

Jeffwan deleted the fix_pytorch_rank_issue branch August 15, 2021 02:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix incorrect torch env population #1361

fix incorrect torch env population #1361

Jeffwan commented Aug 14, 2021 •

edited

Loading

Jeffwan commented Aug 14, 2021

andreyvelich left a comment

terrytangyuan left a comment •

edited

Loading

Jeffwan commented Aug 15, 2021 •

edited

Loading

Jeffwan commented Aug 15, 2021

google-oss-robot commented Aug 15, 2021

fix incorrect torch env population #1361

fix incorrect torch env population #1361

Conversation

Jeffwan commented Aug 14, 2021 • edited Loading

Jeffwan commented Aug 14, 2021

andreyvelich left a comment

Choose a reason for hiding this comment

terrytangyuan left a comment • edited Loading

Choose a reason for hiding this comment

Jeffwan commented Aug 15, 2021 • edited Loading

Jeffwan commented Aug 15, 2021

google-oss-robot commented Aug 15, 2021

Jeffwan commented Aug 14, 2021 •

edited

Loading

terrytangyuan left a comment •

edited

Loading

Jeffwan commented Aug 15, 2021 •

edited

Loading