KEP-2170: Add TrainJob conditions #2322
@@ -51,7 +51,7 @@ type Framework struct {
 }

 func (f *Framework) Init() *rest.Config {
-	log.SetLogger(zap.New(zap.WriteTo(ginkgo.GinkgoWriter), zap.UseDevMode(true)))
+	ctrl.SetLogger(zap.New(zap.WriteTo(ginkgo.GinkgoWriter), zap.Level(zapcore.Level(-5)), zap.UseDevMode(true)))
Surfacing the debug logs.
Pull Request Test Coverage Report for Build 11732628418 (Coveralls)
/hold for review
/assign @kubeflow/wg-training-leads
Signed-off-by: Yuki Iwai <[email protected]>
What this PR does / why we need it:
I implemented the TrainJob condition mechanism based on the state transition described in https://github.com/kubeflow/training-operator/tree/master/docs/proposals/2170-kubeflow-training-v2#state-transition
However, the current implementation depends on the JobSet status.conditions rather than status.terminalState, because terminalState was introduced in JobSet v0.6, and that JobSet version depends on the K8s 1.30 libraries (#2299).
So, once we upgrade the K8s libs to 1.30, we can revisit using the JobSet status.terminalState.
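To illustrate the idea of deriving a terminal TrainJob condition from the JobSet status.conditions, here is a minimal, stdlib-only sketch. It is not the code in this PR: the Condition struct is a simplified stand-in for metav1.Condition, and the condition type names, reasons, and the terminalCondition helper are assumptions made up for illustration.

```go
package main

import "fmt"

// Condition is a simplified stand-in for metav1.Condition.
// Assumption: only the fields needed for this sketch are modeled.
type Condition struct {
	Type    string
	Status  string // "True", "False", or "Unknown"
	Reason  string
	Message string
}

// Hypothetical condition type names, used for illustration only.
const (
	jobSetCompleted  = "Completed"
	jobSetFailed     = "Failed"
	trainJobComplete = "Complete"
	trainJobFailed   = "Failed"
)

// findTrue returns the condition of the given type if its status is "True".
func findTrue(conds []Condition, condType string) (Condition, bool) {
	for _, c := range conds {
		if c.Type == condType && c.Status == "True" {
			return c, true
		}
	}
	return Condition{}, false
}

// terminalCondition maps JobSet conditions to a terminal TrainJob condition.
// It returns false when the JobSet has not reached a terminal state yet.
// A failed JobSet is checked first so that failure takes precedence.
func terminalCondition(jobSetConds []Condition) (Condition, bool) {
	if c, ok := findTrue(jobSetConds, jobSetFailed); ok {
		return Condition{Type: trainJobFailed, Status: "True", Reason: "JobSetFailed", Message: c.Message}, true
	}
	if c, ok := findTrue(jobSetConds, jobSetCompleted); ok {
		return Condition{Type: trainJobComplete, Status: "True", Reason: "JobSetCompleted", Message: c.Message}, true
	}
	return Condition{}, false
}

func main() {
	conds := []Condition{{Type: jobSetCompleted, Status: "True", Message: "jobset completed successfully"}}
	if c, ok := terminalCondition(conds); ok {
		fmt.Printf("%s=%s (%s)\n", c.Type, c.Status, c.Message)
	}
}
```

If status.terminalState becomes available after the K8s library upgrade, a helper like this could presumably switch on that single field instead of scanning the conditions list.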
Which issue(s) this PR fixes (optional, in Fixes #<issue number>, #<issue number>, ... format, will close the issue(s) when PR gets merged):
Part-of: #2207
Relates to #2170
Checklist: