File tree 6 files changed +82
-16
lines changed
6 files changed +82
-16
lines changed Original file line number Diff line number Diff line change @@ -70,13 +70,19 @@ Install the AppWrapper Operator
70
70
``` sh
71
71
kubectl apply --server-side -k setup.k8s-v1.25/appwrapper
72
72
```
73
-
74
- - the AppWrapper controller is enabled and configured as follows:
75
- - ` userRBACAdmissionCheck ` is disabled,
76
- - ` schedulerName ` is set to ` scheduler-plugins-scheduler ` ,
77
- - ` queueName ` is set to ` default-queue ` ,
78
-
79
- TODO: ** * UNDER CONSTRUCTION **
73
+ The provided configuration differs from the default configuration of the
74
+ operators as follows:
75
+ - Kubeflow Training Operator:
76
+ - ` gang-scheduler-name ` is set to ` scheduler-plugins-scheduler ` ,
77
+ - Kueue:
78
+ - ` manageJobsWithoutQueueName ` is enabled,
79
+ - ` batch/job ` integration is disabled,
80
+ - ` waitForPodsReady ` is disabled,
81
+ - AppWrapper operator:
82
+ - ` userRBACAdmissionCheck ` is disabled,
83
+ - ` schedulerName ` is set to ` scheduler-plugins-scheduler ` ,
84
+ - ` queueName ` is set to ` default-queue ` ,
85
+ - pod priorities, resource requests and limits have been adjusted.
80
86
81
87
## Kueue Configuration
82
88
Original file line number Diff line number Diff line change
1
+ kind : ConfigMap
2
+ apiVersion : v1
3
+ metadata :
4
+ name : appwrapper-operator-config
5
+ namespace : appwrapper-system
6
+ data :
7
+ config.yaml : |
8
+ appwrapper:
9
+ enableKueueIntegrations: true
10
+ kueueJobReconciller:
11
+ manageJobsWithoutQueueName: true
12
+ waitForPodsReady: false
13
+ queueName: default-queue
14
+ schedulerName: scheduler-plugins-scheduler
15
+ userRBACAdmissionCheck: false
16
+ controllerManager:
17
+ health:
18
+ bindAddress: ":8081"
19
+ metrics:
20
+ bindAddress: "127.0.0.1:8080"
21
+ leaderElection: true
Original file line number Diff line number Diff line change
1
+ apiVersion : kustomize.config.k8s.io/v1beta1
2
+ kind : Kustomization
3
+
4
+ namespace : mlbatch-system
5
+
6
+ resources :
7
+ - " https://github.com/project-codeflare/appwrapper/config/default?ref=v0.21.0"
8
+
9
+ images :
10
+ - name : quay.io/ibm/appwrapper
11
+ newTag : v0.21.0
12
+
13
+ patches :
14
+ - path : manager_resources_patch.yaml
15
+ - path : config_patch.yaml
Original file line number Diff line number Diff line change
1
+ apiVersion : apps/v1
2
+ kind : Deployment
3
+ metadata :
4
+ name : controller-manager
5
+ namespace : system
6
+ spec :
7
+ template :
8
+ spec :
9
+ priorityClassName : system-node-critical
10
+ containers :
11
+ - name : manager
12
+ resources :
13
+ requests :
14
+ cpu : 250m
15
+ memory : 250Mi
16
+ limits :
17
+ cpu : 1000m
18
+ memory : 1000Mi
Original file line number Diff line number Diff line change @@ -22,8 +22,8 @@ clientConnection:
22
22
qps : 50
23
23
burst : 100
24
24
# pprofBindAddress: :8083
25
- # waitForPodsReady:
26
- # enable: false
25
+ waitForPodsReady :
26
+ enable : false
27
27
# timeout: 5m
28
28
# blockAdmission: false
29
29
# requeuingStrategy:
Original file line number Diff line number Diff line change @@ -148,13 +148,19 @@ Install the AppWrapper Operator
148
148
```sh
149
149
{{ .KUBECTL }} apply --server-side -k setup.{{ .VERSION }}/appwrapper
150
150
```
151
-
152
- - the AppWrapper controller is enabled and configured as follows:
153
- - `userRBACAdmissionCheck` is disabled,
154
- - `schedulerName` is set to `scheduler-plugins-scheduler`,
155
- - `queueName` is set to `default-queue`,
156
-
157
- TODO: *** UNDER CONSTRUCTION **
151
+ The provided configuration differs from the default configuration of the
152
+ operators as follows:
153
+ - Kubeflow Training Operator:
154
+ - `gang-scheduler-name` is set to `scheduler-plugins-scheduler`,
155
+ - Kueue:
156
+ - `manageJobsWithoutQueueName` is enabled,
157
+ - `batch/job` integration is disabled,
158
+ - `waitForPodsReady` is disabled,
159
+ - AppWrapper operator:
160
+ - `userRBACAdmissionCheck` is disabled,
161
+ - `schedulerName` is set to `scheduler-plugins-scheduler`,
162
+ - `queueName` is set to `default-queue`,
163
+ - pod priorities, resource requests and limits have been adjusted.
158
164
159
165
{{- end }}
160
166
You can’t perform that action at this time.
0 commit comments