Merged
147 commits
fbd05e7
Draft version of kubectl-apply implemented
sharabiani Aug 19, 2024
480416d
Add description as a hint to "missed required variable" error
mr0re1 Aug 21, 2024
5fb1414
wait_for_rollout added to kubectl module and retry increased.
sharabiani Aug 28, 2024
d955e03
SlurmGCP. Don't skip clean up of "stopped" instances
mr0re1 Aug 30, 2024
d9a829b
documents updated
sharabiani Sep 4, 2024
39b2fab
minor doc style
sharabiani Sep 4, 2024
a208b21
minor doc update
sharabiani Sep 4, 2024
22e26fa
Updating image builder to use terraformrc file for google-private pro…
cdunbar13 Sep 4, 2024
399f153
wait_for_rollout set to true in manifests object
sharabiani Sep 4, 2024
a56e988
Merge branch 'develop' into kubectl-apply-module
sharabiani Sep 4, 2024
beeea1f
doc fix
sharabiani Sep 4, 2024
bde1b24
Default to zonal bulkInsert
mr0re1 Sep 5, 2024
d74c2e0
Merge pull request #2980 from sharabiani/kubectl-apply-module
sharabiani Sep 5, 2024
4ddaf61
Merge pull request #2988 from mr0re1/kill_em_all
mr0re1 Sep 5, 2024
cd07341
Fix Slurm tag on A3 high integration test to match v6 blueprint
tpdownes Sep 5, 2024
ee96846
Add enable-maintenance-reservation flag in slurm to control reservation
harshthakkar01 Aug 30, 2024
9a781df
Merge pull request #2987 from harshthakkar01/slurm-maintenance
harshthakkar01 Sep 6, 2024
2188442
Enable public IPs by default in a3-megagpu-8g blueprint
tpdownes Sep 6, 2024
b21b5e3
Initial commit of A3 Mega cluster integration test
tpdownes Sep 6, 2024
f82ef07
Merge pull request #3002 from cdunbar13/add-terraformrc-to-tests
cdunbar13 Sep 6, 2024
240656d
Merge pull request #3009 from tpdownes/a3mega_cluster_test
tpdownes Sep 6, 2024
9c2e6e2
Update gVNIC driver in a3 mega solution
tpdownes Sep 6, 2024
4985307
Update Debian 12 image in A3 Mega blueprint
tpdownes Sep 6, 2024
b168cf5
Merge pull request #2957 from tpdownes/a3_mega_update_gvnic
tpdownes Sep 6, 2024
bfb3f81
Implement udev-based approach to mounting aperture devices
tpdownes Sep 6, 2024
e011345
Merge pull request #2955 from tpdownes/new_aperture
tpdownes Sep 6, 2024
93df5b7
adding module cache to prevent repeated module downloads during modul…
RachaelSTamakloe Sep 6, 2024
06bd8a6
add support for enabling tcpx/o in a3 and a3mega vm, provide script f…
chengcongdu Sep 6, 2024
ae3c76e
fix typo
chengcongdu Sep 6, 2024
255ac57
Bump golang.org/x/sys from 0.24.0 to 0.25.0
dependabot[bot] Sep 9, 2024
2bf9206
Merge pull request #3007 from GoogleCloudPlatform/fix_a3high_tag
tpdownes Sep 9, 2024
ea2caeb
Merge pull request #2958 from tpdownes/a3_mega_update_image
tpdownes Sep 9, 2024
cb4ccd3
Merge pull request #3016 from GoogleCloudPlatform/release-candidate
nick-stroud Sep 9, 2024
7515cb6
address comments
chengcongdu Sep 9, 2024
f4b5be1
Merge pull request #3005 from mr0re1/zonal_bi
mr0re1 Sep 9, 2024
b90db64
remove link to internal doc
chengcongdu Sep 9, 2024
e0c9f18
Add documentation for reservation for scheduled maintenance
harshthakkar01 Sep 9, 2024
6c6e771
Merge pull request #3014 from harshthakkar01/slurm-maintenance
harshthakkar01 Sep 9, 2024
5ada0ac
update user_workload_path description
chengcongdu Sep 10, 2024
515f516
Use Terraform naming conventions and TFLint rules
tpdownes Sep 10, 2024
0812e9b
Merge pull request #3013 from GoogleCloudPlatform/dependabot/go_modul…
alyssa-sm Sep 10, 2024
6009f51
Update slurm references to 6.7.0
alyssa-sm Sep 9, 2024
26f5abb
Merge pull request #3015 from alyssa-sm/update-slurm-references-to-6.7.0
alyssa-sm Sep 10, 2024
c8267e3
Add enroot/pyxis step to a3 series integration tests
tpdownes Sep 10, 2024
99e5247
Merge pull request #2983 from tpdownes/enroot_pyxis_test
tpdownes Sep 10, 2024
2c7d90e
Remove default service account
alyssa-sm Aug 27, 2024
0f8c3a7
remove user_workload_path parameter and update output instruction
chengcongdu Sep 11, 2024
b827380
Merge pull request #2976 from alyssa-sm/remove-default-service-account
alyssa-sm Sep 11, 2024
62acb6a
Add machine type- availability zone precondition. Validate infra is p…
annuay-google Sep 4, 2024
dd5647e
Merge pull request #3003 from annuay-google/annuay/add-machine-type-a…
annuay-google Sep 11, 2024
ef1f88b
Merge pull request #3010 from RachaelSTamakloe/versioned_blueprints_i…
RachaelSTamakloe Sep 11, 2024
22cb509
update instructions
chengcongdu Sep 12, 2024
5bfdbff
Create default service account resource in controller IFF necessary
alyssa-sm Sep 11, 2024
5c2be83
Update zone for HCLS v5 (legacy) blueprint integration test
tpdownes Sep 12, 2024
85d55d6
Merge pull request #3026 from tpdownes/fix_hcls_legacy_test
tpdownes Sep 12, 2024
8fe7748
Merge pull request #3028 from GoogleCloudPlatform/main
rohitramu Sep 12, 2024
67944a7
add warning about rxdm version
chengcongdu Sep 12, 2024
18c682d
Update module references to v1.39.0
rohitramu Sep 12, 2024
1256e60
only show gpu_direct instructions for a3/a3mega vm
chengcongdu Sep 12, 2024
ab43928
fix comment
chengcongdu Sep 13, 2024
0cf64ca
Merge pull request #3012 from chengcongdu/develop
chengcongdu Sep 13, 2024
fccea9e
Start daos-agent service in mount script
harshthakkar01 Sep 13, 2024
8ada8f3
Merge pull request #3019 from tpdownes/fix_gke_modules
tpdownes Sep 13, 2024
91f05f0
Merge pull request #3022 from alyssa-sm/remove-default-service-accoun…
alyssa-sm Sep 13, 2024
5d0100e
Merge pull request #3030 from rohitramu/roramu/update_refs
rohitramu Sep 13, 2024
5b961dc
Remove remaining references to hpc-centos-7 image
rohitramu Sep 12, 2024
dbd9d63
Update PBS Pro CentOS 7 references to point to a specific image by na…
rohitramu Sep 13, 2024
1ea56c8
Re-enable PBS Pro test
rohitramu Sep 13, 2024
5f78d93
Merge pull request #3035 from rohitramu/roramu/remove_centos_refs
rohitramu Sep 13, 2024
27bb10e
SlurmGCP. Remove usage of `google_compute_default_service_account`
mr0re1 Sep 16, 2024
a76aba0
Merge pull request #3033 from harshthakkar01/parallelstore-support
harshthakkar01 Sep 16, 2024
2ccc1dd
SlurmGCP V6. Remove CentOS7 image support.
mr0re1 Sep 16, 2024
5437ab8
Merge pull request #3037 from mr0re1/fix_sa
mr0re1 Sep 16, 2024
964fd0e
support ghpc_stage function in kubectl-apply module
sharabiani Sep 13, 2024
310ac3a
Merge pull request #3036 from sharabiani/relative-path-kubectl
sharabiani Sep 16, 2024
11769c7
Change zone to have valid reservation
annuay-google Sep 17, 2024
60144a6
Merge pull request #3042 from annuay-google/annuay/fix-test-hpc-slurm…
annuay-google Sep 17, 2024
1c29e9d
adding golden copy test for versioned blueprints feature
RachaelSTamakloe Sep 17, 2024
d810992
Retry mounting daos container
harshthakkar01 Sep 17, 2024
84c61d2
Merge pull request #3038 from mr0re1/slurm_centos7
mr0re1 Sep 17, 2024
8884f25
Merge pull request #3045 from harshthakkar01/feedback-ps
harshthakkar01 Sep 18, 2024
0b5d58d
Merge pull request #2964 from mr0re1/desc_hint
mr0re1 Sep 18, 2024
22b25ed
Merge pull request #3044 from RachaelSTamakloe/v_blueprint_add_golden…
RachaelSTamakloe Sep 18, 2024
cb8ceaa
Adding gcloud path override for staging use
cdunbar13 Sep 18, 2024
6b0ed10
add additional vpc validation for a3/a3mega machine
chengcongdu Sep 18, 2024
040413d
fix typo
chengcongdu Sep 18, 2024
a2affe4
Merge pull request #3048 from cdunbar13/wait-for-startup-pathoverride
cdunbar13 Sep 18, 2024
7b18fb1
Merge branch 'GoogleCloudPlatform:develop' into develop
chengcongdu Sep 18, 2024
91f1e81
adding unit test for version blueprint caching mechanism
RachaelSTamakloe Sep 19, 2024
83bbc59
make gke-node-pool variable additional_networks nullable false
chengcongdu Sep 19, 2024
18f99fc
Update module READMEs using terraform-docs@0.19.0
tpdownes Sep 19, 2024
2f12bb1
Merge pull request #3053 from tpdownes/fix_docs
tpdownes Sep 19, 2024
347b9af
Install Kueue and Jobset as a setting on the kubectl apply module and…
ankitkinra Sep 9, 2024
a7db2ec
Merge branch 'GoogleCloudPlatform:develop' into develop
chengcongdu Sep 19, 2024
dafba02
Merge pull request #3049 from chengcongdu/develop
chengcongdu Sep 19, 2024
bdd41ff
Merge pull request #3017 from ankitkinra/add-kueue-module
ankitkinra Sep 19, 2024
d085d06
Make legacy test for A3 high blueprint compatible with deprecation of…
tpdownes Sep 19, 2024
f3efb5f
Move a3-highgpu-8g tests to us-west1-a due to available capacity for …
tpdownes Sep 19, 2024
d46ec06
adding documentation for versioned blueprint feature
RachaelSTamakloe Sep 19, 2024
074d662
Merge pull request #3050 from GoogleCloudPlatform/fix_a3_high_legacy_…
tpdownes Sep 19, 2024
b70f151
add argparse dependency to cloud build
chengcongdu Sep 19, 2024
c28331f
Validate Reservations in GKE Blueprints
arajmane-g Sep 20, 2024
3bae11d
Merge pull request #3024 from arajmane-g/reservations
arajmane-g Sep 20, 2024
b8adc01
Merge pull request #3057 from chengcongdu/develop
chengcongdu Sep 20, 2024
98ac5a8
update readme for gpudirect
chengcongdu Sep 20, 2024
3bd751d
Merge branch 'GoogleCloudPlatform:develop' into develop
chengcongdu Sep 20, 2024
b5740c5
fix comment
chengcongdu Sep 20, 2024
f352578
Merge branch 'develop' of https://github.com/chengcongdu/hpc-toolkit …
chengcongdu Sep 20, 2024
fe4a73f
Merge pull request #3055 from RachaelSTamakloe/v_blueprint_documentation
RachaelSTamakloe Sep 23, 2024
da41a34
using table test
RachaelSTamakloe Sep 23, 2024
aeb6955
Add initial_node_count support to gke-node-pool
sharabiani Sep 24, 2024
165e59c
Merge pull request #3059 from chengcongdu/develop
chengcongdu Sep 24, 2024
95d6a75
Update wording about DWS Flex
mr0re1 Sep 24, 2024
65fe99d
Merge pull request #3069 from mr0re1/upd_flex
mr0re1 Sep 24, 2024
82e0993
Merge pull request #3052 from RachaelSTamakloe/v_blueprint_cache_unit…
RachaelSTamakloe Sep 24, 2024
f8c8fd6
Use self_link for subnetwork instead of name.
wiktorn Sep 18, 2024
935e9f5
Merge pull request #3046 from wiktorn/fix_multivpc_region
tpdownes Sep 24, 2024
1c39f43
Bump google.golang.org/grpc from 1.64.0 to 1.64.1
dependabot[bot] Sep 17, 2024
c76fcd0
Allow users to provide a commit hash instead of git tag for Spack and…
rohitramu Sep 25, 2024
0c0e0dd
preconditions added for initial_node_count
sharabiani Sep 25, 2024
2015427
Fix missing argument
mr0re1 Sep 25, 2024
9df79e0
Improve install doas script
harshthakkar01 Sep 24, 2024
1d307cb
Merge pull request #3074 from mr0re1/max_hops
mr0re1 Sep 25, 2024
4737b3e
Merge pull request #3073 from rohitramu/roramu/fix_spack_git_ref
rohitramu Sep 26, 2024
ece0d4a
Merge pull request #3068 from sharabiani/initial-nodes-support
sharabiani Sep 26, 2024
94c2268
Merge pull request #3071 from harshthakkar01/a3mega-ps
harshthakkar01 Sep 26, 2024
4ddff3c
fix Parallelstore installation on vanilla RHEL and Rocky
wiktorn Sep 21, 2024
0d3e4f0
Merge pull request #3043 from GoogleCloudPlatform/dependabot/go_modul…
cdunbar13 Sep 26, 2024
d25ec7e
Merge pull request #3061 from wiktorn/fix_parallelstore_on_rocky_rhel
harshthakkar01 Sep 26, 2024
e327084
removing deprecated spack setup variables
RachaelSTamakloe Sep 16, 2024
4171c18
removing deprecated ramble setup variables
RachaelSTamakloe Sep 16, 2024
31a3781
Merge pull request #3040 from RachaelSTamakloe/remove_deprecated_spac…
RachaelSTamakloe Sep 26, 2024
1580ace
Merge pull request #3041 from RachaelSTamakloe/remove_deprecated_ramb…
RachaelSTamakloe Sep 26, 2024
72780f9
resolving error when var.initial_node_count is null
RachaelSTamakloe Sep 27, 2024
1e76caf
Merge pull request #3081 from RachaelSTamakloe/resolve_initial_node_c…
RachaelSTamakloe Sep 28, 2024
d34cda5
A3 High blueprint prolog solution updates
tpdownes Sep 30, 2024
aa5dc45
Merge pull request #3088 from tpdownes/fix_prolog
tpdownes Oct 1, 2024
ab076af
Increase version to 1.40.0
RachaelSTamakloe Sep 26, 2024
c6eebdc
Merge pull request #3077 from GoogleCloudPlatform/version/v1.40.0
RachaelSTamakloe Oct 1, 2024
85ba1c9
nemo readme instructions for preloading gpt2 tokenizer
koallison Sep 25, 2024
2dc9899
update a3-high readme
koallison Sep 26, 2024
e48486d
Merge pull request #3075 from koallison/nemo-hf-cache
tpdownes Oct 1, 2024
7f86cbd
update NeMo 23.11 to 24.07
akiki-liang0 Oct 1, 2024
ec048a0
Fix whitespace in NeMo README files
tpdownes Oct 1, 2024
c85d349
Merge pull request #3090 from GoogleCloudPlatform/update-nemo-version
tpdownes Oct 2, 2024
f75d801
Increase resume-timeout for TPU partition in blueprint
harshthakkar01 Oct 2, 2024
1b12bed
Merge pull request #3094 from harshthakkar01/fix-tpu-bp
harshthakkar01 Oct 2, 2024
2 changes: 1 addition & 1 deletion cmd/root.go
@@ -53,7 +53,7 @@ HPC deployments on the Google Cloud Platform.`,
logging.Fatal("cmd.Help function failed: %s", err)
}
},
-Version: "v1.39.0",
+Version: "v1.40.0",
Annotations: annotation,
}
)
2 changes: 1 addition & 1 deletion community/examples/hpc-build-slurm-image.yaml
@@ -23,7 +23,7 @@ vars:
image_build_machine_type: n2d-standard-16
build_from_image_family: hpc-rocky-linux-8
build_from_image_project: cloud-hpc-image-public
-build_from_git_ref: 6.5.9
+build_from_git_ref: 6.7.0
built_image_family: my-custom-slurm
built_instance_image:
family: $(vars.built_image_family)
1 change: 1 addition & 0 deletions community/examples/hpc-slurm6-tpu.yaml
@@ -49,6 +49,7 @@ deployment_groups:
use: [tpu_nodeset]
settings:
partition_name: tpu
+resume_timeout: 600

- id: slurm_login
source: community/modules/scheduler/schedmd-slurm-gcp-v6-login
@@ -71,12 +71,12 @@ limitations under the License.
| <a name="input_folder_id"></a> [folder\_id](#input\_folder\_id) | Folder ID where the project should be created. It can be skipped if already setting organization\_id. Leave blank if the project should be created directly underneath the Organization node. | `string` | `""` | no |
| <a name="input_image_family"></a> [image\_family](#input\_image\_family) | DEPRECATED: Image of the AI notebook. | `string` | `null` | no |
| <a name="input_image_project"></a> [image\_project](#input\_image\_project) | DEPRECATED: Google Cloud project where the image is hosted. | `string` | `null` | no |
-| <a name="input_instance_image"></a> [instance\_image](#input\_instance\_image) | Image of the AI notebook.<br><br>Expected Fields:<br>name: The name of the image. Mutually exclusive with family.<br>family: The image family to use. Mutually exclusive with name.<br>project: The project where the image is hosted. | `map(string)` | <pre>{<br> "family": "tf-latest-cpu",<br> "project": "deeplearning-platform-release"<br>}</pre> | no |
+| <a name="input_instance_image"></a> [instance\_image](#input\_instance\_image) | Image of the AI notebook.<br/><br/>Expected Fields:<br/>name: The name of the image. Mutually exclusive with family.<br/>family: The image family to use. Mutually exclusive with name.<br/>project: The project where the image is hosted. | `map(string)` | <pre>{<br/> "family": "tf-latest-cpu",<br/> "project": "deeplearning-platform-release"<br/>}</pre> | no |
| <a name="input_ip_cidr_range"></a> [ip\_cidr\_range](#input\_ip\_cidr\_range) | Unique IP CIDR Range for AI Notebooks subnet | `string` | `"10.142.190.0/24"` | no |
| <a name="input_machine_type"></a> [machine\_type](#input\_machine\_type) | Type of VM you would like to spin up | `string` | `"n1-standard-1"` | no |
| <a name="input_network_name"></a> [network\_name](#input\_network\_name) | Name of the network to be created. | `string` | `"ai-notebook"` | no |
| <a name="input_organization_id"></a> [organization\_id](#input\_organization\_id) | Organization ID where GCP Resources need to get spin up. It can be skipped if already setting folder\_id | `string` | `""` | no |
-| <a name="input_owner_id"></a> [owner\_id](#input\_owner\_id) | Billing Account associated to the GCP Resources | `list(any)` | <pre>[<br> ""<br>]</pre> | no |
+| <a name="input_owner_id"></a> [owner\_id](#input\_owner\_id) | Billing Account associated to the GCP Resources | `list(any)` | <pre>[<br/> ""<br/>]</pre> | no |
| <a name="input_project"></a> [project](#input\_project) | Project in which to launch the AI Notebooks. | `string` | `""` | no |
| <a name="input_project_name"></a> [project\_name](#input\_project\_name) | Project name or ID, if it's an existing project. | `string` | `"gcluster-discovery"` | no |
| <a name="input_random_id"></a> [random\_id](#input\_random\_id) | Adds a suffix of 4 random characters to the `project_id` | `string` | `null` | no |
14 changes: 7 additions & 7 deletions community/modules/compute/htcondor-execute-point/README.md
@@ -212,7 +212,7 @@ limitations under the License.
|------|--------|---------|
| <a name="module_execute_point_instance_template"></a> [execute\_point\_instance\_template](#module\_execute\_point\_instance\_template) | terraform-google-modules/vm/google//modules/instance_template | 10.1.1 |
| <a name="module_mig"></a> [mig](#module\_mig) | terraform-google-modules/vm/google//modules/mig | 10.1.1 |
-| <a name="module_startup_script"></a> [startup\_script](#module\_startup\_script) | github.com/GoogleCloudPlatform/hpc-toolkit//modules/scripts/startup-script | v1.36.0&depth=1 |
+| <a name="module_startup_script"></a> [startup\_script](#module\_startup\_script) | github.com/GoogleCloudPlatform/hpc-toolkit//modules/scripts/startup-script | v1.39.0&depth=1 |

## Resources

@@ -226,7 +226,7 @@ limitations under the License.

| Name | Description | Type | Default | Required |
|------|-------------|------|---------|:--------:|
-| <a name="input_allow_automatic_updates"></a> [allow\_automatic\_updates](#input\_allow\_automatic\_updates) | If false, disables automatic system package updates on the created instances. This feature is<br>only available on supported images (or images derived from them). For more details, see<br>https://cloud.google.com/compute/docs/instances/create-hpc-vm#disable_automatic_updates | `bool` | `true` | no |
+| <a name="input_allow_automatic_updates"></a> [allow\_automatic\_updates](#input\_allow\_automatic\_updates) | If false, disables automatic system package updates on the created instances. This feature is<br/>only available on supported images (or images derived from them). For more details, see<br/>https://cloud.google.com/compute/docs/instances/create-hpc-vm#disable_automatic_updates | `bool` | `true` | no |
| <a name="input_central_manager_ips"></a> [central\_manager\_ips](#input\_central\_manager\_ips) | List of IP addresses of HTCondor Central Managers | `list(string)` | n/a | yes |
| <a name="input_deployment_name"></a> [deployment\_name](#input\_deployment\_name) | Cluster Toolkit deployment name. HTCondor cloud resource names will include this value. | `string` | n/a | yes |
| <a name="input_disk_size_gb"></a> [disk\_size\_gb](#input\_disk\_size\_gb) | Boot disk size in GB | `number` | `100` | no |
@@ -236,21 +236,21 @@ limitations under the License.
| <a name="input_enable_shielded_vm"></a> [enable\_shielded\_vm](#input\_enable\_shielded\_vm) | Enable the Shielded VM configuration (var.shielded\_instance\_config). | `bool` | `false` | no |
| <a name="input_execute_point_runner"></a> [execute\_point\_runner](#input\_execute\_point\_runner) | A list of Toolkit runners for configuring an HTCondor execute point | `list(map(string))` | `[]` | no |
| <a name="input_execute_point_service_account_email"></a> [execute\_point\_service\_account\_email](#input\_execute\_point\_service\_account\_email) | Service account for HTCondor execute point (e-mail format) | `string` | n/a | yes |
-| <a name="input_guest_accelerator"></a> [guest\_accelerator](#input\_guest\_accelerator) | List of the type and count of accelerator cards attached to the instance. | <pre>list(object({<br> type = string,<br> count = number<br> }))</pre> | `[]` | no |
+| <a name="input_guest_accelerator"></a> [guest\_accelerator](#input\_guest\_accelerator) | List of the type and count of accelerator cards attached to the instance. | <pre>list(object({<br/> type = string,<br/> count = number<br/> }))</pre> | `[]` | no |
| <a name="input_htcondor_bucket_name"></a> [htcondor\_bucket\_name](#input\_htcondor\_bucket\_name) | Name of HTCondor configuration bucket | `string` | n/a | yes |
-| <a name="input_instance_image"></a> [instance\_image](#input\_instance\_image) | HTCondor execute point VM image<br><br>Expected Fields:<br>name: The name of the image. Mutually exclusive with family.<br>family: The image family to use. Mutually exclusive with name.<br>project: The project where the image is hosted. | `map(string)` | <pre>{<br> "family": "hpc-rocky-linux-8",<br> "project": "cloud-hpc-image-public"<br>}</pre> | no |
+| <a name="input_instance_image"></a> [instance\_image](#input\_instance\_image) | HTCondor execute point VM image<br/><br/>Expected Fields:<br/>name: The name of the image. Mutually exclusive with family.<br/>family: The image family to use. Mutually exclusive with name.<br/>project: The project where the image is hosted. | `map(string)` | <pre>{<br/> "family": "hpc-rocky-linux-8",<br/> "project": "cloud-hpc-image-public"<br/>}</pre> | no |
| <a name="input_labels"></a> [labels](#input\_labels) | Labels to add to HTConodr execute points | `map(string)` | n/a | yes |
| <a name="input_machine_type"></a> [machine\_type](#input\_machine\_type) | Machine type to use for HTCondor execute points | `string` | `"n2-standard-4"` | no |
| <a name="input_max_size"></a> [max\_size](#input\_max\_size) | Maximum size of the HTCondor execute point pool. | `number` | `5` | no |
| <a name="input_metadata"></a> [metadata](#input\_metadata) | Metadata to add to HTCondor execute points | `map(string)` | `{}` | no |
| <a name="input_min_idle"></a> [min\_idle](#input\_min\_idle) | Minimum number of idle VMs in the HTCondor pool (if pool reaches var.max\_size, this minimum is not guaranteed); set to ensure jobs beginning run more quickly. | `number` | `0` | no |
| <a name="input_name_prefix"></a> [name\_prefix](#input\_name\_prefix) | Name prefix given to hostnames in this group of execute points; must be unique across all instances of this module | `string` | n/a | yes |
| <a name="input_network_self_link"></a> [network\_self\_link](#input\_network\_self\_link) | The self link of the network HTCondor execute points will join | `string` | `"default"` | no |
-| <a name="input_network_storage"></a> [network\_storage](#input\_network\_storage) | An array of network attached storage mounts to be configured | <pre>list(object({<br> server_ip = string,<br> remote_mount = string,<br> local_mount = string,<br> fs_type = string,<br> mount_options = string,<br> client_install_runner = map(string)<br> mount_runner = map(string)<br> }))</pre> | `[]` | no |
+| <a name="input_network_storage"></a> [network\_storage](#input\_network\_storage) | An array of network attached storage mounts to be configured | <pre>list(object({<br/> server_ip = string,<br/> remote_mount = string,<br/> local_mount = string,<br/> fs_type = string,<br/> mount_options = string,<br/> client_install_runner = map(string)<br/> mount_runner = map(string)<br/> }))</pre> | `[]` | no |
| <a name="input_project_id"></a> [project\_id](#input\_project\_id) | Project in which the HTCondor execute points will be created | `string` | n/a | yes |
| <a name="input_region"></a> [region](#input\_region) | The region in which HTCondor execute points will be created | `string` | n/a | yes |
-| <a name="input_service_account_scopes"></a> [service\_account\_scopes](#input\_service\_account\_scopes) | Scopes by which to limit service account attached to central manager. | `set(string)` | <pre>[<br> "https://www.googleapis.com/auth/cloud-platform"<br>]</pre> | no |
-| <a name="input_shielded_instance_config"></a> [shielded\_instance\_config](#input\_shielded\_instance\_config) | Shielded VM configuration for the instance (must set var.enabled\_shielded\_vm) | <pre>object({<br> enable_secure_boot = bool<br> enable_vtpm = bool<br> enable_integrity_monitoring = bool<br> })</pre> | <pre>{<br> "enable_integrity_monitoring": true,<br> "enable_secure_boot": true,<br> "enable_vtpm": true<br>}</pre> | no |
+| <a name="input_service_account_scopes"></a> [service\_account\_scopes](#input\_service\_account\_scopes) | Scopes by which to limit service account attached to central manager. | `set(string)` | <pre>[<br/> "https://www.googleapis.com/auth/cloud-platform"<br/>]</pre> | no |
+| <a name="input_shielded_instance_config"></a> [shielded\_instance\_config](#input\_shielded\_instance\_config) | Shielded VM configuration for the instance (must set var.enabled\_shielded\_vm) | <pre>object({<br/> enable_secure_boot = bool<br/> enable_vtpm = bool<br/> enable_integrity_monitoring = bool<br/> })</pre> | <pre>{<br/> "enable_integrity_monitoring": true,<br/> "enable_secure_boot": true,<br/> "enable_vtpm": true<br/>}</pre> | no |
| <a name="input_spot"></a> [spot](#input\_spot) | Provision VMs using discounted Spot pricing, allowing for preemption | `bool` | `false` | no |
| <a name="input_subnetwork_self_link"></a> [subnetwork\_self\_link](#input\_subnetwork\_self\_link) | The self link of the subnetwork HTCondor execute points will join | `string` | `null` | no |
| <a name="input_target_size"></a> [target\_size](#input\_target\_size) | Initial size of the HTCondor execute point pool; set to null (default) to avoid Terraform management of size. | `number` | `null` | no |
2 changes: 1 addition & 1 deletion community/modules/compute/htcondor-execute-point/main.tf
@@ -120,7 +120,7 @@ resource "google_storage_bucket_object" "execute_config" {
}

module "startup_script" {
-source = "github.com/GoogleCloudPlatform/hpc-toolkit//modules/scripts/startup-script?ref=v1.36.0&depth=1"
+source = "github.com/GoogleCloudPlatform/hpc-toolkit//modules/scripts/startup-script?ref=v1.39.0&depth=1"

project_id = var.project_id
region = var.region
@@ -25,6 +25,6 @@ terraform {
}

provider_meta "google" {
-module_name = "blueprints/terraform/hpc-toolkit:htcondor-execute-point/v1.39.0"
+module_name = "blueprints/terraform/hpc-toolkit:htcondor-execute-point/v1.40.0"
}
}
2 changes: 1 addition & 1 deletion community/modules/compute/mig/README.md
@@ -33,7 +33,7 @@ No modules.
| <a name="input_name"></a> [name](#input\_name) | Name of the MIG. If not provided, will be generated from `var.deployment_name` | `string` | `null` | no |
| <a name="input_project_id"></a> [project\_id](#input\_project\_id) | Project in which the MIG will be created | `string` | n/a | yes |
| <a name="input_target_size"></a> [target\_size](#input\_target\_size) | Target number of instances in the MIG | `number` | `0` | no |
-| <a name="input_versions"></a> [versions](#input\_versions) | Application versions managed by this instance group. Each version deals with a specific instance template | <pre>list(object({<br> name = string<br> instance_template = string<br> target_size = optional(object({<br> fixed = optional(number)<br> percent = optional(number)<br> }))<br> }))</pre> | n/a | yes |
+| <a name="input_versions"></a> [versions](#input\_versions) | Application versions managed by this instance group. Each version deals with a specific instance template | <pre>list(object({<br/> name = string<br/> instance_template = string<br/> target_size = optional(object({<br/> fixed = optional(number)<br/> percent = optional(number)<br/> }))<br/> }))</pre> | n/a | yes |
| <a name="input_wait_for_instances"></a> [wait\_for\_instances](#input\_wait\_for\_instances) | Whether to wait for all instances to be created/updated before returning | `bool` | `false` | no |
| <a name="input_zone"></a> [zone](#input\_zone) | Compute Platform zone. Required, currently only zonal MIGs are supported | `string` | n/a | yes |

2 changes: 1 addition & 1 deletion community/modules/compute/mig/versions.tf
@@ -22,6 +22,6 @@ terraform {
}
}
provider_meta "google" {
-module_name = "blueprints/terraform/hpc-toolkit:mig/v1.39.0"
+module_name = "blueprints/terraform/hpc-toolkit:mig/v1.40.0"
}
}
2 changes: 1 addition & 1 deletion community/modules/compute/notebook/README.md
@@ -76,7 +76,7 @@ No modules.
|------|-------------|------|---------|:--------:|
| <a name="input_deployment_name"></a> [deployment\_name](#input\_deployment\_name) | Name of the HPC deployment; used as part of name of the notebook. | `string` | n/a | yes |
| <a name="input_gcs_bucket_path"></a> [gcs\_bucket\_path](#input\_gcs\_bucket\_path) | Bucket name, can be provided from the google-cloud-storage module | `string` | `null` | no |
-| <a name="input_instance_image"></a> [instance\_image](#input\_instance\_image) | Instance Image | `map(string)` | <pre>{<br> "family": "tf-latest-cpu",<br> "name": null,<br> "project": "deeplearning-platform-release"<br>}</pre> | no |
+| <a name="input_instance_image"></a> [instance\_image](#input\_instance\_image) | Instance Image | `map(string)` | <pre>{<br/> "family": "tf-latest-cpu",<br/> "name": null,<br/> "project": "deeplearning-platform-release"<br/>}</pre> | no |
| <a name="input_labels"></a> [labels](#input\_labels) | Labels to add to the resource Key-value pairs. | `map(string)` | n/a | yes |
| <a name="input_machine_type"></a> [machine\_type](#input\_machine\_type) | The machine type to employ | `string` | n/a | yes |
| <a name="input_mount_runner"></a> [mount\_runner](#input\_mount\_runner) | mount content from the google-cloud-storage module | `map(string)` | n/a | yes |