This directory holds the Terraform modules for maintaining Notify.gov's API infrastructure. You might want to:
- Set up the Sandbox and develop Terraform,
- Maintain software versions or CI/CD,
- Learn about the directory structure, or
- Troubleshoot error messages
The Admin app repo has its own terraform directory but a lot of the below instructions apply to both apps.
📺 Video introduction to Notify infrastructure
📗 New developers start here!
Assuming initial setup is complete — which it should be if Notify.gov is online — Terraform state is stored in a shared remote backend. If you are going to be writing Terraform for any of our deployment environments you'll need to hook up to this backend. (You don't need to do this if you are just writing code for the development
module, because it stores state locally on your laptop.)
- Enter the bootstrap module with
cd bootstrap
- Run
./import.sh
to import the bucket containing remote terraform state into your local state - Follow instructions under Use bootstrap credentials
- Run
./run.sh show -json
. - In the output, locate
access_key_id
andsecret_access_key
within thebucket_creds
resource. These values are secret, so don't share them with anyone or copy them to anywhere online. - Add the following to
~/.aws/credentials
:[notify-terraform-backend] aws_access_key_id = <access_key_id> aws_secret_access_key = <secret_access_key>
- Check which AWS profile you are using with
aws configure list
. If needed, useexport AWS_PROFILE=notify-terraform-backend
to change to the profile and credentials you just added.
These credentials will allow Terraform to access the AWS/Cloud.gov bucket in which developers share Terraform state files. Now you are ready to develop Terraform using the Workflow for deployed environments.
These are the steps for developing Terraform code for our deployed environment modules (sandbox
, demo
, staging
and production
) locally on your laptop. Or for setting up a new deployment environment, or otherwise for running Terraform manually in any module that uses remote state. You don't need to do all this to run code in the development
module, because it is not a deployed environment and it does not use remote state.
Caution
There is one risky step below (apply
) which is safe only in the sandbox
environment and should not be run in any other deployed environment.
These steps assume shared Terraform state credentials exist in s3, and that you are Using those credentials.
-
cd
to the environment you plan to work in. When developing new features/resources, try out your code insandbox
. Only once the code is proven should you copy-and-paste it to each higher environment. -
Run
cf spaces
and, from the output, copy the space name for the environment you are working in, such asnotify-sandbox
. -
Next you will set up a SpaceDeployer service account instance. This is something like a stub user account, just for deployment. Note these two values which you will use both to create and destroy the account:
<SPACE_NAME>
will be the string you copied from the prior step<ACCOUNT_NAME>
can be anything, although we recommend something that communicates the purpose of the deployer. For example: "circleci-deployer" for the credentials CircleCI uses to deploy the application, or "sandbox-<your_name>" for credentials to run terraform manually.
Put those two values into this command:
../create_service_account.sh -s <SPACE_NAME> -u <ACCOUNT_NAME> > secrets.auto.tfvars
The script will output the
username
(ascf_user
) andpassword
(ascf_password
) for your<ACCOUNT_NAME>
. The cloud.gov service account documentation has more information.Some resources you might work on require a SpaceDeployer account with higher permissions. Add the
-m
flag to the command to get this.The command uses the redirection operator (
>
) to write that output to thesecrets.auto.tfvars
file. Terraform will find the username and password there, and use them as input variables. -
While still in an environment directory, initialize Terraform:
terraform init
If this command fails, you may need to run
terraform init -upgrade
to make sure new module versions are picked up. Or,terraform init -migrate-state
to bump the remote backend. -
Then, run Terraform in a non-destructive way:
terraform plan
This will show you any pending changes that Terraform is ready to make.
📝 Now is the time to write any HCL code (aka Terraform code) you are planning to write, re-running
terraform plan
to confirm that the code works as you develop. Keep in mind that any changes to the codebase that you commit will be run by the CI/CD pipeline. -
Only if it is safe to do so, apply your changes.
💀 Applying changes in the wrong directory can mess up a deployed environment that people are relying on
Double-check what directory you are in, like with the
pwd
command. You should probably only apply while in thesandbox
directory / environment.Once you are sure it is safe, run:
terraform apply
This command will deploy your changes to the cloud. This is a healthy part of testing your code in the sandbox, or if you are creating a new environment (a new directory). Do not apply in environments that people are relying upon.
If you need to go on to deploy application code on top of the resources you just instantiated, you will use
cf push
-
Remove the space deployer service instance when you are done manually running Terraform.
# <SPACE_NAME> and <ACCOUNT_NAME> have the same values as used above. ./destroy_service_account.sh -s <SPACE_NAME> -u <ACCOUNT_NAME>
List
cf services
if you are unsure which space deployer service instances still existOptionally, you can also
rm secrets.auto.tfvars
These version numbers are hardcoded in Terraform or shell scripts. We should periodically check them for upgrades.
- Cloud Foundry Terraform plugin in every module in the API and Admin apps, here for example.
- The terraform-cloudgov module, the version of which is referred to serveral times in most modules, here for example.
- Cloud Service Broker (CSB) version in the SMS and the SMTP download scripts of the usnotify-ssb repo.
- SMS and SMTP brokerpak versions, also in the download scripts of the usnotify-ssb repo. (And we may have to help maintain the SMTP brokerpak project itself.)
- The version of Redis used in deployed environment modules, here for example. To upgrade, the resource must be destroyed and replaced. The versions supported are limited by Cloud.gov.
- A required minimum version of Terraform is noted in every providers.tf file, here for example. It would be best to keep it in sync with the version used by our CI/CD deployment pipeline. But, it does not need to be updated with every new Terraform release.
📺 Some of these version upgrades are discussed in our video introduction.
A SpaceDeployer account is required to run terraform or deploy the application from the CI/CD pipeline. During CI/CD maintenance you might need to create a new account:
./create_service_account.sh -s <SPACE_NAME> -u <ACCOUNT_NAME>
SpaceDeployers are also needed to run Terraform locally — they fill user and password input variables (via deployers
within main.tf
) that some of our Terraform modules require when they start running. Using a SpaceDeployer account locally is covered in Workflow for deployed environments.
The terraform
directory contains sub-directories (staging
, production
, etc.) named for deployment environments. Each of these is a module, which is just Terraform's word for a directory with some .tf files in it. Each module governs the infrastructure of the environment for which it is named. This directory structure forms "bulkheads" which isolate Terraform commands to a single environment, limiting accidental damage.
The development
module is rather different from the other environment modules. While the other environments can be used to create (or destroy) cloud resources, the development module mostly just sets up access to pre-existing resources needed for local software development.
The bootstrap
directory is not an environment module. Instead, it sets up infrastructure needed to deploy Terraform in any of the environments. If you are new to the project, this is where you should start.
Similarly, shared
is not an environment. It is a module that lends code to all the environments. Please note that changes to shared
codebase will be applied to all envrionments the next time CI/CD (or a user) runs Terraform in that environment.
Warning
Editing shared
code is risky because it will be applied to production
Files within these directories look like this:
- bootstrap/
|- main.tf
|- providers.tf
|- variables.tf
|- run.sh
|- teardown_creds.sh
|- import.sh
- <env>/
|- main.tf
|- providers.tf
|- secrets.auto.tfvars
|- variables.tf
In the environment-specific modules:
providers.tf
lists the required providersmain.tf
calls the shared Terraform code, but this is also a place where you can add any other services, resources, etc, which you would like to set up for that environmentvariables.tf
lists the variables that will be needed, either to pass through to the child module or for use in this modulesecrets.auto.tfvars
is a file which contains the information about the service-key and other secrets that should not be shared
In the bootstrap module:
providers.tf
lists the required providersmain.tf
sets up s3 bucket to be shared across all environments. It lives inprod
to communicate that it should not be deletedvariables.tf
lists the variables that will be needed. Most values are hard-coded in this modulerun.sh
Helper script to set up a space deployer and run terraform. The terraform action (show
/plan
/apply
/destroy
) is passed as an argumentteardown_creds.sh
Helper script to remove the space deployer setup as part ofrun.sh
import.sh
Helper script to create a new local state file in case terraform changes are needed
The token expired, was revoked, or the token ID is incorrect. Please log back in to re-authenticate.
You need to re-authenticate with the Cloud Foundry CLI
cf login -a api.fr.cloud.gov --sso
You may also need to log in again to the Cloud.gov website.
Error: You are not authorized to perform the requested action
This error indicates that the Cloud Foundry user account (or service account) needs OrgManager permissions to take the action.
- When you create a SpaceDeployer service account, use the
-m
flag when running the./create_service_account.sh
script - Your own CF user may may also require OrgManager permissions to run the script
You have exceeded your organization's services limit.
Too many Cloud Foundry services have been created without being destroyed. Perhaps Terraform developers have forgotten to delete their SpaceDeployers after they finish with them. List cf services
to see.
Error: Service Instance xx-name-xx failed xx-UUID-xx, reason: [Job (xx-UUID-xx) failed: An unknown error occurred.]
This unhelpful message may be clarified by looking in the Cloud.gov web UI. Among the list of service instances (Cloud Foundry → Organizations → gsa-tts-benefits-studio → Spaces → your-space-name → Service instances) check for pending or erroring items. Refer below if you discover a domain identity verification error.
The audit event logs may also provide insight. They are visible in web UI or in the terminal.
Error: Error creating SES domain identity verification: Expected domain verification Success, but was in state Pending
This error comes via the Supplementary Service Broker and originates from the SMTP Brokerpak it uses. You can run the broker provisioning locally to tinker with the error.
Error: validating provider credentials: retrieving caller identity from STS: operation error STS: GetCallerIdentity, https response error StatusCode: 403
The steps in Use bootstrap credentials may not be complete. Or the AWS CLI may have reverted to the default profile, in which case, re-run:
export AWS_PROFILE=notify-terraform-backend
Error: No valid credential sources found
Please see https://www.terraform.io/docs/language/settings/backends/s3.html for more information about providing credentials.
Error: failed to refresh cached credentials, no EC2 IMDS role found, operation error ec2imds: GetMetadata, request canceled, context deadline exceeded
You are not hooked up to the remote backend that stores Terraform state Run steps in Retrieving existing bucket credentials.
# module.egress-space.cloudfoundry_space_users.deployers will be updated in-place
~ resource "cloudfoundry_space_users" "deployers" {
~ developers = [
- "xxx-GUID-xxx",
+ "yyy-GUID-yyy",
The environment was last deployed by someone other than you, using a different Space Deployer account. If you are working in the Sandbox environment, this is fine; go ahead and apply the changes. After you do, the other person evidently also working in the Sandbox env will then see the same message. The two of you might play tug-of-war with different GUIDs, but this is inconsequential.