Skip to content

Metric for exceeding limits #2112

Discussion options

You must be logged in to vote

I've managed to test this myself and found the relevant message in the logs.

all available instance types exceed limits for nodepool:

Our Karpenter pod logs are exported to CloudWatch using FluentBit so we're able to add a metric filter on our log group with a relevant alarm

Here's the Terraform code I used, if it's helpful for anyone. I've configured a SNS topic to send our alerts to, which creates them on our OpsGenie platform.

resource "aws_cloudwatch_log_metric_filter" "karpenter_node_limit_exceeded" {
  name           = "Karpenter Node Limits Exceeded - ${var.cluster_name}"
  log_group_name = "/aws/containerinsights/${var.cluster_name}/application"
  pattern        = "{ $.log_process…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by dtwilliamsWork
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant