-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Missing BottlerocketShadow CRD causes excessive logging #478
Comments
The excessive logging and controller API calls can occur in other situations as well. I've seen this cause the |
Thanks for opening this issue. We're looking at the issue now and will reach out to you back soon. For now, we will investigate and work on reducing the logs. Meanwhile, for any versions <=v1.1.0, we recommend to use this installation instead of helm chart since we haven't released a new version which contains the new feature helm yet (but we will release it soon). @jackgill can you share with us more what was your situation? thanks |
@gthao313 my situation is a bit different, as far as I know the CRD is installed fine. (And I used the manifest file to install, not the helm chart) I believe the problem is that the API server has an invalid TLS cert for some reason, for instance trying to list the BottleRocketShadows has this error:
I can open a separate issue for the TLS problem if you want, I'm still hoping I can figure it out somehow. The reason that I commented here is that the brupop controller is logging this error thousands of times per minute:
This makes it more difficult to troubleshoot because I have to remove the controller to avoid overwhelming my logging infrastructure. Regardless of what the root cause of my problem turns out to be, brupop should not have such a high volume of retries and error messages. |
@jackgill yeah, we will work on reducing the logs! Can you open a new issue about TLS problem? I think that maybe better to track the issue : ) Meanwhile can you share with us what brupop container version you were using? |
@gthao313 thanks! |
The fixes for this will be released with Brupop 1.3.0. Please feel free to follow the tracking issue for that release: #508 Thanks again for the report and all of the supporting data! |
https://github.com/bottlerocket-os/bottlerocket-update-operator/issues?q=is%3Aissue+backoff
Image I'm using:
v1.0.0
Issue or Feature Request:
We've started using the helm chart on the develop branch. We missed that the shadow chart had to be installed. When the operator started it rightly wrote error messages because the API didn't exist. We didn't see the error because the controller went healthy.
We saw around 53.6 million error messages in a 7 minute window and a peak of 130K messages / sec when the bottlerocket shadow CRD was not installed.
I would have expected the controller to not be healthy --or-- there's exponential backoffs used when these kind of errors are hit (could potentially DoS the kube-api server too). Controllers I've written in the past used backoff to prevent crazy logs and hammering the API. I saw that at least one of the errors was had a static backoff of 5 secs, but there's some others that seem to have no backoff.
Slightly better view of logged message patterns:
The text was updated successfully, but these errors were encountered: