add check to access agent-metrics endpoint #151

scott-cotton · 2024-12-19T13:50:27Z

This is intended as minimal change to fix

https://github.com/signadot/signadot/issues/5172

tested with warp, it causes restarts and we are able to connect whereas previously it hangs

regarding discussion around:

do we need to make this visible in status?
I changed it so that it is more visible in status, returning unhealthy when a restart is needed instead of healthy.
do we need to make check period configurable?

I think not, because it is actually complicated because there 2 check periods for healthy and unhealthy state. Also, the check period relates to the timeout for the added checking request.

daniel-de-vera · 2024-12-20T14:31:38Z

internal/locald/rootmanager/tp_monitor.go

+		cli := &http.Client{
+			Transport: &http.Transport{},
+		}


Maybe it is worth configuring a timeout here?

I added a timeout and made a couple of other changes and fixes:

access agent-metrics (we don't have any health endpoints exposed, this was always restarting)

make the health more pessimistic, return unhealthy when restart is needed

daniel-de-vera

LGTM, just left one comment.

This is intended as minimal change to fix signadot/signadot#5172 tested with warp, it causes restarts and we are able to connect whereas previously it hangs still to consider: - do we need to make this visible in status? - do we need to make check period configurable? add check to agent-metrics endpoints in tp monitor

scott-cotton · 2024-12-20T15:17:27Z

LGTM, just left one comment.

thanks, I made a couple of changes after more testing. could you look again?

daniel-de-vera · 2024-12-20T16:18:02Z

LGTM, just left one comment.

thanks, I made a couple of changes after more testing. could you look again?

Yes, still LGTM.

scott-cotton · 2024-12-20T16:36:40Z

More testing revealed the status visibility pessimism is too pessimistic sometimes, looking at that.

scott-cotton requested review from foxish and daniel-de-vera December 19, 2024 13:50

daniel-de-vera reviewed Dec 20, 2024

View reviewed changes

daniel-de-vera approved these changes Dec 20, 2024

View reviewed changes

scott-cotton force-pushed the local-status-improvements branch from 7e8ac89 to ed05c99 Compare December 20, 2024 15:08

scott-cotton changed the title ~~add check to access agent readiness endpoint~~ add check to access agent-metrics endpoint Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add check to access agent-metrics endpoint #151

add check to access agent-metrics endpoint #151

scott-cotton commented Dec 19, 2024 •

edited

Loading

daniel-de-vera Dec 20, 2024

scott-cotton Dec 20, 2024

daniel-de-vera left a comment

scott-cotton commented Dec 20, 2024

daniel-de-vera commented Dec 20, 2024

scott-cotton commented Dec 20, 2024

add check to access agent-metrics endpoint #151

Are you sure you want to change the base?

add check to access agent-metrics endpoint #151

Conversation

scott-cotton commented Dec 19, 2024 • edited Loading

daniel-de-vera Dec 20, 2024

Choose a reason for hiding this comment

scott-cotton Dec 20, 2024

Choose a reason for hiding this comment

daniel-de-vera left a comment

Choose a reason for hiding this comment

scott-cotton commented Dec 20, 2024

daniel-de-vera commented Dec 20, 2024

scott-cotton commented Dec 20, 2024

scott-cotton commented Dec 19, 2024 •

edited

Loading