Skip to content

Prometheus metrics for workers last seen or workers heartbeat. #6004

@mehrdad-khojastefar

Description

@mehrdad-khojastefar

Is your feature request related to a problem? Please describe.
Right now the only way to get notified if a worker is up or down is using the boundary's api and find the worker. It's somehow frustrating since the boundary has rich metrics and telemetry and we have some alerts using alertmanager.

Describe the solution you'd like
That would be very nice if we had a worker heartbeat or worker last seen time metric, although I think metric's cardinality is something to consider here. That being said, it would be very helpful to have this feature.

Describe alternatives you've considered
To get around this issue for now, we've written a crawler that scrapes the workers' endpoint and then exposes metrics.

Explain any additional use-cases

Additional context
I would love to help developing this feature.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions