Document how to monitor and interpret Waku node health #165

Open
jm-clius opened this issue Feb 16, 2024 · 4 comments

Comments

@jm-clius
Contributor

We need to document how operators or encapsulating applications can monitor the health of their Waku nodes and how to interpret monitoring results for various use cases.

This is primarily a documentation exercise, but it may require some engineering to ensure the appropriate metrics and information are retrievable from the node.

The necessary health information needs to be available both via the REST API and in any of the software bindings (Nim, C, etc.).

As a starting point, the following information from a node should be monitorable, and we should provide clear, documented guidelines on how to interpret it:

  1. Is the Waku node running?
  2. What protocols are currently mounted on the node?
  3. If relay is mounted, how many peers are connected for each pubsub topic?
  4. If relay is mounted, what is the current relay bandwidth for each pubsub topic?
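
To make the discussion more concrete, here is a minimal sketch of how an operator script might interpret the four items above once they are retrievable. Everything in it is an assumption for illustration: `NodeSnapshot`, `interpret`, the field names, the default threshold of 4 relay peers, and the example topic string are hypothetical and do not come from the nwaku or go-waku APIs. How the snapshot gets filled (REST API or bindings) is deliberately left out.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List, Optional, Tuple


class Health(Enum):
    NOT_RUNNING = "not running"
    DEGRADED = "degraded"
    HEALTHY = "healthy"


@dataclass
class NodeSnapshot:
    """Hypothetical container for the four monitorable items listed above."""
    running: bool
    mounted_protocols: List[str] = field(default_factory=list)
    # pubsub topic -> number of connected relay peers
    relay_peers: Dict[str, int] = field(default_factory=dict)
    # pubsub topic -> current relay bandwidth in bytes per second
    relay_bandwidth_bps: Dict[str, float] = field(default_factory=dict)


def interpret(
    snapshot: NodeSnapshot,
    min_relay_peers: int = 4,  # placeholder threshold, not a documented value
    max_bandwidth_bps: Optional[float] = None,  # None disables the bandwidth check
) -> Tuple[Health, List[str]]:
    """Return an overall status plus human-readable reasons for any degradation."""
    if not snapshot.running:
        return Health.NOT_RUNNING, ["node is not running"]

    reasons: List[str] = []
    if not snapshot.mounted_protocols:
        reasons.append("no protocols are mounted")

    # Per-topic checks only apply when relay is mounted.
    if "relay" in snapshot.mounted_protocols:
        if not snapshot.relay_peers:
            reasons.append("relay is mounted but no pubsub topics were reported")
        for topic, peers in sorted(snapshot.relay_peers.items()):
            if peers < min_relay_peers:
                reasons.append(
                    f"{topic}: {peers} relay peer(s), expected at least {min_relay_peers}"
                )
        if max_bandwidth_bps is not None:
            for topic, bps in sorted(snapshot.relay_bandwidth_bps.items()):
                if bps > max_bandwidth_bps:
                    reasons.append(
                        f"{topic}: relay bandwidth {bps:.0f} B/s exceeds {max_bandwidth_bps:.0f} B/s"
                    )

    return (Health.DEGRADED if reasons else Health.HEALTHY), reasons


if __name__ == "__main__":
    # Illustrative values only; the topic string is just an example.
    snapshot = NodeSnapshot(
        running=True,
        mounted_protocols=["relay", "store"],
        relay_peers={"/waku/2/rs/1/0": 2},
        relay_bandwidth_bps={"/waku/2/rs/1/0": 1500.0},
    )
    status, reasons = interpret(snapshot)
    print(status.value, reasons)
```

Returning the reasons alongside the status is a design choice rather than a requirement: it lets an encapsulating application show why a node is considered degraded, which is the "how to interpret" half of this issue. The thresholds are parameters because sensible values will differ per use case.
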
@romanzac

@jm-clius Thanks a lot for opening this. It would also be great if any interested operators or integrators could comment on what information they would like to have and how they would like to interact with a Waku node. The Vac QA team could then test what we agree on, saving integrators time and trouble.

@chaitanyaprem
Contributor

Tagging a related issue about applications monitoring node health, waku-org/go-waku#1021, which was done for Status.
See also status-im/status-go#4628, which gives a brief overview of how apps can interpret node health.
Note that the above covers static sharding scenarios; a similar approach can be abstracted for users of the autosharded network as well.

@romanzac

Adding an issue that proposes a solution for some scenarios:
waku-org/go-waku#921

@vpavlin
Member

vpavlin commented Mar 13, 2024

  • Is the Waku node running?

This is a bit vague - what does "node running" mean?

I think the following items make sense, though: which protocols are mounted, plus details about each, such as the number of peers and some info on bandwidth.

Do we need to prioritize waku-org/nwaku#2173 to get the detailed info into the REST API?
