Start reporting a few process metrics, like CPU and memory usage for backend platforms #49178

danielkhan · 2023-05-10T09:21:56Z

As we look into extending the capabilities of our SDKs, I would like to collect some feedback on the topic of process metrics.

Thesis

Metrics like memory or CPU usage can provide insights into the overall health of an application.

CPU metrics can reveal expensive work in userland, especially when compared between releases.
Memory metrics can show the overall memory footprint of an application and help identify memory leaks.

How could we send such metrics to Sentry?

Each backend platform provides some API to collect such metrics from userland.
These metrics don't directly relate to traces or transactions, and it would also provide little value to try to track them in the context of a trace.
To avoid creating a dedicated metrics ingest, we could use transactions to report them by adding them to spans. The data volume and performance impact would be negligible, especially if we limit how frequently the collection happens.
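
As a rough illustration of the idea above, here is a minimal Python sketch that reads process CPU time and peak memory from the standard library and only produces a sample when a minimum interval has passed. The class name, the `process.*` key names, and the use of a plain dict to stand in for a span's data are all assumptions made for this example; none of this is an existing SDK API.

```python
import os
import time


class ProcessMetricsSampler:
    """Hypothetical helper: samples process metrics at most once per interval."""

    def __init__(self, min_interval_s=10.0, clock=time.monotonic):
        self.min_interval_s = min_interval_s
        self._clock = clock
        self._last_sample_at = None

    def sample(self):
        """Return a metrics dict, or None if the previous sample is too recent."""
        now = self._clock()
        if (
            self._last_sample_at is not None
            and now - self._last_sample_at < self.min_interval_s
        ):
            return None
        self._last_sample_at = now

        cpu = os.times()  # cumulative user/system CPU seconds of this process
        metrics = {
            # Key names are assumptions, not an agreed schema.
            "process.cpu.user_s": cpu.user,
            "process.cpu.system_s": cpu.system,
        }
        try:
            import resource  # only available on Unix-like platforms

            usage = resource.getrusage(resource.RUSAGE_SELF)
            # Peak resident set size: kilobytes on Linux, bytes on macOS.
            metrics["process.memory.max_rss"] = usage.ru_maxrss
        except ImportError:
            pass
        return metrics


def attach_process_metrics(span_data, sampler):
    """Merge a sample into a span's data dict if one is due.

    `span_data` is a plain dict standing in for whatever the SDK exposes.
    """
    metrics = sampler.sample()
    if metrics is not None:
        span_data.update(metrics)
    return span_data
```

Calling `attach_process_metrics` whenever a span finishes would add metrics to at most one span per interval, which keeps the extra payload small.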

What would we do with this data?

Relay would need to extract these metrics to make them available. We would need to explore further how to embed them into our performance product and detect anomalies automatically.

Caveats

If we choose the span transport, metrics can only be reported when spans are created, that is, while the application is handling load.
This means that for apps with very little traffic, creating a continuous time series would be impossible.

Request for feedback

As I said, we are currently discussing this idea and would love your feedback.

Is this something you'd like to see in Sentry?
Do you think the proposed approach sounds feasible?
Which features would you like to see on top of such metrics?
Are there any other metrics you'd like to see in Sentry?

indragiek (Maintainer) · 2023-05-11T12:13:43Z

The Profiling team started collecting CPU and memory metrics on Android and iOS as part of the profile payload. The schema for "measurements" (time series metrics that are collected over the timeframe of a profile) is defined here: https://github.com/getsentry/relay/blob/fcbd996ace227a9fe9c69b736db91482e5e178d5/relay-profiling/src/measurements.rs#L6.

The MDX team has also started working on a proof-of-concept to display the CPU & memory metrics for Android in the transaction details view: #46532

We are interested in an effort to expand support for this across other platforms.

dsoprea · 2023-12-19T13:35:12Z

I'd love to see some minimal memory profiling (minimal in that it probably wouldn't be much more than simple quantities, so as not to have a performance impact) and maybe some thresholds to trigger warnings.
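
For illustration only, a threshold check of this kind could look like the Python sketch below. The function name, the level names, and the rule that several consecutive samples must exceed a threshold are assumptions made for this example, not an existing Sentry feature.

```python
def classify_memory(samples_bytes, warn_bytes, critical_bytes, min_consecutive=3):
    """Return "ok", "warning", or "critical" for a series of memory samples.

    A level is only reported when the last `min_consecutive` samples all reach
    its threshold, so a single spike does not trigger a warning.
    """
    if min_consecutive < 1:
        raise ValueError("min_consecutive must be at least 1")
    recent = samples_bytes[-min_consecutive:]
    if len(recent) < min_consecutive:
        return "ok"  # not enough data yet to judge
    if all(value >= critical_bytes for value in recent):
        return "critical"
    if all(value >= warn_bytes for value in recent):
        return "warning"
    return "ok"


MB = 1024 * 1024
level = classify_memory(
    [300 * MB, 520 * MB, 540 * MB, 560 * MB],
    warn_bytes=500 * MB,
    critical_bytes=800 * MB,
)
```

Here `level` evaluates to `"warning"` because the last three samples are all above the 500 MB warning threshold but below the critical one.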

shaedrich · 2023-12-20T14:17:32Z

Taking one SDK, Laravel, as an example: there are quite a few metrics that Laravel developers can already collect via, let's say, Pulse:

Users online
CPU percentage/absolute
RAM percentage/absolute
Storage available/used
Cache hits/misses
Queue idle/processing
Requests
