Querying job_stats on large cluster(s) takes multiple seconds

Hello, 

we have identified a performance issue with lustre_exporter when querying metrics on large Lustre file systems with a significant number of jobstats entries. The problem seems to be that the code in procfs.go repeatedly reads the same job_stats file in procfs, which results in a delay of 4-5 seconds per query.
I think it should be possible to open the file once and scan each line for the needed information, although I'm not very well versed in Go and don't know whether this would require a significant refactor of the code.
Is this issue known? Are there any workarounds?
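
To illustrate the single-pass idea, here is a minimal sketch in Go. It is not taken from lustre_exporter: the names `jobStats`, `parseJobStats`, `parseSamples` and `parseJobStatsString` are hypothetical, and the sketch assumes the usual job_stats layout in which each job starts with a `- job_id:` line followed by one `name: { samples: N, ... }` line per operation. It collects only the `samples` counter of every operation in one scan.

```go
package main

import (
	"bufio"
	"fmt"
	"io"
	"strconv"
	"strings"
)

// jobStats maps job_id -> operation name -> "samples" counter.
// Hypothetical type for this sketch, not part of lustre_exporter.
type jobStats map[string]map[string]uint64

// parseJobStats reads the stream exactly once, line by line, and
// collects the samples counter of every operation of every job.
// Assumes each job block starts with a "- job_id:" line.
func parseJobStats(r io.Reader) (jobStats, error) {
	stats := make(jobStats)
	var current map[string]uint64 // operations of the job being read

	scanner := bufio.NewScanner(r)
	for scanner.Scan() {
		line := strings.TrimSpace(scanner.Text())
		key, value, found := strings.Cut(line, ":")
		if !found {
			continue // not a "key: value" line
		}
		value = strings.TrimSpace(value)
		if key == "- job_id" {
			current = make(map[string]uint64)
			stats[value] = current
			continue
		}
		if current == nil || !strings.HasPrefix(value, "{") {
			continue // header line or scalar field such as snapshot_time
		}
		samples, err := parseSamples(value)
		if err != nil {
			return nil, fmt.Errorf("operation %q: %w", key, err)
		}
		current[key] = samples
	}
	return stats, scanner.Err()
}

// parseSamples extracts N from a "{ samples: N, unit: ..., ... }" value.
func parseSamples(value string) (uint64, error) {
	body := strings.Trim(value, "{} ")
	for _, field := range strings.Split(body, ",") {
		name, raw, found := strings.Cut(field, ":")
		if found && strings.TrimSpace(name) == "samples" {
			return strconv.ParseUint(strings.TrimSpace(raw), 10, 64)
		}
	}
	return 0, fmt.Errorf("no samples field in %q", value)
}

// parseJobStatsString is a convenience wrapper for in-memory text.
func parseJobStatsString(text string) (jobStats, error) {
	return parseJobStats(strings.NewReader(text))
}

func main() {
	// Made-up sample data in the assumed job_stats layout.
	sample := `job_stats:
- job_id:          1234
  snapshot_time:   1500000000
  read_bytes:      { samples: 10, unit: bytes, min: 4096, max: 1048576, sum: 5242880 }
  write_bytes:     { samples: 2, unit: bytes, min: 4096, max: 4096, sum: 8192 }
`
	stats, err := parseJobStatsString(sample)
	if err != nil {
		panic(err)
	}
	fmt.Println(stats["1234"]["read_bytes"], stats["1234"]["write_bytes"])
}
```

In a real collector the reader would presumably be the `*os.File` returned by a single `os.Open` on the job_stats file, so that every metric of a scrape is served from the same pass instead of re-reading the file per metric. Whether this fits the existing structure of procfs.go is something I cannot judge.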

Querying job_stats on large cluster(s) takes multiple seconds #30

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Querying job_stats on large cluster(s) takes multiple seconds #30

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions