Skip to content

Conversation

ErwanAliasr1
Copy link
Collaborator

Thermal metrics are used in some hwgraph to study the thermal evolution during a given benchmark but also mixed with other metrics like fan speeds or power consumption.

The current code avoids noisy graphs by limiting the thermal sensors list to CPU / DIMM & Inlet.

In a recent study, while analyzing Fan's behavior, it was impossible to study the impacts of an increased rotation speeds on other components like VR or OCP Nics.

HPE servers usually has ~20 sensors being reported in the redfish endpoints and not collecting them prevents deeper thermal analysis.

Instead of filtering components in hwbench, let's collect all the sensors and let hwgraph filter out some if needed.

Dell's specific code is not filtering thermal sensors, so removing this HPE-specific code will unify rendering across vendors.

This commit:

  • removes the explicit component filtering
  • collects all thermal sensors
  • update the test suite to match this new behavior

A dedicated hwgraph commit will come later to manage the thermal filtering.

Thermal metrics are used in some hwgraph to study the thermal evolution
during a given benchmark but also mixed with other metrics like fan speeds or power consumption.

The current code avoids noisy graphs by limiting the thermal sensors
list to CPU / DIMM & Inlet.

In a recent study, while analyzing Fan's behavior, it was impossible to
study the impacts of an increased rotation speeds on other components
like VR or OCP Nics.

HPE servers usually has ~20 sensors being reported in the redfish
endpoints and not collecting them prevents deeper thermal analysis.

Instead of filtering components in hwbench, let's collect all the
sensors and let hwgraph filter out some if needed.

Dell's specific code is not filtering thermal sensors, so removing this
HPE-specific code will unify rendering across vendors.

This commit:
- removes the explicit component filtering
- collects all thermal sensors
- update the test suite to match this new behavior

A dedicated hwgraph commit will come later to manage the thermal
filtering.

Signed-off-by: Erwan Velu <[email protected]>
Copy link
Contributor

@anisse anisse left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@anisse
Copy link
Contributor

anisse commented Oct 17, 2025

A dedicated hwgraph commit will come later to manage the thermal filtering.

Did you make progress on this front?

@ErwanAliasr1 ErwanAliasr1 merged commit 16601f8 into main Oct 17, 2025
4 checks passed
@ErwanAliasr1 ErwanAliasr1 deleted the thermal branch October 17, 2025 09:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants