Description
Steps to reproduce the problem:
- Deploy Harbor v2.14.0 via the harbor-helm v1.18.0 Helm chart & enable Harbor Prometheus metrics
- Intentionally make some of the Harbor components fail to start, for example:
- For the portal component, intentionally provide an invalid Nginx config directive in the harbor-portal ConfigMap; diff example below:
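The original diff was attached as a screenshot; a hypothetical equivalent change (the directive name here is made up purely to be invalid) might look like:

```diff
 data:
   nginx.conf: |+
     worker_processes auto;
+    not_a_real_directive on;   # invalid directive: nginx refuses to start
     pid /tmp/nginx.pid;
```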

- For the registry component intentionally provide some invalid option in the harbor-registry ConfigMap, diff example below:
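Likewise for the registry, the original diff was a screenshot; a hypothetical stand-in that sets a real option to an invalid value so the registry process fails on startup:

```diff
 data:
   config.yml: |+
     version: 0.1
     log:
-      level: info
+      level: not-a-log-level   # invalid log level: registry exits on startup
```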

- Restart both the harbor-portal and harbor-registry pods so that they load their new configs and end up in CrashLoopBackOff
- Now issue a harbor_up Prometheus query (either in the Prometheus UI, or via a new Prometheus alert that uses this metric)
- Check the reported container, pod, and service names
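The steps above can be driven by an alerting rule such as the following minimal sketch (the group and alert names are made up); note that with the current behavior, every label copied into the annotations would describe the exporter rather than the failing component:

```yaml
groups:
  - name: harbor.rules            # hypothetical rule group name
    rules:
      - alert: HarborComponentDown
        expr: harbor_up == 0
        for: 5m
        labels:
          severity: critical
        annotations:
          summary: "Harbor component {{ $labels.component }} is down"
          # With the current behavior these three labels all point at the
          # exporter pod, not at the failing component's pod/container/service.
          description: "pod={{ $labels.pod }} container={{ $labels.container }} service={{ $labels.service }}"
```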
Actual behavior:
The portal, registry, and registryctl components are correctly reported as failing.
However, wrong/misleading information is reported in the container, pod, and service labels: "exporter" is reported as the container, "harbor-exporter-.*" as the pod, and "harbor-exporter" as the service name for all components.
See the output below (the instance IP & namespace were intentionally redacted):
Load time: 307ms · Result series: 8

```
harbor_up{component="core", container="exporter", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-exporter-5f6b484c6-rpp7h", service="harbor-exporter"} 1
harbor_up{component="database", container="exporter", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-exporter-5f6b484c6-rpp7h", service="harbor-exporter"} 1
harbor_up{component="jobservice", container="exporter", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-exporter-5f6b484c6-rpp7h", service="harbor-exporter"} 1
harbor_up{component="portal", container="exporter", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-exporter-5f6b484c6-rpp7h", service="harbor-exporter"} 0
harbor_up{component="redis", container="exporter", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-exporter-5f6b484c6-rpp7h", service="harbor-exporter"} 1
harbor_up{component="registry", container="exporter", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-exporter-5f6b484c6-rpp7h", service="harbor-exporter"} 0
harbor_up{component="registryctl", container="exporter", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-exporter-5f6b484c6-rpp7h", service="harbor-exporter"} 0
harbor_up{component="trivy", container="exporter", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-exporter-5f6b484c6-rpp7h", service="harbor-exporter"} 1
```
Note: when you look at the harbor-exporter pod, there is no container named 'portal', 'registry', or 'registryctl' in it.
Expected behavior:
- For the failing portal component, the failing harbor-portal-7c49df4b68-qphzg pod should be reported in the pod field, portal in the container field, and harbor-portal in the service field. In other words, for every failing component the corresponding real pod, container, and service names should be reported, instead of the exporter "placeholder" used currently.
- Analogously, for the failing registry component, "registry" should be reported as the container, the real registry pod name as the pod, and "harbor-registry" as the service name.
If I were to adjust the current metric output to the proposed one (so that it better reflects the situation in the K8s namespace), it would look as follows (note the changed container, pod, and service label values):
Load time: 307ms · Result series: 8

```
harbor_up{component="core", container="core", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-core-5b4f678f4d-tgpt8", service="harbor-core"} 1
harbor_up{component="database", container="database", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-database-0", service="harbor-database"} 1
harbor_up{component="jobservice", container="jobservice", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-jobservice-65cbd58bbd-pnjq5", service="harbor-jobservice"} 1
harbor_up{component="portal", container="portal", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-portal-7c49df4b68-qphzg", service="harbor-portal"} 0
harbor_up{component="redis", container="redis", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-redis-0", service="harbor-redis"} 1
harbor_up{component="registry", container="registry", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-registry-5d479956bb-nq9jm", service="harbor-registry"} 0
harbor_up{component="registryctl", container="registryctl", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-registry-5d479956bb-nq9jm", service="harbor-registry"} 0
harbor_up{component="trivy", container="trivy", endpoint="http-metrics", instance="<<redacted>>", job="harbor", namespace="<<redacted>>", pod="harbor-trivy-0", service="harbor-trivy"} 1
```
Versions:
Please specify the versions of the following systems.
- harbor version: [2.14.0]
- harbor-helm version: [1.18.0]
Additional context:
Suppose you want to define Prometheus alerting rules for Harbor, and for each firing alert you want to provide as much detailed information as possible about the failing component.
When e.g. the portal or registry Harbor components are failing, the kube_pod_container_status_restarts_total metric correctly reports "harbor-portal-7c49df4b68-qphzg" and "harbor-registry-5d479956bb-nq9jm" as the failing pods, so you can point users to those pods' logs to investigate the failure further.
But in the very same scenario, the harbor_up metric reports the aforementioned "exporter" values as the container, pod, and service names, so it is not possible to direct users to the actual pod, container, or service for further investigation.
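Until the labels are fixed, the closest workaround I can see is deriving a pod-name prefix from the component label and matching it against pod names from other metrics by regex; a rough sketch (the pod_prefix label name and the "harbor-" release prefix are assumptions):

```promql
# harbor_up's own pod label points at the exporter pod, so a direct join on
# pod is impossible; derive a pod-name prefix from the component label instead:
label_replace(harbor_up == 0, "pod_prefix", "harbor-$1", "component", "(.+)")
```

This is fragile (it only works if the release is named "harbor" and the component names map 1:1 to pod-name prefixes), which is why fixing the labels at the source is preferable.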
- Harbor config files: not needed; see the pictures above for the sample harbor-portal & harbor-registry ConfigMap modifications.
- Log files: not relevant here either; the issue is in the reported metric output.