Thanos-Querier: component=endpointset msg="update of node failed" err="getting metadata: fallback fetching info from test-component-metrics-grpc-consumer.test:10901: rpc error: code = DeadlineExceeded desc = context deadline exceeded" address=test-component-metrics-grpc-consumer.test:10901 #7971
Thanos, Prometheus and Golang version used:
thanos, version 0.33.0 (branch: HEAD, revision: 0f025f1)
build user: root@9fb4d3c2f7ec
build date: 20231217-16:49:45
go version: go1.21.5
platform: linux/amd64
tags: netgo
AWS EKS version: 1.29
Istio version: 1.20.2
Note: the same error also occurs with the older Thanos v0.26.0 and with the latest Thanos v0.37.0.
Object Storage Provider:
What happened:
We have a Thanos Querier running on our centralised cluster A, which queries the Thanos Store and Thanos Sidecar instances running on other EKS clusters.
We recently upgraded Istio from 1.19 to 1.20. Since then, Grafana no longer shows the latest Prometheus metrics from the other clusters, although metrics older than 2 hours are still visible. This suggests that Thanos Querier can reach Thanos Store but cannot connect to the live Thanos Sidecar, which holds the most recent Prometheus metrics.
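To narrow down whether the Sidecar endpoint is reachable at all, a plain TCP connect with a deadline can be tried from a pod in cluster A. The following is only a minimal sketch using the Python standard library; the function name `probe_tcp` and the default timeout are assumptions for illustration and are not part of Thanos or Istio. Note that inside an Istio mesh the connection may be intercepted by the local proxy, so an "open" result here does not prove that the gRPC call to the Sidecar would succeed; a "timeout" or "error" result is the more informative outcome.

```python
import socket


def probe_tcp(host: str, port: int, timeout_s: float = 5.0) -> str:
    """Attempt a plain TCP connect and classify the outcome.

    Returns "open" if the connection was established, "timeout" if the
    deadline expired first, or "error: <reason>" for any other failure
    (for example DNS resolution failure or connection refused).
    """
    try:
        # create_connection resolves the name and connects, bounded by timeout_s.
        with socket.create_connection((host, port), timeout=timeout_s):
            return "open"
    except socket.timeout:
        return "timeout"
    except OSError as exc:
        return f"error: {exc}"
```

For example, `probe_tcp("test-component-metrics-grpc-consumer.test", 10901)` checks the address from the error message; running the same probe against the Thanos Store address that still works gives a baseline for comparison.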
What you expected to happen:
The centralised Grafana should show the latest Prometheus metrics from the other clusters.
How to reproduce it (as minimally and precisely as possible):
Two EKS 1.29 Clusters
Istio 1.20
Prometheus Operator running on both clusters, with one of them acting as the centralised cluster.
Full logs to relevant components:
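The log line quoted in the issue title is in logfmt style (key=value pairs, with quoted values). As a rough sketch, it can be split into fields to make the failing address and the gRPC status code explicit. The helper names `parse_logfmt` and `grpc_status` are hypothetical, and the parser below handles only this simple form, not the full logfmt grammar.

```python
import re
import shlex

# The log line from the issue title, without the leading component name.
LOG_LINE = (
    'component=endpointset msg="update of node failed" '
    'err="getting metadata: fallback fetching info from '
    'test-component-metrics-grpc-consumer.test:10901: rpc error: '
    'code = DeadlineExceeded desc = context deadline exceeded" '
    'address=test-component-metrics-grpc-consumer.test:10901'
)


def parse_logfmt(line: str) -> dict:
    """Split a simple logfmt line into a dict of key -> value."""
    fields = {}
    # shlex.split keeps quoted values together and strips the quotes.
    for token in shlex.split(line):
        key, sep, value = token.partition("=")
        if sep:
            fields[key] = value
    return fields


def grpc_status(err: str):
    """Extract the gRPC status code name from an 'rpc error' message, if any."""
    match = re.search(r"code = (\w+)", err)
    return match.group(1) if match else None


fields = parse_logfmt(LOG_LINE)
print(fields["address"], grpc_status(fields["err"]))
```

Here the status is `DeadlineExceeded`, meaning the Querier's info request to that address did not complete before its deadline, rather than being actively refused.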
Anything else we need to know:
What we have done so far:
Thanos Querier Deployment in Cluster A
Service Entry in Cluster A
Destination Rule in Cluster A
Attached: Grafana screenshot. We cannot see the latest metrics, but metrics older than 2 hours are visible.