fix(otel): Exporter creating Monitored Resource with task_id for Cloud Run #14923

DouglasHeriot · 2025-01-10T00:49:54Z

When inside a Cloud Run environment, the MonitoredResource in a CreateTimeSeriesRequest to the Cloud Monitoring API does not include the necessary fields for the generic_task resource type, and is rejected.

Also updating the GenericNode mapping to match Golang.

This change is

Fixes googleapis#14925 When inside a Cloud Run environment, the `MonitoredResource` in a `CreateTimeSeriesRequest` to the Cloud Monitoring API does not include the necessary fields for the `generic_task` resource type, and is rejected. Should follow the well-tested Golang implementation where the `faas.instance` OTel Resource Attribute is mapped to `MonitoredResource` `task_id`. As the `service.namespace` OTel Resource Attribute is not set by the Resource Detector from within Cloud Run, it should be mapped as an empty string, rather than being left absent. https://github.com/GoogleCloudPlatform/opentelemetry-operations-go/blob/8da0f42dab085c916987891419461d583a2aa96e/internal/resourcemapping/resourcemapping.go#L153

scotthart · 2025-01-10T16:37:14Z

/gcbrun

dbolduc

Thanks for the PR!

The code looks good, although I offered an optional refactor. Unit tests for the new code paths would be nice. Either way, we need to fix the tests that are now broken.

You can run the unit test with:

bazelisk test --test_output=all //google/cloud/opentelemetry:internal_monitored_resource_test

dbolduc · 2025-01-10T17:27:25Z

google/cloud/opentelemetry/internal/monitored_resource.cc

      } else if (kv.second.fallback) {
        mr.labels[kv.first] = *kv.second.fallback;
+      } else {
+        mr.labels[kv.first] = "";


So it seems like we always fallback. Sometimes to a value like "global", but at least to "".

optional nit: It seems slightly cleaner to make fallback a std::string:

google-cloud-cpp/google/cloud/opentelemetry/internal/monitored_resource.cc

Line 55 in 644efe9

absl::optional<std::string> fallback = absl::nullopt;

and then have this code just be:

} else { mr.labels[kv.first] = kv.second.fallback; }

(If this doesn't work for whatever reason, just let me know, and I can look into cleaning it up in a future PR)

That does seem cleaner, I’ll change the fallback to default to empty string.

dbolduc · 2025-01-10T17:35:28Z

google/cloud/opentelemetry/internal/monitored_resource.cc

@@ -174,7 +176,7 @@ MonitoredResourceProvider GenericNode() {
          {"location",
           {{sc::kCloudAvailabilityZone, sc::kCloudRegion}, "global"}},
          {"namespace", {{sc::kServiceNamespace}}},
-          {"node_id", {{sc::kHostId}}},
+          {"node_id", {{sc::kHostId, sc::kHostName}}},


Good catch, thanks!

Can we have a unit test for this?

google-cloud-cpp/google/cloud/opentelemetry/internal/monitored_resource_test.cc

Line 337 in 90d72c0

{sc::kHostId, "test-instance"},

I would probably change TestCase to something like LocationTestCase, and then introduce a NodeIdTestCase, and loop over:

for (auto l : location_tests) { for (auto n : node_tests) { // verify l.expected_location and n.expected_node_id } }

dbolduc · 2025-01-10T17:38:18Z

google/cloud/opentelemetry/internal/monitored_resource.cc

+          {"job", {{sc::kServiceName, sc::kFaasName}}},
+          {"task_id", {{sc::kServiceInstanceId, sc::kFaasInstance}}},


Again, good catch and thanks.

Can we have unit tests for these too? Or at least fix the broken tests.

Also, consider leaving a comment in a test case where we fallback to the empty string. Just something that says "Verify that we fallback to the empty string if no matches are found".

Yep, I’ll add some tests for these cases.

DouglasHeriot changed the title ~~Fix OTel Exporter creating Monitored Resource for Cloud Run.~~ fix(otel): Exporter creating Monitored Resource with task_id for Cloud Run Jan 10, 2025

DouglasHeriot mentioned this pull request Jan 10, 2025

OpenTelemetry on Cloud Run does not include task_id in Monitored Resource #14925

Open

DouglasHeriot force-pushed the cloudrun branch from 1c0ab86 to 7a8688d Compare January 10, 2025 02:31

DouglasHeriot force-pushed the cloudrun branch from 7a8688d to 644efe9 Compare January 10, 2025 02:34

DouglasHeriot marked this pull request as ready for review January 10, 2025 08:13

DouglasHeriot requested a review from a team as a code owner January 10, 2025 08:13

DouglasHeriot temporarily deployed to external January 10, 2025 16:37 — with GitHub Actions Inactive

dbolduc reviewed Jan 10, 2025

View reviewed changes

dbolduc mentioned this pull request Jan 10, 2025

impl(otel): copy service resource labels into metric labels #14825

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(otel): Exporter creating Monitored Resource with task_id for Cloud Run #14923

fix(otel): Exporter creating Monitored Resource with task_id for Cloud Run #14923

DouglasHeriot commented Jan 10, 2025 •

edited

Loading

scotthart commented Jan 10, 2025

dbolduc left a comment

dbolduc Jan 10, 2025

DouglasHeriot Jan 11, 2025

dbolduc Jan 10, 2025

dbolduc Jan 10, 2025

DouglasHeriot Jan 11, 2025

		{"job", {{sc::kServiceName, sc::kFaasName}}},
		{"task_id", {{sc::kServiceInstanceId, sc::kFaasInstance}}},

fix(otel): Exporter creating Monitored Resource with task_id for Cloud Run #14923

Are you sure you want to change the base?

fix(otel): Exporter creating Monitored Resource with task_id for Cloud Run #14923

Conversation

DouglasHeriot commented Jan 10, 2025 • edited Loading

scotthart commented Jan 10, 2025

dbolduc left a comment

Choose a reason for hiding this comment

dbolduc Jan 10, 2025

Choose a reason for hiding this comment

DouglasHeriot Jan 11, 2025

Choose a reason for hiding this comment

dbolduc Jan 10, 2025

Choose a reason for hiding this comment

dbolduc Jan 10, 2025

Choose a reason for hiding this comment

DouglasHeriot Jan 11, 2025

Choose a reason for hiding this comment

DouglasHeriot commented Jan 10, 2025 •

edited

Loading