DEBUG-3700 Telemetry integration test #4588

p-datadog · 2025-04-16T14:52:19Z

What does this PR do?

Adds integration tests for agent and agentless telemetry

Motivation:
Test coverage for upcoming code changes to the core transport within telemetry

Change log entry
None

Additional Notes:
I added a wait to the termination code for the async worker. Without it, the workers were not terminated within the test's scope and the thread leak checker complained.

How to test the change?
This PR is tests only.

pr-commenter · 2025-04-16T15:14:51Z

Benchmarks

Benchmark execution time: 2025-04-23 18:03:40

Comparing candidate commit d1e9ac4 in PR branch telemetry-integration-test with baseline commit 5de3de1 in branch master.

Found 3 performance improvements and 0 performance regressions! Performance is the same for 28 metrics, 2 unstable metrics.

scenario:tracing - 100 span trace - no writer

🟩 throughput [+17.465op/s; +18.062op/s] or [+5.640%; +5.833%]

scenario:tracing - Propagation - Datadog

🟩 throughput [+3075.705op/s; +3149.385op/s] or [+10.645%; +10.900%]

scenario:tracing - Tracing.log_correlation

🟩 throughput [+10511.171op/s; +10803.684op/s] or [+10.519%; +10.812%]

datadog-datadog-prod-us1 · 2025-04-16T15:19:54Z

Datadog Report

Branch report: telemetry-integration-test
Commit report: d1e9ac4
Test service: dd-trace-rb

❌ 2 Failed (0 Known Flaky), 20477 Passed, 1363 Skipped, 3m 45.36s Total Time

❌ Failed Tests (2)

Telemetry integration tests agentful first run sends startup payloads - rspec

 expected: 2
      got: 0
 
 (compared using ==)
 
 Failure/Error: expect(sent_payloads.length).to eq 2
 
   expected: 2
        got: 0
 
 ...

Telemetry integration tests agentful not first run sends expected payload - rspec

 expected: 1
      got: 0
 
 (compared using ==)
 
 Failure/Error: expect(sent_payloads.length).to eq 1
 
   expected: 1
        got: 0
 
 ...

Strech

I have a few questions about the new code and a few suggestions about the tests.

Strech · 2025-04-24T11:53:05Z

spec/datadog/core/telemetry/integration/telemetry_spec.rb

+      it 'sends expected payload' do
+        component.error('test error')
+
+        sleep 1


Is there a way for us to force a flush instead of sleeping? Sleeping could be flaky.
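If no flush hook is available, one common alternative to a fixed `sleep 1` is to poll the condition with a deadline. The sketch below is only an illustration: `wait_until` is a hypothetical helper, not something dd-trace-rb is known to provide, and the background thread is a stand-in for the asynchronous telemetry worker filling `sent_payloads`.

```ruby
# Hypothetical helper (an assumption, not part of dd-trace-rb): polls the
# block until it returns a truthy value or `timeout` seconds have passed.
# Returns the block's truthy result, or false on timeout.
def wait_until(timeout: 2.0, interval: 0.01)
  deadline = Process.clock_gettime(Process::CLOCK_MONOTONIC) + timeout
  loop do
    result = yield
    return result if result
    return false if Process.clock_gettime(Process::CLOCK_MONOTONIC) >= deadline

    sleep interval
  end
end

# Stand-in for the spec's `sent_payloads`; a thread simulates the async worker.
sent_payloads = []
producer = Thread.new do
  sleep 0.05
  sent_payloads << { 'request_type' => 'logs' }
end

# Returns as soon as the payload arrives instead of always waiting a full second.
delivered = wait_until { sent_payloads.length == 1 }
producer.join
puts "delivered=#{delivered.inspect}"
```

The test then waits only as long as delivery actually takes, and a slow CI machine gets the full timeout before the assertion fails.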

Strech · 2025-04-24T11:54:52Z

spec/datadog/core/telemetry/integration/telemetry_spec.rb

+
+  let(:sent_payloads) { [] }
+
+  shared_examples 'telemetry' do


I find shared examples tedious to debug and to read, and personally I'm against such DRY for the sake of DRY. Not a blocker, though.

P.S. Yes, I would prefer copy-paste here; more obvious is better.

Strech · 2025-04-24T11:57:04Z

spec/datadog/core/telemetry/integration/telemetry_spec.rb

+    end
+  end
+
+  context 'agentful' do


Just a minor note here: I think a description phrased as a sentence would be a great adjustment, like

context 'when setup is using standard agent mode' do

or so

Strech · 2025-04-24T11:58:09Z

spec/datadog/core/telemetry/integration/telemetry_spec.rb

@@ -0,0 +1,191 @@
+require 'spec_helper'


Suggested change

-require 'spec_helper'
+# frozen_string_literal: true
+
+require 'spec_helper'
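For context, the magic comment makes every string literal in the file frozen. It only takes effect at the top of a file, so this small sketch evaluates a one-line snippet with and without it; `string_literal` is a hypothetical helper used purely for illustration.

```ruby
# Hypothetical helper for illustration only: evaluates a one-line string
# literal, optionally preceded by the magic comment, and returns the String.
def string_literal(frozen:)
  source = +''
  source << "# frozen_string_literal: true\n" if frozen
  source << "'spec_helper'"
  eval(source)
end

with_comment = string_literal(frozen: true)
puts "frozen with magic comment: #{with_comment.frozen?}"

begin
  # Mutating a frozen literal raises instead of silently changing it.
  with_comment << '.rb'
rescue FrozenError => e
  puts "mutation rejected: #{e.class}"
end
```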

Strech · 2025-04-24T11:59:11Z

lib/datadog/core/workers/async.rb

@@ -47,6 +47,14 @@ def terminate
            @run_async = false
            Datadog.logger.debug { "Forcibly terminating worker thread for: #{self}" }
            worker.terminate
+            # Wait for the worker thread to end
+            begin
+              Timeout.timeout(0.5) do


I have a few questions here.

We wait for some time, fail with an exception on timeout, and then continue as usual, so what is the difference? The debug log?

Why don't we check whether the worker is still running instead of waiting for it to complete? Is it just to avoid a sleep?
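One way to combine the wait and the check is `Thread#join` with a limit, which returns `nil` if the thread has not finished within the limit and the thread itself otherwise. The sketch below is not the code in `lib/datadog/core/workers/async.rb`; `terminate_and_wait` is a hypothetical helper shown only to illustrate the idea.

```ruby
# Hypothetical helper, not the PR's implementation: kills the thread, then
# waits up to `limit` seconds for it to finish.
# Returns true if the thread ended in time, false otherwise.
def terminate_and_wait(thread, limit: 0.5)
  thread.kill
  # Thread#join(limit) returns the thread if it finished, nil on timeout.
  !thread.join(limit).nil?
end

worker = Thread.new { sleep } # sleeps until killed
terminated = terminate_and_wait(worker)
puts "terminated=#{terminated} alive=#{worker.alive?}"
```

The boolean result lets the caller log or react when a worker outlives the limit, without raising and rescuing a timeout exception.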

p-datadog requested a review from a team as a code owner April 16, 2025 14:52

p-datadog force-pushed the telemetry-integration-test branch from f4ba5a2 to c67d2b3 Compare April 16, 2025 14:52

anmarchenko approved these changes Apr 16, 2025

github-actions bot added the core Involves Datadog core libraries label Apr 16, 2025

p-datadog marked this pull request as draft April 17, 2025 15:13

p added 7 commits April 23, 2025 13:39

telemetry integration test

7b773a7

wait for thread to end

b415048

di add content-type assertion

34944d5

rubocop

6b5ae81

rubocop

aa17fd3

provide a block

b377ece

fix test

d1e9ac4

p-datadog force-pushed the telemetry-integration-test branch from 9cc84c0 to d1e9ac4 Compare April 23, 2025 17:40

Strech reviewed Apr 24, 2025
