Attempt to fix flaky Harbor E2E setup #20641

Open · wants to merge 3 commits into master

Conversation

NouemanKHAL (Member)

What does this PR do?

The Harbor E2E tests failed recently with the following error:

E           datadog_checks.dev.errors.RetryError: Result: None
E           Error: HTTPConnectionPool(host='localhost', port=80): Max retries exceeded with url: /api/v2.0/users/ (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7ff4b4ff1f70>: Failed to establish a new connection: [Errno 111] Connection refused'))
E           Function: create_simple_user, Args: (), Kwargs: {}

This PR tries to address that issue by increasing the wait time, giving the Harbor users endpoint more time to become healthy.

Motivation

Failing job: https://github.com/DataDog/integrations-core/actions/runs/16020744131/job/45196949202

Review checklist (to be filled by reviewers)

  • Feature or bugfix MUST have appropriate tests (unit, integration, e2e)
  • Add the qa/skip-qa label if the PR doesn't need to be tested during QA.
  • If you need to backport this PR to another branch, you can add the backport/<branch-name> label to the PR and it will automatically open a backport PR once this one is merged

codecov bot commented Jul 2, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 90.04%. Comparing base (0cc4a1c) to head (f0ad2b8).

Additional details and impacted files
Flag Coverage Δ
activemq ?
cassandra ?
confluent_platform ?
harbor 89.04% <ø> (-0.61%) ⬇️
hive ?
hivemq ?
hudi ?
ignite ?
jboss_wildfly ?
kafka ?
presto ?
solr ?
tomcat ?
weblogic ?

Flags with carried forward coverage won't be shown.

@@ -49,8 +48,7 @@ def dd_environment(e2e_instance):
     expected_log = "http server Running on" if HARBOR_VERSION < [1, 10, 0] else "API server is serving at"
     conditions = [
         CheckDockerLogs(compose_file, expected_log, wait=3),
-        lambda: time.sleep(4),
-        WaitFor(create_simple_user),
+        WaitFor(create_simple_user, wait=5),

Contributor

Is there a reason to believe that one more second is enough? Just curious.

Member Author

class WaitFor(LazyFunction):
    def __init__(
        self,
        func,  # type: Callable
        attempts=60,  # type: int
        wait=1,  # type: int
        args=(),  # type: Tuple
        kwargs=None,  # type: Dict
    ):

This actually increases the waiting time by 4 seconds for every attempt (60 attempts by default).
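
For context, a minimal sketch of the retry loop being described (illustrative only; the actual WaitFor.__call__ in datadog_checks.dev may differ in how it handles results and which error it raises):

import time


class RetryError(Exception):
    pass


def wait_for(func, attempts=60, wait=1, args=(), kwargs=None):
    # Illustrative sketch: call func up to `attempts` times, sleeping `wait`
    # seconds between tries, so the worst case is roughly attempts * wait seconds.
    kwargs = kwargs or {}
    last_result = None
    for _ in range(attempts):
        try:
            last_result = func(*args, **kwargs)
        except Exception:
            last_result = None
        if last_result not in (False, None):
            return last_result
        time.sleep(wait)
    raise RetryError('Result: {}'.format(last_result))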

Contributor

Oooh true, I checked how WaitFor works but missed that the sleep in there happens on each loop iteration. Seems like a crazy increase though, we will go from 64 to 240 seconds max haha, hopefully 4 minutes is enough.

Member Author

I figured it's pointless to increase it just a bit every time it fails; I'd rather wait as much as possible since we have no choice. If it still fails with this big timeout, then we have a different problem and we might need to consider switching to exponential retry backoff.
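
For reference, a minimal sketch of what exponential retry backoff could look like here (a hypothetical wait_for_with_backoff helper, not an existing datadog_checks.dev API):

import time


def wait_for_with_backoff(func, attempts=8, initial_wait=1, max_wait=60):
    # Hypothetical sketch: sleep between attempts, doubling the delay after
    # each failure and capping it at max_wait, instead of a fixed interval.
    wait = initial_wait
    for _ in range(attempts):
        try:
            if func() not in (False, None):
                return True
        except Exception:
            pass
        time.sleep(wait)
        wait = min(wait * 2, max_wait)
    raise RuntimeError('Condition was not met after {} attempts'.format(attempts))

With initial_wait=1 and max_wait=60, the sleeps between attempts would grow 1, 2, 4, 8, 16, 32, 60, 60 seconds, so most of the waiting only happens when the endpoint is genuinely slow to come up.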
