Skip to content

[Bug] Tests timeouts 6h+, maybe r.futures.calib hangs #1312

@echoix

Description

@echoix

Some tests are hanging and taking 6+ hours before getting killed when running on CI.

Name of the addon
The name of the GRASS GIS addon.
The last line of the log shown is r.futures.calib, but maybe the problem is elsewhere.

Describe the bug
A clear and concise description of what the bug of the addon is.

A couple mornings this week, plus this weekend, I manually stopped some jobs in grass-addons as they were on their way to taking 6h.

To Reproduce
Steps to reproduce the behavior:

See the logs for build and test, most often (if not only) of the hanging jobs are for main+3.11.

  1. Go to '...'
  2. Click on '....'
  3. Compute '....'
  4. See error

Expected behavior
A clear and concise description of what you expected to happen.

Tests are fast, or at least, not hanging.

Screenshots
If applicable, add screenshots to help explain your problem.

System description (please complete the following information):

On CI runners.

  • Operating System: [e.g. Windows, Linux, ... incl. version]
  • GRASS GIS version [e.g. 7.8.1]

Additional context
Add any other context about the problem here.

I don't think my work last weekend is the cause, as some of the PRs weren't timing out, but some did. Maybe I just allowed more tests to pass (with more dependencies or better translatable strings) and we end up with hanging tests. There were some times out in the last weeks before my last work here, but not as constant as this last week. (Even before the caching. One day I tried deleting all cache here, and the next daily run timed out too).

It may be a while loop (that I didn't find in the first two r.futures tests and code), but there's a lot of nested for loops).

It could be a case where there is some input expected and it's waiting, but it's more common with the cmd.exe on windows.

Could a stdin/stdout/stderr be closed and the gunittest test runner doesn't handle it well?

We can give a try with 8.4.0 with Python 3.11 or the 8.5.0dev with the Python version used in the 8.4.0 job (if it is 3.9+), to see if it is a Python problem, or a regression in the development grass in the last month(s) or so.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions