tests: add cram wrapper parallel_cram.sh that runs cram tests in parallel #1667
base: master
Conversation
…f cram tests

Cram tests are perfect for parallelization. Each test is independent and we have 178 of them. The wrapper script allows running the tests in parallel. In my experiments, on an M1 Pro, total test time was reduced from 7m30s to 1m23s, a more than 5x speedup. I got the best results with `-j8` (instead of all 10 cores). The wrapper accepts many of cram's options, and one can still run the tests of a single directory, for example `./parallel_cram.sh tests/functional/tree`. One caveat: it seems that iqtree creates files in the _input_ file directory. So we should copy the input files to a temporary directory before running.
…est's temp folder
TODO:
Our tests are revealing a real user-interface issue here. Instead of working around it in tests, perhaps we can fix this input-directory pollution for good by relocating the temporary alignment file we're already using (line 250 in 3f72c40) into a temporary directory, thus also avoiding having to do this junk (lines 304 to 310 in 3f72c40).
Parallel testing is great and we should absolutely do it for our Cram tests. Thank you for tackling it!
OTOH, I don't love the additional overhead of maintaining parallel_cram.sh ourselves. It's re-implementing bits of common tooling like parallel and even prove. My preference would be to write a much smaller bit of code to translate cram's output to TAP and then use prove to run the files in parallel. This would not only get us things like slow-test priority out of the box, but also recording of statistics and keeping track of them (i.e. "what's slow") over time if we wanted.
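To make the suggestion concrete, here is a hypothetical sketch of such a cram-to-TAP shim; the script name `cram-tap`, its layout, and the invocation below are illustrative assumptions, not code from this PR. `prove` runs each test file through `--exec` and schedules them in parallel with `-j`:

```shell
#!/bin/bash
# Hypothetical "cram-tap" shim: run one cram test file and report the
# result as TAP, so prove can parallelize across files, e.g.:
#   prove -j8 --exec ./cram-tap tests/functional/cram/
# (Names and layout are illustrative, not taken from this PR.)
set -euo pipefail

cram_tap() {
    local file="$1"
    echo "1..1"                       # TAP plan: one test point per file
    local output
    if output=$(cram "$file" 2>&1); then
        echo "ok 1 - $file"
    else
        echo "not ok 1 - $file"
        # Carry cram's diff along as TAP comment lines for diagnostics
        printf '%s\n' "$output" | sed 's/^/# /'
    fi
}

if [ "$#" -gt 0 ]; then
    cram_tap "$@"
fi
```

A nice side effect of the TAP route is that `prove --state=slow,save` then gives the "run slowest first" scheduling mentioned above for free.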
@@ -0,0 +1,185 @@
#!/bin/bash
No `set -euo pipefail`, which is a red flag to me.
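For context, the prologue being asked for is the common bash "strict mode"; a minimal sketch of what it does:

```shell
#!/bin/bash
# The prologue the reviewer is asking for: bash "unofficial strict mode".
set -euo pipefail
#   -e           exit immediately when a command fails
#   -u           treat expansion of an unset variable as an error
#   -o pipefail  a pipeline fails if any stage fails, not just the last

# With pipefail set above, this pipeline correctly reports failure:
false | true && echo "pipeline ok" || echo "pipeline failed"
# prints "pipeline failed"
```

Without these options, a bash script silently marches on past failed commands, which is especially dangerous in a test runner that aggregates exit statuses.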
VERBOSE=0
KEEP_TMPDIR=0
SHELL_PATH="/bin/bash"
SHELL_OPTS=""
INDENT=2
This is hardcoding things that should be left at defaults, either cram's builtins or our project-level defaults from .cramrc.
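For reference, cram reads project-level defaults from a configuration file with a `[cram]` section, whose keys mirror the long command-line options. A sketch of what that could hold (the values here are illustrative, not this project's actual settings):

```ini
# .cramrc -- project-level cram defaults (illustrative values)
[cram]
shell = /bin/bash
indent = 2
```

Leaving these in `.cramrc` means the wrapper and plain `cram` invocations stay in agreement without duplicating settings.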
--shell=*)
    SHELL_PATH="${1#*=}"
    shift
    ;;
--shell-opts=*)
    SHELL_OPTS="${1#*=}"
    shift
    ;;
--indent=*)
    INDENT="${1#*=}"
    shift
    ;;
These don't take the common `--opt value` form, only `--opt=value`.
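A sketch of how the parsing loop could accept both forms; the variable names follow the script under review, but the code itself is illustrative, not the PR's actual code:

```shell
#!/bin/bash
# Sketch: accept both `--opt=value` and `--opt value` argument forms
# (variable names follow the script under review; illustrative only).
parse_args() {
    SHELL_PATH="" SHELL_OPTS="" INDENT=""
    while [ $# -gt 0 ]; do
        case "$1" in
            --shell=*)      SHELL_PATH="${1#*=}"; shift   ;;
            --shell)        SHELL_PATH="$2";      shift 2 ;;
            --shell-opts=*) SHELL_OPTS="${1#*=}"; shift   ;;
            --shell-opts)   SHELL_OPTS="$2";      shift 2 ;;
            --indent=*)     INDENT="${1#*=}";     shift   ;;
            --indent)       INDENT="$2";          shift 2 ;;
            *)              shift ;;  # positionals handled elsewhere
        esac
    done
}
```

The `--opt=*` patterns must come before the bare `--opt` patterns so the `case` matches the more specific form first.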
*)
    TEST_DIR="$1"
    shift
    ;;
If more than one positional argument is given, only the last will be used; e.g. you can't do `./parallel_cram.sh tests/functional/cram/{tree,refine}`.
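A sketch of collecting every positional argument into an array instead of overwriting a single scalar (illustrative code, not from the PR):

```shell
#!/bin/bash
# Sketch: keep all positional arguments so several test directories can
# be passed at once. Illustrative only; option handling is elided.
collect_positionals() {
    TEST_DIRS=()
    for arg in "$@"; do
        case "$arg" in
            --*) : ;;                    # options handled elsewhere
            *)   TEST_DIRS+=("$arg") ;;  # keep every positional
        esac
    done
}
collect_positionals "$@"
# Later the directories can all be searched at once, e.g.:
#   find "${TEST_DIRS[@]}" -name '*.t'
```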
# Build cram command with options
CRAM_CMD="cram"
[ $QUIET -eq 1 ] && CRAM_CMD="$CRAM_CMD -q"
[ $VERBOSE -eq 1 ] && CRAM_CMD="$CRAM_CMD -v"
[ $KEEP_TMPDIR -eq 1 ] && CRAM_CMD="$CRAM_CMD --keep-tmpdir"
[ -n "$SHELL_PATH" ] && CRAM_CMD="$CRAM_CMD --shell=$SHELL_PATH"
[ -n "$SHELL_OPTS" ] && CRAM_CMD="$CRAM_CMD --shell-opts=$SHELL_OPTS"
[ -n "$INDENT" ] && CRAM_CMD="$CRAM_CMD --indent=$INDENT"
Instead of a string, this should build up an array and then be used as `"${CRAM_CMD[@]}"` so that it's robust to the multiple applications of word splitting.
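What the array version could look like, a sketch reusing the script's variable names (the values assigned at the top are just for demonstration):

```shell
#!/bin/bash
# Sketch: build the command as an array so arguments containing spaces
# survive intact (variable names follow the script under review;
# the assigned values are illustrative).
QUIET=0 VERBOSE=1 KEEP_TMPDIR=0
SHELL_OPTS="-e -u"   # contains a space: would break with string splitting

CRAM_CMD=(cram)
[ "$QUIET"       -eq 1 ] && CRAM_CMD+=(-q)
[ "$VERBOSE"     -eq 1 ] && CRAM_CMD+=(-v)
[ "$KEEP_TMPDIR" -eq 1 ] && CRAM_CMD+=(--keep-tmpdir)
[ -n "$SHELL_OPTS" ]     && CRAM_CMD+=("--shell-opts=$SHELL_OPTS")
# Invoke with "${CRAM_CMD[@]}" so each element stays a single argument:
#   "${CRAM_CMD[@]}" "$file"
```

With the string version, `--shell-opts=-e -u` would be split into two words at invocation time; the quoted array expansion avoids that.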
LOCK_DIR="/tmp/cram_lock_$$"
mkdir "$LOCK_DIR"
This should be a `mktemp -d` call.
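i.e., something along these lines (a sketch; the template name is illustrative):

```shell
#!/bin/bash
# Sketch: let mktemp pick a safe, unique directory instead of the
# predictable /tmp/cram_lock_$$ path, and clean it up on exit.
set -euo pipefail

LOCK_DIR=$(mktemp -d "${TMPDIR:-/tmp}/cram_lock.XXXXXX")
trap 'rm -rf "$LOCK_DIR"' EXIT
```

Besides uniqueness, `mktemp -d` avoids the symlink-attack risk of creating a predictable path under the world-writable `/tmp`.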
# Run tests in parallel with specified number of jobs
echo "$ALL_FILES" | xargs -P "$PARALLEL_JOBS" -I {} sh -c '
    file="$1"
    lock_dir="$LOCK_DIR"
    output=$($CRAM_CMD "$file" 2>&1)
    status=$?

    while ! mkdir "$lock_dir/lock" 2>/dev/null; do
        sleep 0.05
    done

    # Record test status
    echo "$status" >> "$RESULTS_FILE"

    if [ $status -eq 0 ]; then
        printf "."
    else
        echo "$output" | grep -v "^# Ran "
    fi

    # If --keep-tmpdir is enabled, capture the temporary directory
    if [ $KEEP_TMPDIR -eq 1 ]; then
        echo "$output" | grep "Kept temporary directory:" >> "$TMPDIR_FILE"
    fi

    rm -rf "$lock_dir/lock"
' - {}
FWIW, this reimplements a bit of functionality in parallel (which is typically more useful than xargs).
Alternatively, if we don't want to switch to TAP, modify Cram itself to support parallel testing. This doesn't seem difficult, from a look at the codebase.
I initially tried parallel, but it's less portable. In fact, the brew-installed parallel I have on my system is the moreutils parallel, not GNU parallel. We could work around this by installing GNU parallel in the conda environment. What you suggest, @tsibley, sounds good, but realistically I won't do it; I'm not sure how much maintenance there would have to be. We could go with this for now until we have one of the other alternatives you mention, if you feel like doing that.
I mean, GNU parallel is a single-file (15k line!) Perl program using only the Perl stdlib, compatible with a wide range of Perl versions. It's very easy to ship a copy (and I've done so before ;-). But yes, I get your point.
Yep, I'm not surprised.
Going with this "for now" (really, indefinitely) is fine. Don't take my comments to mean this can't merge! There won't be much maintenance, probably, but there will be little bugs here and there (e.g. in the current pass, …).

Out of curiosity, this afternoon I wrote an alternative because I wanted to see what it'd look like with …

Upon writing the above comment this evening, I got curious again and decided to quickly implement the same output-filter approach to produce TAP instead so I could use it with …
Description of proposed changes
Cram tests are perfect for parallelization. Each test is independent and we have 178 of them. The wrapper script allows running the tests in parallel. In my experiments, on an M1 Pro, total test time was reduced from 7m30s to 1m23s, a more than 5x speedup. I got the best results with `-j8` (instead of all 10 cores).

The wrapper accepts many of cram's options, and one can still run the tests of a single directory, for example `./parallel_cram.sh tests/functional/tree`.

One caveat: it seems that iqtree creates files in the input file directory. As multiple tests use the same input files, they might conflict. So we copy the input files to a temporary directory before running. This is done in commit 687bd04.
We seem to be getting almost a 2x speedup in CI as well, as GitHub runners come with 2 cores by default. The bottleneck is now the RSV pathogen CI, which runs for around 8 minutes (previously the cram tests took 13 minutes; now they finish before RSV does). Overall GitHub runner time is reduced from around 2h5min to 1h15min, a saving of around a third.
I tested the script to ensure that it correctly reports test failures. I've also been using it in my regular work on various PRs and it's worked exactly as expected, saving me waiting time.
Checklist