Skip to content

unit and functional testing fail to run on derecho #1441

@glemieux

Description

@glemieux

Describe the issue

Attempting to run either unit or functional tests on dececho with fates results in an xml error referencing the attribute gpu_offload. I confirmed that I'm using the ctsm_pylib environment built with the yml file associated with the ctsm tag I am testing this with.

Relevant log output

(ctsm_pylib) ✔  ~/ctsm/src/fates/testing [sci.1.84.0_api.40.0.0|⚑ 25] 
10:15 $ python run_unit_tests.py 
/glade/u/home/glemieux/ctsm/src/fates/_build/env_mach_specific.xml already exists, delete to replace
ERROR: Command: '/glade/work/glemieux/conda-envs/ctsm_pylib/bin/xmllint --xinclude --noout --schema /glade/u/home/glemieux/ctsm/cime/CIME/data/config/xml_schemas/env_mach_specific.xsd /glade/u/home/glemieux/ctsm/src/fates/_build/env_mach_specific.xml' failed with error '/glade/u/home/glemieux/ctsm/src/fates/_build/env_mach_specific.xml:63: Schemas validity error : Element 'modules', attribute 'gpu_offload': The attribute 'gpu_offload' is not allowed.
/glade/u/home/glemieux/ctsm/src/fates/_build/env_mach_specific.xml fails to validate' from dir '/glade/u/home/glemieux/ctsm/src/fates/_build'

FATES tag

sci.1.84.0_api.40.0.0

Host land model tag

ctsm5.3.063

Machine

derecho

Other supported machine name

No response

Additional context

This came up in the context of testing #1426.

Relevant line of ~/ctsm/src/fates/_build/env_mach_specific.xml

 62     </modules>
 63     <modules mpilib="mpich" compiler="nvhpc" gpu_offload="!none">
 64       <command name="load">cuda/12.2.1</command>
 65     </modules>

I'm not sure if this specific build xml file is procedurally generated or should be a copy of the specific machine xml file.

I confirmed that ccs_config is on the default tag ccs_config_cesm1.0.43.

Metadata

Metadata

Assignees

No one assigned

    Labels

    HLM: ctsmAn issue is related to ctsm host land model or a particular PR has a corresponding ctsm-side PRtype: testingRelated to improvements in code testing (e.g. unit, regression, etc)

    Type

    Projects

    Status

    ✔ Done

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions