
patch-too-small error during global transient historical run #1452

@ckoven

Describe the issue

I ran into another error during a historical global transient run. This one occurs in a gridcell that is almost entirely bare ground or rangeland; possibly the gridcell is trying to convert entirely to rangeland. Disabling the endrun, as the error log suggests, worked in this case and allowed the run to complete.

Relevant log output

949:  Warning. small nocomp patch wasnt able to find another patch to fuse with.
 949:           14           1  9.463164189702107E-005
 949:  Warning. small nocomp patch wasnt able to find another patch to fuse with.
 949:           14           1  9.463164189702107E-005
 949:  Warning. small nocomp patch wasnt able to find another patch to fuse with.
 949:           14           1  9.463164189702107E-005
 949:  Warning. small nocomp patch wasnt able to find another patch to fuse with.
 949:           14           1  9.463164189702107E-005
 949:  Warning. small nocomp patch wasnt able to find another patch to fuse with.
 949:           14           1  9.463164189702107E-005
 949:  Warning. small nocomp patch wasnt able to find another patch to fuse with.
 949:           14           1  9.463164189702107E-005
 949:  FATES is having difficulties fusing very small patches.
 949:  It is possible that a either a secondary or primary
 949:  patch has become the only patch of its kind, and it is
 949:  is very very small. You can test your luck by
 949:  disabling the endrun statement following this message.
 949:  FATES may or may not continue to operate within error
 949:  tolerances, but will generate another fail if it does not.
 949:  otherwise, dumping some diagnostics.
 949:   9.463164189702107E-005          14           1
 949:  ----------------------------------------
 949:   Site Coordinates                       
 949:  ----------------------------------------
 949:  latitude                    =   -19.8947368421053     
 949:  longitude                   =    140.000000000000     
 949:  ----------------------------------------
 949:  currentSite%area_bareground  0.150956047736229     
 949:  currentSite%area_pft(:,:)  3.066543719903422E-002  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:   0.371106235216921       0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.598228327584045     
 949:   3.066543719903422E-002  0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.371106235216921     
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.598228327584045       0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:    1.00000000000000       0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:   0.382846387056066       0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.617153612943934     
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:   0.000000000000000E+000  0.000000000000000E+000  0.000000000000000E+000
 949:    1.00000000000000       0.000000000000000E+000
 949:    1.66531788899030               13           5
 949:    8488.60777410693               14           3
 949:   9.463164189702107E-005          14           1
 949:   0.153640325454551                9           1
 949:   1.269568469716925E-002           1           1
 949:    1509.56047736229                0           0
 949:  current landuse state vector:   1.664306417936176E-005  0.000000000000000E+000
 949:   0.848860777410693       0.000000000000000E+000  1.665317888990300E-004
 949:  current landuse state vector (not including bare gruond): 
 949:   1.960212322929460E-005  0.000000000000000E+000  0.999784257513890     
 949:   0.000000000000000E+000  1.961403628811124E-004
 949:  driver data landuse state vector:   1.323249185591491E-004
 949:   2.978767207873597E-005  0.999657956950475       2.028149628835395E-015
 949:   1.799304588850681E-004
 949:  min_allowed_landuse_fraction:   1.177795327713885E-006
 949:  landuse_vector_gt_min:  T F T F T
 949:  trans matrix from:            1  0.000000000000000E+000  0.000000000000000E+000
 949:   1.574195894913856E-013  0.000000000000000E+000  2.058831000805111E-012
 949:  trans matrix from:            2  0.000000000000000E+000  0.000000000000000E+000
 949:   7.533481747792357E-008  0.000000000000000E+000  1.672087788301078E-009
 949:  trans matrix from:            3  0.000000000000000E+000  3.862805945954699E-012
 949:   0.000000000000000E+000  0.000000000000000E+000  6.026978058250894E-008
 949:  trans matrix from:            4  0.000000000000000E+000  0.000000000000000E+000
 949:   5.556574225441878E-018  0.000000000000000E+000  0.000000000000000E+000
 949:  trans matrix from:            5  0.000000000000000E+000  1.414170290789865E-012
 949:   5.353201272077195E-009  0.000000000000000E+000  0.000000000000000E+000
 949:  this isnt because the land use was less than allowed
 949:  ENDRUN: 
 949:  ERROR: ERROR in EDPatchDynamicsMod.F90 at line 3527
 949: Image              PC                Routine            Line        Source             
 949: e3sm.exe           000000000149C00D  shr_abort_mod_mp_         114  shr_abort_mod.F90
 949: e3sm.exe           0000000000E81C6F  fatesglobals_mp_f         103  FatesGlobals.F90
 949: e3sm.exe           0000000000F93669  edpatchdynamicsmo        3527  EDPatchDynamicsMod.F90
 949: e3sm.exe           0000000000E6AF7A  edmainmod_mp_ed_e         325  EDMainMod.F90
 949: e3sm.exe           00000000005EA422  elmfatesinterface        1352  elmfates_interfaceMod.F90
 949: e3sm.exe           00000000005748F5  elm_driver_mp_elm        1299  elm_driver.F90
 949: e3sm.exe           0000000000558560  lnd_comp_mct_mp_l         619  lnd_comp_mct.F90
 949: e3sm.exe           000000000045C72E  component_mod_mp_         757  component_mod.F90
 949: e3sm.exe           0000000000435D34  cime_comp_mod_mp_        2973  cime_comp_mod.F90
 949: e3sm.exe           000000000045C3C2  MAIN__                    153  cime_driver.F90
 949: e3sm.exe           000000000043347D  Unknown               Unknown  Unknown
 949: libc-2.31.so       00001458BA5CA1FD  __libc_start_main     Unknown  Unknown
 949: e3sm.exe           00000000004333AA  Unknown               Unknown  Unknown
 949: MPICH ERROR [Rank 949] [job id 41501596.0] [Fri Aug  8 19:57:11 2025] [nid006258] - Abort(1001) (rank 949 in comm 0): application called MPI_Abort(MPI_COMM_WORLD, 1001) - process 949
 949: 
 949: aborting job:
 949: application called MPI_Abort(MPI_COMM_WORLD, 1001) - process 949
srun: error: nid006258: task 949: Exited with exit code 255
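
A quick consistency check on the diagnostics above, offered as a sketch and not taken from the FATES source. It assumes that the three-column lines near the end of the dump list patch area (m2), nocomp PFT index, and land-use label, with the 0/0 line being bare ground. That reading is an inference from the numbers, not something the log states. Under that assumption the patch areas should sum to the site area, the per-label sums should reproduce the "current landuse state vector", and dividing by (1 - area_bareground) should reproduce the vector that excludes bare ground.

```python
import math

# Values copied verbatim from the log output above.
area_bareground = 0.150956047736229
min_allowed = 1.177795327713885e-06

# Assumed column meaning (not stated in the log):
# (patch area in m2, nocomp PFT index, land-use label); (0, 0) = bare ground.
patches = [
    (1.66531788899030, 13, 5),
    (8488.60777410693, 14, 3),
    (9.463164189702107e-05, 14, 1),   # the patch named in the warning
    (0.153640325454551, 9, 1),
    (1.269568469716925e-02, 1, 1),
    (1509.56047736229, 0, 0),
]

state_vector = [1.664306417936176e-05, 0.0, 0.848860777410693,
                0.0, 1.665317888990300e-04]
state_vector_no_bare = [1.960212322929460e-05, 0.0, 0.999784257513890,
                        0.0, 1.961403628811124e-04]

# Total site area implied by the patch list.
total_area = sum(area for area, _pft, _label in patches)

# Sum patch area per land-use label, then convert to site fractions.
area_by_label = {}
for area, _pft, label in patches:
    area_by_label[label] = area_by_label.get(label, 0.0) + area
fractions = [area_by_label.get(label, 0.0) / total_area for label in range(1, 6)]

# Renormalize the logged state vector over the vegetated (non-bare) area.
renormalized = [f / (1.0 - area_bareground) for f in state_vector]

# Compare each land-use fraction with the minimum allowed fraction.
gt_min = [f > min_allowed for f in state_vector]

# Patches that share both PFT and label with the tiny patch (PFT 14, label 1).
same_kind = [p for p in patches if p[1] == 14 and p[2] == 1]

print("total area:", total_area)
print("fractions by label:", fractions)
print("renormalized:", renormalized)
print("gt_min:", gt_min)
print("patches with PFT 14 and label 1:", len(same_kind))
```

If the column reading is right, the 9.46E-05 patch is the only one with PFT 14 in land-use label 1, which would be consistent with the "only patch of its kind" explanation in the message. The label-1 fraction is still above min_allowed_landuse_fraction, which matches the final "this isnt because the land use was less than allowed" line.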

FATES tag

c2da27f

Host land model tag

39e91e09b5

Machine

perlmutter

Other supported machine name

No response

Additional context

No response
