Skip to content

Model hangs after COMPLETED MOM INITIALIZATION? #2473

Closed Answered by benjamin-cash
benjamin-cash asked this question in Q&A
Discussion options

You must be logged in to vote

The answer appears to be that these settings are missing from the job script:

ulimit -s unlimited
export OMP_STACKSIZE=512M

They are present in HERCULES.env and FRONTERA.env, but for some reason were never set in the job script. ulimit -s unlimited eliminated the seg fault in the ice transport code, and export OMP_STACKSIZE=512M eliminated the crash in the radiation, or at least that is what I am seeing in my test run. Somewhat concerningly, in my test run I am also seeing warnings like

(zap_snow_temperature)zqsn:  -110219996.370171
  (zap_snow_temperature)zap_snow_temperature: temperature out of bounds!

without the model crashing, but that is a concern for another day.

Replies: 8 comments 33 replies

Comment options

You must be logged in to vote
6 replies
@jiandewang
Comment options

@benjamin-cash
Comment options

@jiandewang
Comment options

@benjamin-cash
Comment options

@benjamin-cash
Comment options

Comment options

You must be logged in to vote
4 replies
@DeniseWorthen
Comment options

@benjamin-cash
Comment options

@benjamin-cash
Comment options

@DeniseWorthen
Comment options

Comment options

You must be logged in to vote
5 replies
@DeniseWorthen
Comment options

@benjamin-cash
Comment options

@benjamin-cash
Comment options

@jiandewang
Comment options

@benjamin-cash
Comment options

Comment options

You must be logged in to vote
1 reply
@benjamin-cash
Comment options

Comment options

You must be logged in to vote
2 replies
@benjamin-cash
Comment options

@jiandewang
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
15 replies
@benjamin-cash
Comment options

@DeniseWorthen
Comment options

@benjamin-cash
Comment options

@benjamin-cash
Comment options

@DeniseWorthen
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by benjamin-cash
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants