-
Notifications
You must be signed in to change notification settings - Fork 311
Open
Labels
Description
cd /lus/flare/projects/Aurora_testing/mpi/osu_rfm/run_collective/512/gather-gather_persistent-gatherv-gatherv_persistent/stage/2025-09-12_18-37-26/aurora/compute/PrgEnv-intel/RunMPIcollective
awk 'BEGIN{N=5} {if(prev~/Lat(us)/&&/Sat/){for(i=NR-N;i<NR;i++)if(i>0)print buffer[i%N];print $0;count=N} else if(count>0){print $0;count--} buffer[NR%N]=$0; prev=$0}' rfm_job.out
gives the calls that did not return properly
Error signature:
x4213c4s7b0n0.hsn.cm.aurora.alcf.anl.gov: rank 27421 died from signal 6
x4417c3s5b0n0.hsn.cm.aurora.alcf.anl.gov: rank 48109 died from signal 15