You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
At some point, some other REMIND user (we do not know who yet) also set out to generate new input data with the revision 6.58. Luckily (since we thus were able to detect it) that run apparently failed around nine, leaving an incomplete input data revision in its wake.
Social mechanisms for blocking revision numbers obviously do not suffice. We have users use lastrev to check the next free revision number. But concurrently started jobs will collide.
The easiest solution to me seems to create empty files at the start of the run to block them.
retrieveData() will happily overwrite output. In terms of scientific reproducibility, that should not happen.
So stopping if the files that would be written already exist would solve this (and contribute to revision blocking).
Somehow retrieveData() failed, but still wrote a legitimate-looking archive.
I downloaded the original rev6.58_62eff8f7_remind.tgz file for testing before it was corrupted. The new archive is missing several files, rendering it unfit for use with REMIND:
$ comm -3 <( tar -tf ~/PIK/swap/inputdata/output/rev6.58_62eff8f7_remind.tgz | sort ) <( tar -tf ~/PIK/Remind_input/rev6.58_62eff8f7_remind.tgz | sort )
./config.rds
./f21_tau_fe_sub.cs4r
./f21_tau_fe_tax.cs4r
./f21_tax_convergence.cs4r
./f_gdp.cs3r
./f_lab.cs3r
./f_pop.cs3r
./p01_boundInvMacro.cs4r
./pm_shPPPMER.cs4r
./regionmappingH12.csv
I have no idea how retrieveData() wrote an archive with missing files, but I think it should not do that. (It appears to not have been a race condition where the first run deleted the files while the second was packing up its archive. All files from the second run show creation dates later then 19:00.)
The text was updated successfully, but these errors were encountered:
Log files of the colliding runs: /p/tmp/jakobdu/pre-processing/log-2023-10-26-28328875.out and /p/tmp/chengong/pre-processing/log-2023-10-26-28329766.out.
Well, it was bound to happen.
Yesterday, @JakobBD generated new REMIND input data using the revision
6.58
. That finished around half past five.At some point, some other REMIND user (we do not know who yet) also set out to generate new input data with the revision
6.58
. Luckily (since we thus were able to detect it) that run apparently failed around nine, leaving an incomplete input data revision in its wake.Several issues here are worth addressing:
lastrev
to check the next free revision number. But concurrently started jobs will collide.The easiest solution to me seems to create empty files at the start of the run to block them.
retrieveData()
will happily overwrite output. In terms of scientific reproducibility, that should not happen.So stopping if the files that would be written already exist would solve this (and contribute to revision blocking).
retrieveData()
failed, but still wrote a legitimate-looking archive.I downloaded the original
rev6.58_62eff8f7_remind.tgz
file for testing before it was corrupted. The new archive is missing several files, rendering it unfit for use with REMIND:retrieveData()
wrote an archive with missing files, but I think it should not do that. (It appears to not have been a race condition where the first run deleted the files while the second was packing up its archive. All files from the second run showcreation dates
later then 19:00.)The text was updated successfully, but these errors were encountered: