Description
Certain conditions cause the isCompleted flag to be set. For example, a negative score signifies task failure:
ScienceWorld/scienceworld/scienceworld.py
Line 328 in 10dd21a
if (score < 0):
Right now we set the isCompleted flag to true, but if an agent doesn't check it, the agent can still keep sending commands to the environment. There is a report that this might let tasks be gamed (especially the forced-choice tasks), since the agent might be able to take steps that further increase the score after the task has ended. We should likely modify the step() code so that once isCompleted is set, no further steps are processed, to prevent agents from reporting erroneously high scores in the future.
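A rough sketch of the kind of guard this could be, written as a wrapper so it doesn't depend on the real step() internals. Everything here is an assumption for illustration: `ToyEnv` and `GuardedEnv` are hypothetical names, and the `(observation, score, is_completed)` return shape is made up for the example and is not the actual ScienceWorld signature.

```python
class ToyEnv:
    """Hypothetical stand-in environment; NOT the ScienceWorld API."""

    def __init__(self):
        self.score = 0

    def reset(self):
        self.score = 0

    def step(self, action):
        # "fail" models a task failure; any other action earns points.
        if action == "fail":
            self.score = -100
        else:
            self.score += 10
        # Mirrors the `score < 0` completion check quoted above.
        is_completed = self.score < 0
        return f"did {action}", self.score, is_completed


class GuardedEnv:
    """Hypothetical wrapper that refuses to process steps after completion."""

    def __init__(self, env):
        self._env = env
        self._final = None  # last result once the episode has completed

    def reset(self):
        self._final = None
        return self._env.reset()

    def step(self, action):
        if self._final is not None:
            # Episode already completed: do not forward the action, and
            # report the frozen final score so it cannot be increased.
            _, score, _ = self._final
            return "Episode is complete; call reset().", score, True
        result = self._env.step(action)
        if result[2]:
            self._final = result
        return result


if __name__ == "__main__":
    env = GuardedEnv(ToyEnv())
    env.step("fail")
    print(env.step("look"))
```

With the unguarded `ToyEnv`, a step after the failure would raise the score again; with the guard, the score stays at its final value until reset() is called. Whether to return a frozen result or raise an exception instead is a design choice; returning a frozen result is less likely to break existing agent loops.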