Open
Description
Occasionally my environment (and thus my ray workers) crash at the beginning of training.
I observed two cases so far:
- IndexError: pop from empty list [next_time_step_to_pick = self.next_time_step.pop(0)]
- IndexError: index 15 is out of bounds for axis 0 with size 15 [time_needed = self.instance_matrix[action][current_time_step_job][1]]
Obviously the random actions steer the environment into a bad place.
How did you handle this during your own training? Currently I can't train my agents, because they crash when the env crashes. (Im using Ray[rllib])
Code to reproduce the error:
env_config = {"instance_path": "\\instances\\ta20"}
env = gym.make('JSSEnv-v1', env_config=env_config)
obs = env.reset()
while True:
action = env.action_space.sample()
obs, reward, done, _ = env.step(action)
env.render()
if done:
print("Episode ended")
break
env.close()```
Metadata
Metadata
Assignees
Labels
No labels