Env crashes with IndexError, when using random actions

Occasionally my environment (and thus my ray workers) crash at the beginning of training.

I observed two cases so far:
- IndexError: pop from empty list  [next_time_step_to_pick = self.next_time_step.pop(0)]
- IndexError: index 15 is out of bounds for axis 0 with size 15 [time_needed = self.instance_matrix[action][current_time_step_job][1]]

Obviously the random actions steer the environment into a bad place.

How did you handle this during your own training? Currently I can't train my agents, because they crash when the env crashes. (Im using Ray[rllib])


Code to reproduce the error:
```python
env_config = {"instance_path": "\\instances\\ta20"}
env = gym.make('JSSEnv-v1', env_config=env_config)

obs = env.reset()
while True:
    action = env.action_space.sample()
    obs, reward, done, _ = env.step(action)
    env.render()
    if done:
        print("Episode ended")
        break
env.close()``` 


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Env crashes with IndexError, when using random actions #9

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Env crashes with IndexError, when using random actions #9

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions