The bot test logic in https://github.com/Farama-Foundation/Minigrid/blob/master/tests/test_baby_ai_bot.py is wrong. Rather than stopping at the first success, it should raise an error at the first failure.
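
To illustrate the difference, here is a minimal, self-contained sketch. It does not reproduce the actual test file: `run_episode`, `fake_run_episode`, and both check functions are hypothetical names made up for this example, and the "bot" is just a stand-in that fails on one seed.

```python
from typing import Callable, Iterable


def check_stops_at_first_success(
    seeds: Iterable[int], run_episode: Callable[[int], bool]
) -> bool:
    """Flawed pattern: returns as soon as one seed succeeds,
    so failures on later seeds are never seen."""
    for seed in seeds:
        if run_episode(seed):
            return True
    return False


def check_fails_at_first_failure(
    seeds: Iterable[int], run_episode: Callable[[int], bool]
) -> None:
    """Intended pattern: every seed must succeed,
    and the first failing seed raises immediately."""
    for seed in seeds:
        if not run_episode(seed):
            raise AssertionError(f"bot failed on seed {seed}")


def fake_run_episode(seed: int) -> bool:
    # Hypothetical stand-in for running the bot: it fails only on seed 2.
    return seed != 2


if __name__ == "__main__":
    # The flawed check passes because seed 0 already succeeds.
    print(check_stops_at_first_success(range(5), fake_run_episode))
    # The strict check surfaces the failure on seed 2.
    try:
        check_fails_at_first_failure(range(5), fake_run_episode)
    except AssertionError as err:
        print(err)
```

With the first pattern a single passing seed hides every later failure, which is why the test should be written in the second form.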