Skip to content

Commit

Permalink
improving baselines: extended obs, better embedder, faster with bf16 (#…
Browse files Browse the repository at this point in the history
…32)

* extended obs wip

* extended obs wip

* extended obs with rules wip

* additional comment

* fix ruff action

* fix render wrapper

* updated baselines with dir obs

* embedder

* updated configs with more SPS

* fix typo

* added bf16 support
  • Loading branch information
Howuhh authored Aug 1, 2024
1 parent dff8040 commit bbc5f5f
Show file tree
Hide file tree
Showing 9 changed files with 177 additions and 153 deletions.
2 changes: 1 addition & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -174,4 +174,4 @@ g
# will remove later
scripts/*testing*
configs/wip
scripts/eval_example.py
scripts/eval_example.py
2 changes: 1 addition & 1 deletion configs/meta.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ num_envs: 16384
num_steps_per_env: 12800
num_steps_per_update: 256
update_epochs: 1
num_minibatches: 32
num_minibatches: 16
eval_num_envs: 16384
eval_num_episodes: 25
train_seed: 5
2 changes: 1 addition & 1 deletion configs/single.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,6 @@ total_timesteps: 1_000_000_000
num_envs: 16384
num_steps: 256
update_epochs: 1
num_minibatches: 8
num_minibatches: 16
eval_episodes: 512
train_seed: 5
7 changes: 2 additions & 5 deletions src/xminigrid/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -2,11 +2,11 @@
from .registration import make, register, registered_environments

# TODO: add __all__
__version__ = "0.8.0"
__version__ = "0.9.0"

# ---------- XLand-MiniGrid environments ----------

# WARN: TMP, only for FPS measurements
# WARN: TMP, only for FPS measurements, will remove later
# register(
# id="MiniGrid-1Rules",
# entry_point="xminigrid.envs.xland_tmp:XLandMiniGrid",
Expand Down Expand Up @@ -79,9 +79,6 @@
# width=64,
# )


# TODO: reconsider grid sizes and time limits after the benchmarks are generated.
# Should be enough space for initial tiles even in the hardest setting
register(
id="XLand-MiniGrid-R1-9x9",
entry_point="xminigrid.envs.xland:XLandMiniGrid",
Expand Down
81 changes: 0 additions & 81 deletions training/eval.py

This file was deleted.

Loading

0 comments on commit bbc5f5f

Please sign in to comment.