Refactoring #19

Howuhh · 2023-05-23T10:31:05Z

No description provided.

algorithms/small_scale/bc_chaotic_lstm.py

vkurenkov · 2023-05-24T11:01:57Z

algorithms/small_scale/bc_chaotic_lstm.py

+
+    set_seed(config.train_seed)
+
+    def env_fn():


Let's move it to a standalone function out of main

then we have to add character as an argument, it's not very nice
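
One possible way around the extra argument, sketched with `functools.partial`: bind `character` once, so the vector env still receives zero-argument callables. The `make_env` helper and the character string below are hypothetical placeholders, not code from this PR.

```python
from functools import partial
from typing import Callable, List


def make_env(character: str, seed: int = 0) -> dict:
    # Hypothetical stand-in for the real environment constructor.
    return {"character": character, "seed": seed}


def build_env_fns(character: str, num_envs: int) -> List[Callable[[], dict]]:
    # Bind `character` up front; each entry remains a zero-argument callable.
    env_fn = partial(make_env, character=character)
    return [env_fn for _ in range(num_envs)]


# Placeholder character string, used only for illustration.
env_fns = build_env_fns("mon-hum-neu", num_envs=2)
envs = [fn() for fn in env_fns]
```

This keeps `env_fn` out of `main` while callers that expect `env_fns=[...]` do not need to know about `character` at all.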

vkurenkov · 2023-05-24T11:02:36Z

algorithms/small_scale/bc_chaotic_lstm.py

+    tmp_env = env_fn()
+    eval_env = AsyncVectorEnv(
+        env_fns=[env_fn for _ in range(config.eval_processes)],
+        shared_memory=True,


Add a comment explaining why this is needed

vkurenkov · 2023-05-24T11:03:08Z

algorithms/small_scale/bc_chaotic_lstm.py

+        seed=config.train_seed,
+        add_next_step=False
+    )
+    tp = ThreadPoolExecutor(max_workers=14)


this should either be a config value or something automatic based on the number of cores/processors we have

fixed, now as config value
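
For reference, a minimal sketch that combines both options from the comment above: use the configured value when one is given, otherwise fall back to the number of CPUs. The `resolve_num_workers` helper is an assumption for illustration and is not the name used in the PR.

```python
import os
from concurrent.futures import ThreadPoolExecutor
from typing import Optional


def resolve_num_workers(configured: Optional[int] = None) -> int:
    # Prefer the explicit config value; otherwise use the detected CPU count.
    if configured is not None:
        if configured < 1:
            raise ValueError("number of workers must be >= 1")
        return configured
    # os.cpu_count() may return None, so fall back to a single worker.
    return os.cpu_count() or 1


with ThreadPoolExecutor(max_workers=resolve_num_workers(4)) as tp:
    squares = list(tp.map(lambda x: x * x, range(5)))
```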

vkurenkov · 2023-05-24T11:08:10Z

katakomba/env.py

+         Returns score normalized against AutoAscend bot scores achieved for this exact character.
+        """
+        if self.character.count("-") != 2:
+            raise ValueError("Reference score not provided for this character.")


typo: Reference score is not provided...

vkurenkov · 2023-05-24T11:11:12Z

katakomba/env.py

+
+    def get_dataset(self, scale: str = "small", **kwargs):
+        if self.character.count("-") != 2:
+            raise ValueError("Reference score not provided for this character.")


katakomba/env.py

vkurenkov

looks ok

vkurenkov · 2023-05-25T02:45:21Z

algorithms/small_scale/bc_chaotic_lstm.py

+
+
+@torch.no_grad()
+def filter_wd_params(model: nn.Module):


add type for return value

vkurenkov · 2023-05-25T02:45:33Z

algorithms/small_scale/bc_chaotic_lstm.py

+    return no_decay, decay
+
+
+def dict_to_tensor(data, device):


vkurenkov · 2023-05-25T02:46:04Z

algorithms/small_scale/bc_chaotic_lstm.py

+
+
+@torch.no_grad()
+def vec_evaluate(vec_env, actor, num_episodes,  seed=0, device="cpu"):


vkurenkov · 2023-05-25T02:46:12Z

algorithms/small_scale/bc_chaotic_lstm.py

+
+
+class Actor(nn.Module):
+    def __init__(self, action_dim, rnn_hidden_dim=512, rnn_layers=1, rnn_dropout=0.0, use_prev_action=True):


vkurenkov · 2023-05-25T02:47:50Z

algorithms/small_scale/bc_chaotic_lstm.py

+
+    pbar.close()
+    result = {
+        "reward_median": np.median(episode_rewards),


let's rename to returns to be consistent

return_median, return_mean, etc

too late, we already have all the logs in wandb in this format...

it is consistent across algorithms tho

vkurenkov · 2023-05-25T02:57:28Z

katakomba/utils/datasets/large_scale.py

+        align: Optional[Alignment] = None,
+        **kwargs
+) -> nld.TtyrecDataset:
+    if not nld.db.exists(db_path):


this original solution from DD is actually a bit problematic:
if the db was not properly initialized for some reason (e.g., a wrong path that was later fixed), this will silently reuse the stale db
I think it's better to initialize the DB each time, as it does not take much time
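
A rough sketch of the "initialize each time" idea, assuming the db is a single file on disk: remove whatever is there and rebuild it. `recreate_db` and the `create_fn` callback are hypothetical; the real creation call is whatever the dataset library provides and is deliberately not spelled out here.

```python
import os
import tempfile
from typing import Callable


def recreate_db(db_path: str, create_fn: Callable[[str], None]) -> None:
    # Drop any stale db file so a previously broken setup is never reused.
    if os.path.exists(db_path):
        os.remove(db_path)
    create_fn(db_path)


def _write_marker(marker: str) -> Callable[[str], None]:
    # Hypothetical creation routine: writes a marker instead of a real db.
    def create(path: str) -> None:
        with open(path, "w") as f:
            f.write(marker)

    return create


with tempfile.TemporaryDirectory() as tmp:
    db_path = os.path.join(tmp, "ttyrecs.db")
    recreate_db(db_path, _write_marker("wrong-data-path"))
    # The second call replaces the first db instead of silently keeping it.
    recreate_db(db_path, _write_marker("fixed-data-path"))
    with open(db_path) as f:
        db_contents = f.read()
```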

vkurenkov · 2023-05-25T02:58:37Z

katakomba/utils/datasets/small_scale.py

+CACHE_PATH = os.environ.get('KATAKOMBA_CACHE_DIR', os.path.expanduser('~/.katakomba/cache'))
+
+
+def _flush_to_memmap(filename: str, array: np.ndarray):


return type is missing
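
As an illustration of the annotated signature, here is a sketch with an explicit return type. The function body is an assumption about what such a helper might do (write the array to a disk-backed buffer and reopen it read-only); it is not the PR's implementation.

```python
import os
import tempfile

import numpy as np


def _flush_to_memmap(filename: str, array: np.ndarray) -> np.memmap:
    # Write the array into a disk-backed buffer and flush it to the file.
    writer = np.memmap(filename, mode="w+", dtype=array.dtype, shape=array.shape)
    writer[:] = array
    writer.flush()
    del writer
    # Reopen read-only so callers cannot modify the cached data by accident.
    return np.memmap(filename, mode="r", dtype=array.dtype, shape=array.shape)


with tempfile.TemporaryDirectory() as tmp:
    source = np.arange(6, dtype=np.int64).reshape(2, 3)
    mapped = _flush_to_memmap(os.path.join(tmp, "obs.dat"), source)
    roundtrip_ok = bool(np.array_equal(mapped, source))
    is_memmap = isinstance(mapped, np.memmap)
    # Release the mapping before the temporary directory is removed.
    del mapped
```

Since `np.memmap` is a subclass of `np.ndarray`, annotating the return as `np.ndarray` would also be valid if the narrower type is not wanted.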

vkurenkov · 2023-05-25T03:02:04Z

katakomba/utils/datasets/small_scale.py

+        gameid = self.gameids[idx]
+        return dict(self.hdf5_file[gameid].attrs)
+
+    def close(self):


let's add a flag for cleaning the memmap
sometimes people would like to work with just one dataset and rebuilding it every time is not desirable
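
A minimal sketch of what such a flag could look like, assuming the dataset tracks its cache file paths. The class name, the `cache_files` attribute, and the `clean_cache` parameter are hypothetical and may differ from what was eventually merged.

```python
import os
import tempfile
from typing import List


class MemmapBackedDataset:
    # Hypothetical stand-in for a dataset that caches arrays on disk.
    def __init__(self, cache_files: List[str]) -> None:
        self.cache_files = list(cache_files)

    def close(self, clean_cache: bool = True) -> None:
        # Keep the files when the caller wants to reuse the cache next run.
        if not clean_cache:
            return
        for path in self.cache_files:
            if os.path.exists(path):
                os.remove(path)
        self.cache_files = []


with tempfile.TemporaryDirectory() as tmp:
    kept = os.path.join(tmp, "kept.dat")
    removed = os.path.join(tmp, "removed.dat")
    for p in (kept, removed):
        open(p, "wb").close()
    MemmapBackedDataset([kept]).close(clean_cache=False)
    MemmapBackedDataset([removed]).close(clean_cache=True)
    kept_exists = os.path.exists(kept)
    removed_exists = os.path.exists(removed)
```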

vkurenkov · 2023-05-25T03:02:19Z

katakomba/utils/datasets/small_scale.py

+    def close(self):
+        self.hdf5_file.close()
+        # remove memmap files from the disk upon closing
+        if self.mode == "memmap":


let's add logging that this is happening
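
Something along these lines would do, using the standard `logging` module. The `remove_cache_files` helper and the logger name are assumptions for illustration only.

```python
import logging
import os
import tempfile
from typing import Iterable, List

logger = logging.getLogger("memmap_cache_demo")


def remove_cache_files(paths: Iterable[str]) -> List[str]:
    # Log each deletion so users can see why the cache disappears on close.
    removed = []
    for path in paths:
        if os.path.exists(path):
            logger.info("Removing memmap cache file: %s", path)
            os.remove(path)
            removed.append(path)
    return removed


with tempfile.TemporaryDirectory() as tmp:
    cache_file = os.path.join(tmp, "tty_chars.dat")
    open(cache_file, "wb").close()
    removed_paths = remove_cache_files([cache_file, os.path.join(tmp, "missing.dat")])
    still_there = os.path.exists(cache_file)
```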

katakomba/utils/datasets/small_scale_buffer.py

scripts/generate_small_dataset.py

init refactor

5aec75e

vkurenkov suggested changes May 24, 2023

vkurenkov reviewed May 24, 2023

Howuhh added 2 commits May 24, 2023 14:49

added datasets to the commit

8712ae4

cql without reward normalization

33216f4

vkurenkov suggested changes May 25, 2023

Howuhh added 18 commits May 25, 2023 11:44

added reward normalisation

2bac99f

report and iql drafts

522fab7

cql sweep, finish report script

d3f4213

added iql and rem

38b664b

revert norm

82a2030

added rem, awac, iql

00c5915

deleted discrete iql

16ccfc3

a lot of stuff

275e502

fix formatting

64b391e

stats

c6d28be

add dataset downloading from hf

e54ec8a

updated requirements and dockerfile

a36b293

fix bug, fix docker

c2966d4

num workers for rendering as config value

f97a565

add typings to the algorithms, remove db if exists

012c0f1

more typings, optional memmap cache cleaning

e22c628

more typings

abbcbef

removed default vector env arg

db0ead8

vkurenkov approved these changes Jun 14, 2023

vkurenkov merged commit 61a7c77 into main Jun 14, 2023


