Update README.md

tinkoff-ai · Jun 14, 2023 · da05d35
1 parent 7102f91
Showing 1 changed file with 55 additions and 0 deletions.
diff --git a/README.md b/README.md
@@ -18,6 +18,7 @@ docker run --gpus=all -it --rm --name <container_name> <image_name>
 
 2. ```pip install -e .```
 
+
 ## Baselines
 
 | Algorithm                                                                                                                       | Variants Implemented                               | Wandb Report |
@@ -28,6 +29,60 @@ docker run --gpus=all -it --rm --name <container_name> <image_name>
 | ✅ [Offline Reinforcement Learning with Implicit Q-Learning <br>(IQL)](https://arxiv.org/abs/2110.06169)                         | [`iql_chaotic_lstm.py`](algorithms/small_scale/iql_chaotic_lstm.py)                      | [`Katakomba-All`](https://wandb.ai/tlab/NetHack/reports/-Offline-IQL-Katakomba-All--Vmlldzo0NjEwNzQ4)
 | ✅ [An Optimistic Perspective on Offline Reinforcement Learning <br>(REM)](https://arxiv.org/abs/1907.04543)                     | [`rem_chaotic_lstm.py`](algorithms/small_scale/rem_chaotic_lstm.py)                      | [`Katakomba-All`](https://wandb.ai/tlab/NetHack/reports/-Offline-REM-Katakomba-All--Vmlldzo0NjEwOTYw)
 
+## Datasets
+
+In our benchmark, we treat every character configuration as a separate game to be solved, since different configurations may require very different styles of play in the early game. To this end, we repacked the original large-scale AutoAscend dataset into 38 smaller datasets (AutoAscend is a symbolic agent that is essentially an early-game contender). This decomposition lets practitioners download less data and focus on specific configurations.
+
+Additionally, because benchmarking new algorithms on all of the datasets could be computationally expensive for many practitioners, we separate the benchmark into three categories, ordered as ```roles > races > alignments``` in line with the wisdom of the NetHack community.
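
The task identifiers listed in the table under Tasks appear to follow a `role-race-alignment` pattern (for example ```arc-hum-neu```). The sketch below splits such an identifier into its three parts. The names `CharacterConfig` and `parse_task` are hypothetical helpers for illustration and are not part of the Katakomba codebase.

```python
from typing import NamedTuple


class CharacterConfig(NamedTuple):
    """Hypothetical container for one character configuration."""
    role: str
    race: str
    alignment: str


def parse_task(task: str) -> CharacterConfig:
    """Split an identifier such as 'arc-hum-neu' into role, race, alignment.

    Assumption: identifiers always consist of three non-empty,
    dash-separated abbreviations, as in the Tasks table.
    """
    parts = task.split("-")
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected 'role-race-alignment', got {task!r}")
    return CharacterConfig(*parts)


config = parse_task("arc-hum-neu")
print(config.role, config.race, config.alignment)
```

A helper like this could be used to group tasks by role, race, or alignment when selecting a subset of the benchmark to run.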
+
+We host all of the datasets on the [HuggingFace Hub](https://huggingface.co/datasets/Howuhh/katakomba/tree/main/data), and you can download them from there directly. As described above, however, our wrappers take care of this automatically, similar to the D4RL benchmark.
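
For a manual download, one option is the `huggingface_hub` client. The following is a minimal sketch, not the mechanism used by the Katakomba wrappers. The repository id is taken from the Hub URL above. The file naming inside `data/` is not documented here, so `files_for_task` only assumes that a task's files contain the task identifier in their path; all helper names are hypothetical.

```python
from typing import Iterable, List

REPO_ID = "Howuhh/katakomba"  # dataset repository from the Hub URL above


def list_dataset_files() -> List[str]:
    """Return every file path in the dataset repository (needs network)."""
    # Imported lazily so the pure helper below works without the package.
    from huggingface_hub import list_repo_files

    return list_repo_files(REPO_ID, repo_type="dataset")


def files_for_task(files: Iterable[str], task: str) -> List[str]:
    """Keep paths under data/ whose name contains the task identifier.

    Assumption: file names include the identifier, e.g. 'arc-hum-neu'.
    """
    return [f for f in files if f.startswith("data/") and task in f]


def download(filename: str) -> str:
    """Download one file into the local Hub cache and return its path."""
    from huggingface_hub import hf_hub_download

    return hf_hub_download(REPO_ID, filename=filename, repo_type="dataset")


# Example usage (requires `pip install huggingface_hub` and network access):
#   for name in files_for_task(list_dataset_files(), "arc-hum-neu"):
#       print(download(name))
```

Listing the repository first avoids guessing file names; check the Hub page for the actual layout before relying on the substring filter.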
+
+### Tasks
+
+| **Tasks**                        | **# Transitions** | **Median Turns** | **Median Score** | **Median Death Level** | **Size (GB)** | **Compressed Size (GB)** |
+|---------------------------------------|-------------------------|-----------------------|-----------------------|--------------------------|--------------------|-------------------------------|
+| **Base (Role-Centric)**          | -                       | -                     | -                     | -                        | -                  | -                             |
+| ```arc-hum-neu```               | 24527163                | 32858.0               | 4802.5                | 2.0                      | 94.5               | 1.3                           |
+| ```bar-hum-neu```               | 26266771                | 35716.0               | 11964.0               | 4.0                      | 101.1              | 1.7                           |
+| ```cav-hum-neu```               | 21674680                | 30361.0               | 8152.0                | 4.0                      | 83.5               | 1.3                           |
+| ```hea-hum-neu```               | 14473997                | 18051.0               | 2043.0                | 1.0                      | 55.7               | 0.8                           |
+| ```kni-hum-law```               | 22287283                | 28246.0               | 6305.0                | 3.0                      | 85.8               | 1.5                           |
+| ```mon-hum-neu```               | 33741542                | 42400.0               | 11356.0               | 4.0                      | 129.9              | 2.1                           |
+| ```pri-hum-neu```               | 18376473                | 26796.5               | 5366.5                | 2.0                      | 70.8               | 1.1                           |
+| ```ran-hum-neu```               | 17625493                | 25354.0               | 6168.0                | 2.0                      | 67.9               | 1.0                           |
+| ```rog-hum-cha```               | 14284927                | 19334.0               | 3005.5                | 1.0                      | 55.0               | 0.8                           |
+| ```sam-hum-law```               | 22422537                | 32951.0               | 7850.0                | 4.0                      | 86.3               | 1.3                           |
+| ```tou-hum-neu```               | 13376498                | 17955.5               | 2554.5                | 1.0                      | 51.5               | 0.8                           |
+| ```val-hum-neu```               | 27784788                | 35250.0               | 11402.5               | 4.0                      | 107.0              | 1.8                           |
+| ```wiz-hum-neu```               | 14343449                | 19808.5               | 3132.5                | 1.0                      | 55.2               | 0.8                           |
+| **Extended (Race-Centric)**      | -                       | -                     | -                     | -                        | -                  | -                             |
+| ```pri-elf-cha```               | 18796560                | 26909.5               | 4718.5                | 2.0                      | 72.4               | 1.1                           |
+| ```ran-elf-cha```               | 18238686                | 26607.0               | 7583.0                | 4.0                      | 70.2               | 1.1                           |
+| ```wiz-elf-cha```               | 15277820                | 19512.0               | 2988.5                | 1.0                      | 58.8               | 0.9                           |
+| ```arc-dwa-law```               | 25100788                | 34669.0               | 4026.0                | 1.0                      | 96.7               | 1.5                           |
+| ```cav-dwa-law```               | 22871890                | 32261.0               | 7158.0                | 3.0                      | 88.1               | 1.5                           |
+| ```val-dwa-law```               | 32787658                | 33973.0               | 8652.5                | 3.0                      | 126.6              | 2.5                           |
+| ```arc-gno-neu```               | 24144048                | 34432.0               | 4077.5                | 1.0                      | 93.0               | 1.4                           |
+| ```cav-gno-neu```               | 21624779                | 29860.0               | 6446.0                | 3.0                      | 83.3               | 1.4                           |
+| ```hea-gno-neu```               | 14884704                | 18518.0               | 1980.5                | 1.0                      | 57.3               | 0.9                           |
+| ```ran-gno-neu```               | 17571659                | 25970.0               | 5326.0                | 2.0                      | 67.7               | 1.1                           |
+| ```wiz-gno-neu```               | 14193637                | 19206.0               | 2736.0                | 1.0                      | 54.7               | 0.9                           |
+| ```bar-orc-cha```               | 27826356                | 39291.0               | 10499.0               | 4.0                      | 107.2              | 1.8                           |
+| ```ran-orc-cha```               | 18127448                | 26707.0               | 5460.0                | 2.0                      | 69.8               | 1.1                           |
+| ```rog-orc-cha```               | 16674806                | 22351.0               | 3103.0                | 1.0                      | 64.2               | 1.0                           |
+| ```wiz-orc-cha```               | 15994150                | 22570.5               | 3241.5                | 1.0                      | 61.6               | 1.0                           |
+| **Complete (Alignment-Centric)** | -                       | -                     | -                     | -                        | -                  | -                             |
+| ```arc-hum-law```               | 23422383                | 31446.0               | 4188.0                | 1.0                      | 90.2               | 1.3                           |
+| ```cav-hum-law```               | 22328494                | 31039.0               | 8174.0                | 4.0                      | 86.0               | 1.3                           |
+| ```mon-hum-law```               | 30782317                | 39647.0               | 10855.0               | 4.0                      | 118.5              | 1.9                           |
+| ```pri-hum-law```               | 18298816                | 27192.0               | 4833.0                | 1.0                      | 70.5               | 1.1                           |
+| ```val-hum-law```               | 30171035                | 34570.5               | 9707.0                | 4.0                      | 116.2              | 2.1                           |
+| ```bar-hum-cha```               | 25362111                | 35925.0               | 12574.0               | 5.0                      | 97.7               | 1.6                           |
+| ```mon-hum-cha```               | 33662420                | 41730.5               | 11418.0               | 4.0                      | 129.6              | 2.1                           |
+| ```pri-hum-cha```               | 18667816                | 28204.5               | 5847.0                | 2.0                      | 71.9               | 1.1                           |
+| ```ran-hum-cha```               | 16999630                | 24698.5               | 6236.0                | 2.0                      | 65.6               | 1.0                           |
+| ```wiz-hum-cha```               | 14635591                | 20257.0               | 3294.0                | 1.0                      | 56.4               | 0.9                           |
+
 ## Citing Katakomba
 ```bibtex