Update README.md
README.md
CHANGED
@@ -5,7 +5,13 @@ license: mit
|
|
5 |
For an explanation of this project and the models trained for it, please see the [Report](Report/REPORT.md).
|
6 |
|
7 |
The root folder contains scripts for dataset preprocessing.
|
8 |
-
|
9 |
-
|
10 |
-
|
11 |
-
[chess-
|
5 |
For an explanation of this project and the models trained for it, please see the [Report](Report/REPORT.md).
|
6 |
|
7 |
The root folder contains scripts for dataset preprocessing.
|
8 |
+
|
9 |
+
[chess-mamba-vs-xformer](../../tree/main/chess-mamba-vs-xformer/) contains the training scripts.
|
10 |
+
|
11 |
+
Config files, used to set model configuration and training hyperparameters, are in [chess-mamba-vs-xformer/config](../../tree/main/chess-mamba-vs-xformer/config).
|
12 |
+
|
13 |
+
Model checkpoints are in [chess-mamba-vs-xformer/out](../../tree/main/chess-mamba-vs-xformer/out). For completed models (e.g. Mamba and Transformer 50M), the last checkpoint is .../anneal/anneal_complete.pt.
|
14 |
+
|
15 |
+
[chess-gpt-eval](../../tree/main/chess-gpt-eval/) has the scripts for model evaluation: playing games against the Stockfish or lc0 chess engines. The logs folder contains raw evaluation metrics.
|
16 |
+
|
17 |
+
[chess-gpt-eval-contrastive](../../tree/main/chess-gpt-eval-contrastive/) likewise has the scripts for model evaluation, but modified for training and evaluation of contrastive activation and linear probes. The logs folder again contains raw evaluation metrics.
|
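The checkpoint layout described in the added README text could be explored with a short script. This is only a minimal sketch: the helper name `find_final_checkpoints` is hypothetical and not part of the repository, and because the README elides the folders between `out` and `anneal` (the `...` in the path), the sketch searches recursively rather than assuming any particular model folder name.

```python
from pathlib import Path


def find_final_checkpoints(out_dir):
    """Return all .../anneal/anneal_complete.pt files under out_dir, sorted by path."""
    root = Path(out_dir)
    # The README elides the folders between out/ and anneal/, so search
    # recursively instead of assuming a particular model folder name.
    return sorted(
        p for p in root.rglob("anneal_complete.pt") if p.parent.name == "anneal"
    )


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    for ckpt in find_final_checkpoints("chess-mamba-vs-xformer/out"):
        print(ckpt)
```

Run from the repository root, this would list one path per completed model, assuming the layout matches the README's description.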