noanabeshima
/

tiny_model_cached_acts

Model card Files Files and versions Community

noanabeshima commited on Jul 9, 2024

Commit

b431aec

·

verified ·

1 Parent(s): e0d156c

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -28,4 +28,6 @@ feat_acts = load_feat_acts(f"mlp_map_test/M2_S-2_R1_P0/{300}.pt").to_dense()
 ```
 The activations are from 13% of the train split of https://huggingface.co/datasets/noanabeshima/TinyModelTokIds.
-I gather all the activations per feature and then take the smallest activations prefix so that each feature has at least 3K documents on which it activates and 12K token-activations.

 ```
 The activations are from 13% of the train split of https://huggingface.co/datasets/noanabeshima/TinyModelTokIds.
+I gather all the activations per feature and then take the smallest activations prefix so that each feature has at least 3K documents on which it activates and 12K token-activations.
+I think it takes something like 6 hours to do 9K features on a single 4090.