AI & ML interests
None defined yet.
Recent Activity
View all activity
models
69

science-of-finetuning/gemma-2-2b-gemma-2-2b-it-L13-mu5.5e-02-lr1e-04-local-shuffling-CrosscoderLoss
Updated

science-of-finetuning/gemma-2-2b-gemma-2-2b-it-L13-k256-lr1e-04-local-shuffling-Crosscoder
Updated

science-of-finetuning/gemma-2-2b-gemma-2-2b-it-L13-k55-lr1e-04-local-shuffling-Crosscoder
Updated

science-of-finetuning/gemma-2-2b-gemma-2-2b-it-L13-mu2.5e-02-lr1e-04-local-shuffling-CrosscoderLoss
Updated

science-of-finetuning/SAE-base-Llama-3.2-1B-L8-k100-x32-lr1e-04-local-shuffling
Updated
•
4

science-of-finetuning/R1dist-Qwen-1.5B-Nemotron-L16-k100-lr1e-04-local-shuffling-CCLoss
Updated
•
10

science-of-finetuning/R1dist-Qwen-1.5B-Nemotron-L16-mu3.6e-02-lr1e-04-local-shuffling-CCLoss
Updated
•
8

science-of-finetuning/gemma-2-2b-it-Meditron3-L16-k100-lr1e-04-local-shuffling-CCLoss
Updated
•
9

science-of-finetuning/gemma-2-2b-it-Meditron3-L16-mu3.8e-02-lr1e-04-local-shuffling-CCLoss
Updated
•
10

science-of-finetuning/qwen3_1_7B-em_bad_medical_advice-L14-Crosscoder-s2-t100-k100-lr1e-04-x32
Updated
•
7
datasets
98
science-of-finetuning/diffing-stats-gemma-2-2b-gemma-2-2b-it-L13-mu5.5e-02-lr1e-04-local-shuffling-CrosscoderLoss
Viewer
•
Updated
•
73.7k
science-of-finetuning/diffing-stats-gemma-2-2b-gemma-2-2b-it-L13-k55-lr1e-04-local-shuffling-Crosscoder
Viewer
•
Updated
•
73.7k
science-of-finetuning/diffing-stats-gemma-2-2b-gemma-2-2b-it-L13-k256-lr1e-04-local-shuffling-Crosscoder
Viewer
•
Updated
•
73.7k
science-of-finetuning/diffing-stats-gemma-2-2b-gemma-2-2b-it-L13-mu2.5e-02-lr1e-04-local-shuffling-CrosscoderLoss
Viewer
•
Updated
•
73.7k
science-of-finetuning/diffing-stats-Meta-Llama-3.1-8B-L16-k200-lr1e-04-local-shuffling-Crosscoder-ni0.3-ka1k5k
Viewer
•
Updated
•
131k
•
8
science-of-finetuning/diffing-stats-Meta-Llama-3.1-8B-L16-mu2.0e-02-lr1e-04-local-shuffling-CCLoss
Viewer
•
Updated
•
131k
•
17
science-of-finetuning/diffing-stats-Meta-Llama-3.1-8B-L16-k222-lr1e-04-local-shuffling-Crosscoder
Viewer
•
Updated
•
131k
•
20
science-of-finetuning/diffing-stats-Llama-3.2-1B-L8-mu3.6e-02-lr1e-04-local-shuffling-CrosscoderLoss
Viewer
•
Updated
•
65.5k
•
17
science-of-finetuning/ultrachat_200k_generated_llama3.1-8b-Instruct-mini
Viewer
•
Updated
•
3.97k
•
30
science-of-finetuning/diffing-stats-gemma-2-2b-it-Meditron3-L16-mu3.8e-02-lr1e-04-local-shuffling-CCLoss
Viewer
•
Updated
•
73.7k
•
14