distily_attn_mlp_sweep / benchmarks.shelve.bak
lapp0's picture
End of training
a20cb45 verified