eridu / checkpoint-2772 /scaler.pt

Commit History

Latest training run, just 4 epochs, optimizations all pulled except for FP16, save and eval at epochs to avoid over-fitting
e659e59
unverified

rjurney commited on