Mistral-7B-Instruct-v0.2-ORPO / train_results.json
chchen's picture
End of training
5238045 verified
raw
history blame
221 Bytes
{
"epoch": 2.986666666666667,
"total_flos": 2.303993975490478e+17,
"train_loss": 1.1586682626179285,
"train_runtime": 5352.1206,
"train_samples_per_second": 0.504,
"train_steps_per_second": 0.031
}