Latest training run: just 4 epochs, all optimizations removed except FP16, saving and evaluating at each epoch to avoid overfitting
e659e59 unverified
{
"word_embedding_dimension": 384,
"pooling_mode_cls_token": false,
"pooling_mode_mean_tokens": true,
"pooling_mode_max_tokens": false,
"pooling_mode_mean_sqrt_len_tokens": false,
"pooling_mode_weightedmean_tokens": false,
"pooling_mode_lasttoken": false,
"include_prompt": true
}
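The keys above match the layout used by sentence-transformers pooling configs, and only `pooling_mode_mean_tokens` is switched on, so the sentence vector would be the average of the 384-dimensional token embeddings. The following is a minimal pure-Python sketch of that idea, not the library's implementation. The helper names `active_pooling_modes` and `mean_pool` are hypothetical, and the toy vectors are 2-dimensional instead of 384 to keep the example readable.

```python
import json

# The pooling settings from the config above, inlined so the sketch is self-contained.
CONFIG_JSON = """
{
  "word_embedding_dimension": 384,
  "pooling_mode_cls_token": false,
  "pooling_mode_mean_tokens": true,
  "pooling_mode_max_tokens": false,
  "pooling_mode_mean_sqrt_len_tokens": false,
  "pooling_mode_weightedmean_tokens": false,
  "pooling_mode_lasttoken": false,
  "include_prompt": true
}
"""


def active_pooling_modes(config):
    """Return the names of the pooling_mode_* flags that are set to true (hypothetical helper)."""
    return [key for key, value in config.items()
            if key.startswith("pooling_mode_") and value is True]


def mean_pool(token_embeddings, attention_mask):
    """Average the token vectors whose mask entry is 1, skipping padding (hypothetical helper)."""
    dim = len(token_embeddings[0])
    total = [0.0] * dim
    count = 0
    for vector, keep in zip(token_embeddings, attention_mask):
        if keep:
            count += 1
            for i, x in enumerate(vector):
                total[i] += x
    if count == 0:
        raise ValueError("attention mask selects no tokens")
    return [t / count for t in total]


config = json.loads(CONFIG_JSON)
modes = active_pooling_modes(config)

# Toy input: three token vectors, the last one is padding and is masked out.
tokens = [[1.0, 2.0], [3.0, 4.0], [9.0, 9.0]]
mask = [1, 1, 0]
sentence_vector = mean_pool(tokens, mask)

print(modes)
print(sentence_vector)
```

With a real model, each token vector would have `word_embedding_dimension` (384) entries and the attention mask would come from the tokenizer. Handling of `include_prompt` is not sketched here.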