eager-fog-84 distillation from sage-mountain-341 (flan-t5-base tuned) with batch size 32 ec608af unverified rolf-mozilla commited on Apr 24
Distillation test. lively-planet-17 (based on t5-efficient-tiny) distilled from trainer azure-frost-334 (baed on flan-t5-base) db36e89 unverified rolf-mozilla commited on Apr 15
still-durian-309 build with 0.1 setting for word shortening f567279 unverified rolf-mozilla commited on Mar 26
sandy-forest-305 model, similar to devoted-* model but with correct bad word filtering 8776e77 unverified rolf-mozilla commited on Mar 26
major-elevator-302 - title case specific bad word filtering and addition of training data pertaining to searches dbaeff1 unverified rolf-mozilla commited on Mar 24
devoted-puddle-246 training run with 'None' result when uncertain and some removed layers 308b7bd unverified rolf-mozilla commited on Mar 13
new download format for elated-lake-212 model with data stored separately for faster retreival 3eb9b39 unverified rolf-mozilla commited on Mar 9
elated-lake-212 training run with 6 layers removed for size e891659 unverified rolf-mozilla commited on Mar 8
upbeat-eon-195 training run with multiple decoder layers removed to reduce model size. Based on flan-t5-small d90b33a unverified rolf-mozilla commited on Mar 7
pious-butterfly-170 remove layers and fine tune from flan-t5-small model 56bb4af unverified rolf-mozilla commited on Mar 5
dulcet-durian-136 training with t5-efficient-tiny base model e658dc5 unverified rolf-mozilla commited on Feb 25
Dainty-blaze-127 training run with smaller t5-efficient-mini model 4d6d784 unverified rolf-mozilla commited on Feb 25
swift-rain-107 release - different prompt for single title 907f5cd unverified rolf-mozilla commited on Feb 14
cool-yogurt-98 relase with support for keywords in input 84ae69a unverified rolf-mozilla commited on Jan 6
Upgrade to simplified model with less training data. azure-river-73 training run in w&b 1227a4b unverified rolf-mozilla commited on Dec 31, 2024