Commits · Mozilla/smart-tab-topic

drawn-water-93 distillation run

b0fd1b2
unverified

rolf-mozilla commited on Apr 25

dry-meadow-86 distilled

c63ada0
unverified

rolf-mozilla commited on Apr 25

eager-fog-84 distillation from sage-mountain-341 (flan-t5-base tuned) with batch size 32

ec608af
unverified

rolf-mozilla commited on Apr 24

0.7.0 missing files

a08acd3
unverified

rolf-mozilla commited on Apr 15

cleaned up, fixed bug quantizing 0.7.0 model

4464982
unverified

rolf-mozilla commited on Apr 15

Distillation test. lively-planet-17 (based on t5-efficient-tiny) distilled from trainer azure-frost-334 (baed on flan-t5-base)

db36e89
unverified

rolf-mozilla commited on Apr 15

eager-plant-323 model - no removed layers

999b59c
unverified

rolf-mozilla commited on Apr 1

still-durian-309 build with 0.1 setting for word shortening

f567279
unverified

rolf-mozilla commited on Mar 26

sandy-forest-305 model, similar to devoted-* model but with correct bad word filtering

8776e77
unverified

rolf-mozilla commited on Mar 26

major-elevator-302 - title case specific bad word filtering and addition of training data pertaining to searches

dbaeff1
unverified

rolf-mozilla commited on Mar 24

genial-tree-283 run. Brevity and None support

ca627ad
unverified

rolf-mozilla commited on Mar 15

devoted-puddle-246 training run with 'None' result when uncertain and some removed layers

308b7bd
unverified

rolf-mozilla commited on Mar 13

new download format for elated-lake-212 model with data stored separately for faster retreival

3eb9b39
unverified

rolf-mozilla commited on Mar 9

elated-lake-212 training run with 6 layers removed for size

e891659
unverified

rolf-mozilla commited on Mar 8

upbeat-eon-195 training run with multiple decoder layers removed to reduce model size. Based on flan-t5-small

d90b33a
unverified

rolf-mozilla commited on Mar 7

pious-butterfly-170 remove layers and fine tune from flan-t5-small model

56bb4af
unverified

rolf-mozilla commited on Mar 5

dulcet-durian-136 training with t5-efficient-tiny base model

e658dc5
unverified

rolf-mozilla commited on Feb 25

Dainty-blaze-127 training run with smaller t5-efficient-mini model

4d6d784
unverified

rolf-mozilla commited on Feb 25

gentil-pyramid run -- extractive summarization

4bd8ebe
unverified

rolf-mozilla commited on Feb 16

fixed quantize bug

1c40fd6
unverified

rolf-mozilla commited on Feb 14

swift-rain-107 release - different prompt for single title

907f5cd
unverified

rolf-mozilla commited on Feb 14

cool-yogurt-98 relase with support for keywords in input

84ae69a
unverified

rolf-mozilla commited on Jan 6

Upgrade to simplified model with less training data. azure-river-73 training run in w&b

1227a4b
unverified

rolf-mozilla commited on Dec 31, 2024

update config.json prefx str

a537e51
unverified

vazish commited on Dec 16, 2024

add models

a2b0da0
unverified

vazish commited on Dec 16, 2024

initial commit

81e301b
verified

vazish commited on Dec 16, 2024

Mozilla
/

smart-tab-topic

Commit History

drawn-water-93 distillation run

b0fd1b2
unverified

dry-meadow-86 distilled

c63ada0
unverified

eager-fog-84 distillation from sage-mountain-341 (flan-t5-base tuned) with batch size 32

ec608af
unverified

0.7.0 missing files

a08acd3
unverified

cleaned up, fixed bug quantizing 0.7.0 model

4464982
unverified

Distillation test. lively-planet-17 (based on t5-efficient-tiny) distilled from trainer azure-frost-334 (baed on flan-t5-base)

db36e89
unverified

eager-plant-323 model - no removed layers

999b59c
unverified

still-durian-309 build with 0.1 setting for word shortening

f567279
unverified

sandy-forest-305 model, similar to devoted-* model but with correct bad word filtering

8776e77
unverified

major-elevator-302 - title case specific bad word filtering and addition of training data pertaining to searches

dbaeff1
unverified

genial-tree-283 run. Brevity and None support

ca627ad
unverified

devoted-puddle-246 training run with 'None' result when uncertain and some removed layers

308b7bd
unverified

new download format for elated-lake-212 model with data stored separately for faster retreival

3eb9b39
unverified

elated-lake-212 training run with 6 layers removed for size

e891659
unverified

upbeat-eon-195 training run with multiple decoder layers removed to reduce model size. Based on flan-t5-small

d90b33a
unverified

pious-butterfly-170 remove layers and fine tune from flan-t5-small model

56bb4af
unverified

dulcet-durian-136 training with t5-efficient-tiny base model

e658dc5
unverified

Dainty-blaze-127 training run with smaller t5-efficient-mini model

4d6d784
unverified

gentil-pyramid run -- extractive summarization

4bd8ebe
unverified

fixed quantize bug

1c40fd6
unverified

swift-rain-107 release - different prompt for single title

907f5cd
unverified

cool-yogurt-98 relase with support for keywords in input

84ae69a
unverified

Upgrade to simplified model with less training data. azure-river-73 training run in w&b

1227a4b
unverified

update config.json prefx str

a537e51
unverified

add models

a2b0da0
unverified

initial commit

81e301b
verified

Commit History

drawn-water-93 distillation run b0fd1b2 unverified

dry-meadow-86 distilled c63ada0 unverified

eager-fog-84 distillation from sage-mountain-341 (flan-t5-base tuned) with batch size 32 ec608af unverified

0.7.0 missing files a08acd3 unverified

cleaned up, fixed bug quantizing 0.7.0 model 4464982 unverified

Distillation test. lively-planet-17 (based on t5-efficient-tiny) distilled from trainer azure-frost-334 (baed on flan-t5-base) db36e89 unverified

eager-plant-323 model - no removed layers 999b59c unverified

still-durian-309 build with 0.1 setting for word shortening f567279 unverified

sandy-forest-305 model, similar to devoted-* model but with correct bad word filtering 8776e77 unverified

major-elevator-302 - title case specific bad word filtering and addition of training data pertaining to searches dbaeff1 unverified

genial-tree-283 run. Brevity and None support ca627ad unverified

devoted-puddle-246 training run with 'None' result when uncertain and some removed layers 308b7bd unverified

new download format for elated-lake-212 model with data stored separately for faster retreival 3eb9b39 unverified

elated-lake-212 training run with 6 layers removed for size e891659 unverified

upbeat-eon-195 training run with multiple decoder layers removed to reduce model size. Based on flan-t5-small d90b33a unverified

pious-butterfly-170 remove layers and fine tune from flan-t5-small model 56bb4af unverified

dulcet-durian-136 training with t5-efficient-tiny base model e658dc5 unverified

Dainty-blaze-127 training run with smaller t5-efficient-mini model 4d6d784 unverified

gentil-pyramid run -- extractive summarization 4bd8ebe unverified

fixed quantize bug 1c40fd6 unverified

swift-rain-107 release - different prompt for single title 907f5cd unverified

cool-yogurt-98 relase with support for keywords in input 84ae69a unverified

Upgrade to simplified model with less training data. azure-river-73 training run in w&b 1227a4b unverified

update config.json prefx str a537e51 unverified

add models a2b0da0 unverified

initial commit 81e301b verified

drawn-water-93 distillation run

b0fd1b2
unverified

dry-meadow-86 distilled

c63ada0
unverified

eager-fog-84 distillation from sage-mountain-341 (flan-t5-base tuned) with batch size 32

ec608af
unverified

0.7.0 missing files

a08acd3
unverified

cleaned up, fixed bug quantizing 0.7.0 model

4464982
unverified

Distillation test. lively-planet-17 (based on t5-efficient-tiny) distilled from trainer azure-frost-334 (baed on flan-t5-base)

db36e89
unverified

eager-plant-323 model - no removed layers

999b59c
unverified

still-durian-309 build with 0.1 setting for word shortening

f567279
unverified

sandy-forest-305 model, similar to devoted-* model but with correct bad word filtering

8776e77
unverified

major-elevator-302 - title case specific bad word filtering and addition of training data pertaining to searches

dbaeff1
unverified

genial-tree-283 run. Brevity and None support

ca627ad
unverified

devoted-puddle-246 training run with 'None' result when uncertain and some removed layers

308b7bd
unverified

new download format for elated-lake-212 model with data stored separately for faster retreival

3eb9b39
unverified

elated-lake-212 training run with 6 layers removed for size

e891659
unverified

upbeat-eon-195 training run with multiple decoder layers removed to reduce model size. Based on flan-t5-small

d90b33a
unverified

pious-butterfly-170 remove layers and fine tune from flan-t5-small model

56bb4af
unverified

dulcet-durian-136 training with t5-efficient-tiny base model

e658dc5
unverified

Dainty-blaze-127 training run with smaller t5-efficient-mini model

4d6d784
unverified

gentil-pyramid run -- extractive summarization

4bd8ebe
unverified

fixed quantize bug

1c40fd6
unverified

swift-rain-107 release - different prompt for single title

907f5cd
unverified

cool-yogurt-98 relase with support for keywords in input

84ae69a
unverified

Upgrade to simplified model with less training data. azure-river-73 training run in w&b

1227a4b
unverified

update config.json prefx str

a537e51
unverified

add models

a2b0da0
unverified

initial commit

81e301b
verified