SmolLM2-360M-Instruct-TaiwanChat / train_with_unsloth.py

Commit History

adjust hyper-parameters
976c215

Luigi commited on

train on 800k examples
38e2b45

Luigi commited on

train with whole dataset
39559f2

Luigi commited on

show also examples leading to infinite eval loss
9c94812

Luigi commited on

update train script
fc65dac

Luigi commited on

update train script
4bf72b9

Luigi commited on

update train script
c285ad3

Luigi commited on

do not generation prompt in the end, irrelavent for training and evaluation
36395a3

Luigi commited on

decrease val size
d874b94

Luigi commited on

re-implement dataset filtering in more efficient way
afa5f94

Luigi commited on

bugfix
69d7616

Luigi commited on

filter out samples too long for MAX_LEN from dataset
144d876

Luigi commited on

eval loss got NAN but train loss kepp finite, adjust hyper-parameters
aa49cd2

Luigi commited on

adjust train parameters to prevent from underfitting
7d54ecf

Luigi commited on

initial commit
a5af3c2

Luigi commited on