do not generation prompt in the end, irrelavent for training and evaluation 36395a3 Luigi commited on Apr 28
eval loss got NAN but train loss kepp finite, adjust hyper-parameters aa49cd2 Luigi commited on Apr 28