Ahmadzei's picture
update 1
57bdca5
raw
history blame
138 Bytes
This is true even if you're training the model further - you will probably get the best
performance if you keep the chat tokens constant.