Cascade0-Series
Collection
First Cascade Series.
•
2 items
•
Updated
(Trained only on 4.8B tokens)
Experimental DPO Pass to see the difference.
In some cases, during certain chats, the new DPO actually helps and makes the model feel more chat-y, sort of.
For eg, when asking 'How to make a salad', it responds:
All models are evaluated using LMEval Harness, on the same PC/Settings and GGUF with F16 Quant.
made with LMEval Harness