Built with Axolotl

Trained using https://huggingface.co/datasets/cgato/TheSmarts for demonstration purposes. Probably a fairly competent assistant model. May do KTO overtop to sand down the edges and improve performance later.

Prompt Format: ChatML

Roles: system, user, assistant

Training results

Training Loss Epoch Step Validation Loss
0.9338 0.0003 1 0.9347
0.8271 0.0328 100 0.7933
0.9541 0.0656 200 0.8407
0.7497 0.0984 300 0.7934
0.8786 0.1311 400 0.7724
0.8257 0.1639 500 0.7627
0.8258 0.1967 600 0.7679
0.7207 0.2295 700 0.7497
0.9439 0.2623 800 0.7576
0.852 0.2951 900 0.7361
0.7852 0.3279 1000 0.7375
0.7 0.3607 1100 0.7298
0.7865 0.3934 1200 0.7202
0.6182 0.4262 1300 0.7146
0.6885 0.4590 1400 0.7131
0.7154 0.4918 1500 0.7083
0.7187 0.5246 1600 0.7016
0.6877 0.5574 1700 0.6976
0.7908 0.5902 1800 0.6946
0.7664 0.6230 1900 0.6894
0.7214 0.6557 2000 0.6857
0.6971 0.6885 2100 0.6837
0.6527 0.7213 2200 0.6804
0.6815 0.7541 2300 0.6781
0.6359 0.7869 2400 0.6759
0.6874 0.8197 2500 0.6742
0.5999 0.8525 2600 0.6728
0.7391 0.8852 2700 0.6719
0.6509 0.9180 2800 0.6710
0.6346 0.9508 2900 0.6702
0.7023 0.9836 3000 0.6696
Downloads last month
0
Safetensors
Model size
12.2B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support