OpenCodeInterpreter-DS-6.7B_Fi__CMP_TR_size_304_epochs_10_2024-06-22_21-11-23_3558620
This model is a fine-tuned version of m-a-p/OpenCodeInterpreter-DS-6.7B on the None dataset. It achieves the following results on the evaluation set:
- Loss: 3.8030
- Accuracy: 0.484
- Chrf: 0.025
- Bleu: 0.003
- Sacrebleu: 0.0
- Rouge1: 0.0
- Rouge2: 0.0
- Rougel: 0.0
- Rougelsum: 0.0
- Meteor: 0.079
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.001
- train_batch_size: 1
- eval_batch_size: 1
- seed: 3407
- distributed_type: multi-GPU
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-06
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 304
- training_steps: 3040
Training results
Training Loss | Epoch | Step | Validation Loss | Accuracy | Chrf | Bleu | Sacrebleu | Rouge1 | Rouge2 | Rougel | Rougelsum | Meteor |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0.8053 | 1.0 | 304 | 4.0078 | 0.465 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
0.0803 | 2.0 | 608 | 4.1351 | 0.468 | 0.003 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.034 |
0.0933 | 3.0 | 912 | 4.2756 | 0.45 | 0.007 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.063 |
0.048 | 4.0 | 1216 | 4.1382 | 0.458 | 0.022 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.084 |
1.2159 | 5.0 | 1520 | 4.2940 | 0.452 | 0.022 | 0.0 | 0.0 | 0.003 | 0.0 | 0.003 | 0.003 | 0.053 |
0.0978 | 6.0 | 1824 | 4.5470 | 0.462 | 0.007 | 0.0 | 0.0 | 0.003 | 0.0 | 0.001 | 0.003 | 0.017 |
0.0756 | 7.0 | 2128 | 4.0928 | 0.472 | 0.014 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.03 |
0.0864 | 8.0 | 2432 | 3.8324 | 0.477 | 0.021 | 0.003 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.064 |
0.0474 | 9.0 | 2736 | 3.8037 | 0.489 | 0.024 | 0.003 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.079 |
0.0827 | 10.0 | 3040 | 3.8030 | 0.484 | 0.025 | 0.003 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.079 |
Framework versions
- PEFT 0.7.1
- Transformers 4.37.0
- Pytorch 2.2.1+cu121
- Datasets 2.20.0
- Tokenizers 0.15.2
- Downloads last month
- 4
Model tree for vdavidr/OpenCodeInterpreter-DS-6.7B_Fi__CMP_TR_size_304_epochs_10_2024-06-22_21-11-23_3558620
Base model
m-a-p/OpenCodeInterpreter-DS-6.7B