Anwaarma
/

Llama-20242025-2

Generated from Trainer

Model card Files Files and versions Community

Anwaarma commited on 4 days ago

Commit

9bb51f7

·

verified ·

1 Parent(s): 2cb280d

Anwaarma/Llama-20242025-3

Files changed (1) hide show

README.md +7 -6

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3860
-- F1: 0.7935
 ## Model description
@@ -38,22 +38,23 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1.79e-05
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
-- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 0.2849        | 1.0   | 3032 | 1.1011          | 0.8009 |
-| 0.2724        | 2.0   | 6064 | 1.3860          | 0.7935 |
 ### Framework versions

 This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.1898
+- F1: 0.7663
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2e-05
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 0.4873        | 1.0   | 3032 | 0.7785          | 0.7518 |
+| 0.4535        | 2.0   | 6064 | 1.1578          | 0.7603 |
+| 0.3595        | 3.0   | 9096 | 1.1898          | 0.7663 |
 ### Framework versions