DavieLion
/

Llama-3.2-1B-SPIN-iter0

alignment-handbook

Generated from Trainer

Model card Files Files and versions Community

DavieLion commited on Dec 29, 2024

Commit

f796432

·

verified ·

1 Parent(s): bc1a379

Update README.md

Files changed (1) hide show

README.md +8 -22

README.md CHANGED Viewed

@@ -1,40 +1,30 @@
 ---
 base_model:
-- DavieLion/Lllma-3.2-1B
 tags:
 - alignment-handbook
 - generated_from_trainer
 datasets:
-- DavieLion/SPIN_iter0
 model-index:
-- name: iter0-ckpt
   results: []
-license: apache-2.0
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# iter0-ckpt
-This model is a fine-tuned version of [DavieLion/Lllma-3.2-1B](https://huggingface.co/DavieLion/Lllma-3.2-1B) on the DavieLion/SPIN_iter0 dataset.
 ## Model description
 - Model type: A 1B parameter GPT-like model fine-tuned on synthetic datasets.
 - Language(s) (NLP): Primarily English
-- License: Apache License 2.0
-- Finetuned from model: DavieLion/Lllma-3.2-1B (based on meta/Llama-3.2-1B)
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -53,10 +43,6 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_ratio: 0.1
 - num_epochs: 6.0
-### Training results
 ### Framework versions
 - Transformers 4.37.0

 ---
 base_model:
+- meta-llama/Llama-3.2-1B
 tags:
 - alignment-handbook
 - generated_from_trainer
 datasets:
+- HuggingFaceH4/ultrachat_200k
 model-index:
+- name: Llama-3.2-1B-SPIN-iter0
   results: []
+license: llama3.2
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Llama-3.2-1B-SPIN-iter3
+This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) on the [HuggingFaceH4/ultrachat_200k](https://huggingface.co/datasets/HuggingFaceH4/ultrachat_200k) datasets.
 ## Model description
 - Model type: A 1B parameter GPT-like model fine-tuned on synthetic datasets.
 - Language(s) (NLP): Primarily English
+- License: Llama 3.2 Community Lisense Agreement
+- Finetuned from model: meta-llama/Llama-3.2-1B
 ### Training hyperparameters
 - lr_scheduler_warmup_ratio: 0.1
 - num_epochs: 6.0
 ### Framework versions
 - Transformers 4.37.0