ggbetz commited on
Commit
db1e897
·
verified ·
1 Parent(s): fad1287

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -1
README.md CHANGED
@@ -21,6 +21,10 @@ licence: llama3.1
21
 
22
  # Model Card for Llama-3.1-Argunaut-1-8B-SPIN
23
 
 
 
 
 
24
 
25
  This model is a fine-tuned version of [DebateLabKIT/Llama-3.1-Argunaut-1-8B-SFT](https://huggingface.co/DebateLabKIT/Llama-3.1-Argunaut-1-8B-SFT).
26
  It has been trained using [TRL](https://github.com/huggingface/trl) and [vLLM](https://docs.vllm.ai/). Checkpoints are tagged.
@@ -42,7 +46,7 @@ print(output["generated_text"])
42
 
43
  This model was trained with Self-Play Fine-Tuning (SPIN), a method introduced in [Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models](https://huggingface.co/papers/2401.01335).
44
 
45
- More details about the training procedure will be released in a blog post!
46
 
47
  ### Framework versions
48
 
 
21
 
22
  # Model Card for Llama-3.1-Argunaut-1-8B-SPIN
23
 
24
+ 📘 [HF Blog Article](https://huggingface.co/blog/ggbetz/argunauts-phase-2)
25
+
26
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/ggbetz/argunauts-training/runs/s89n820x)
27
+
28
 
29
  This model is a fine-tuned version of [DebateLabKIT/Llama-3.1-Argunaut-1-8B-SFT](https://huggingface.co/DebateLabKIT/Llama-3.1-Argunaut-1-8B-SFT).
30
  It has been trained using [TRL](https://github.com/huggingface/trl) and [vLLM](https://docs.vllm.ai/). Checkpoints are tagged.
 
46
 
47
  This model was trained with Self-Play Fine-Tuning (SPIN), a method introduced in [Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models](https://huggingface.co/papers/2401.01335).
48
 
49
+ More details about the training procedure are described in the [blog post](https://huggingface.co/blog/ggbetz/argunauts-phase-2).
50
 
51
  ### Framework versions
52