Update README.md
README.md
CHANGED
@@ -4,7 +4,7 @@ datasets:
 - HuggingFaceH4/ultrachat_200k
 language:
 - en
-base_model:
+base_model: alignment-handbook/zephyr-7b-sft-full
 pipeline_tag: text-generation
 ---
 Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models (https://arxiv.org/abs/2401.01335)