Update README.md
Browse files
README.md
CHANGED
|
@@ -3,7 +3,7 @@ license: mit
|
|
| 3 |
datasets:
|
| 4 |
- HuggingFaceH4/ultrafeedback_binarized
|
| 5 |
base_model:
|
| 6 |
-
-
|
| 7 |
---
|
| 8 |
|
| 9 |
-
This is an aligned model based on princeton-nlp/Llama-3-Base-8B-SFT. This model is aligned using the Ultrafeedback dataset, fine-tuned through the Simple Preference Optimization (SimPO) loss. The optimization process was conducted with a single epoch.
|
|
|
|
| 3 |
datasets:
|
| 4 |
- HuggingFaceH4/ultrafeedback_binarized
|
| 5 |
base_model:
|
| 6 |
+
- princeton-nlp/Llama-3-Base-8B-SFT
|
| 7 |
---
|
| 8 |
|
| 9 |
+
This is an aligned model based on princeton-nlp/Llama-3-Base-8B-SFT. This model is aligned using the Ultrafeedback dataset, fine-tuned through the Simple Preference Optimization (SimPO) loss. The optimization process was conducted with a single epoch.
|