Update README.md
Browse files
README.md
CHANGED
@@ -11,6 +11,8 @@ tags:
|
|
11 |
- trl
|
12 |
---
|
13 |
|
|
|
|
|
14 |
# Fireball-12B-V1.0
|
15 |
This model is super fine-tune to provide better coding and better response(from first fine-tune) than Llama-3.1-8B and Google Gemma 2 9B.
|
16 |
Further fine tuned with ORPO method with dataset. DPO fine-tuned again with much higher quality tuning.
|
|
|
11 |
- trl
|
12 |
---
|
13 |
|
14 |
+
<img src="https://huggingface.co/EpistemeAI/Fireball-Mistral-Nemo-Base-2407-v1-DPO2/resolve/main/fireball.JPG" width="200"/>
|
15 |
+
|
16 |
# Fireball-12B-V1.0
|
17 |
This model is super fine-tune to provide better coding and better response(from first fine-tune) than Llama-3.1-8B and Google Gemma 2 9B.
|
18 |
Further fine tuned with ORPO method with dataset. DPO fine-tuned again with much higher quality tuning.
|