Text Generation
Transformers
Safetensors
English
mistral
text-generation-inference
unsloth
trl
Eval Results
legolasyiu commited on
Commit
cb04486
·
verified ·
1 Parent(s): 4b5d6d3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -15,7 +15,7 @@ tags:
15
  <img src="https://huggingface.co/EpistemeAI/Fireball-Mistral-Nemo-Base-2407-v1-DPO2/resolve/main/fireball.JPG" width="200"/>
16
 
17
 
18
- # Fireball-Mistral-Nemo-Base-2407-V2
19
  This model is super fine-tune to provide better coding and better response(from first fine-tune) than Llama-3.1-8B and Google Gemma 2 9B.
20
  Further fine tuned with ORPO method with dataset
21
  - reciperesearch/dolphin-sft-v0.1-preference
@@ -46,7 +46,7 @@ This mistral model was trained 2x faster with [Unsloth](https://github.com/unslo
46
 
47
 
48
 
49
- # Model Card for Mistral-Nemo-Base-2407
50
 
51
  The Mistral-Nemo-Base-2407 Large Language Model (LLM) is a pretrained generative text model of 12B parameters trained jointly by Mistral AI and NVIDIA, it significantly outperforms existing models smaller or similar in size.
52
 
 
15
  <img src="https://huggingface.co/EpistemeAI/Fireball-Mistral-Nemo-Base-2407-v1-DPO2/resolve/main/fireball.JPG" width="200"/>
16
 
17
 
18
+ # Fireball-12B
19
  This model is super fine-tune to provide better coding and better response(from first fine-tune) than Llama-3.1-8B and Google Gemma 2 9B.
20
  Further fine tuned with ORPO method with dataset
21
  - reciperesearch/dolphin-sft-v0.1-preference
 
46
 
47
 
48
 
49
+ # Model Card for Fireball-12B
50
 
51
  The Mistral-Nemo-Base-2407 Large Language Model (LLM) is a pretrained generative text model of 12B parameters trained jointly by Mistral AI and NVIDIA, it significantly outperforms existing models smaller or similar in size.
52