Update README.md
Browse files
README.md
CHANGED
|
@@ -11,6 +11,10 @@ An unslop finetune of [google/gemma-3-4b-it](https://huggingface.co/google/gemma
|
|
| 11 |
|
| 12 |
### Updates / Observations
|
| 13 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 14 |
I've received some excellent feedback.
|
| 15 |
|
| 16 |
Some usage notes: Low temp recommended. My training technique uses high temp to try to hit slop edge cases, but I ended up baking in some trippiness on accident I think.
|
|
|
|
| 11 |
|
| 12 |
### Updates / Observations
|
| 13 |
|
| 14 |
+
An updated version of this model is here: [v3](https://huggingface.co/electroglyph/gemma-3-4b-it-unslop-GRPO-v3)
|
| 15 |
+
|
| 16 |
+
---
|
| 17 |
+
|
| 18 |
I've received some excellent feedback.
|
| 19 |
|
| 20 |
Some usage notes: Low temp recommended. My training technique uses high temp to try to hit slop edge cases, but I ended up baking in some trippiness on accident I think.
|