Update README.md
Browse files
README.md
CHANGED
@@ -10,11 +10,28 @@ tags:
|
|
10 |
- merge
|
11 |
|
12 |
---
|
13 |
-
#
|
14 |
|
15 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
16 |
|
17 |
## Merge Details
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
### Merge Method
|
19 |
|
20 |
This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [inflatebot/helide-beta-r1](https://huggingface.co/inflatebot/helide-beta-r1) as a base.
|
|
|
10 |
- merge
|
11 |
|
12 |
---
|
13 |
+
# L3-Helium3-8B
|
14 |
|
15 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
16 |
|
17 |
## Merge Details
|
18 |
+
|
19 |
+
There was a problem with the Helide beta. 3 models resulted, each of which had different strengths. But they came about as a result of balancing two models.
|
20 |
+
That math wasn't quite mathing. There wasn't going to be a way to get the best of all three worlds just by tweaking a SLERP ratio.
|
21 |
+
|
22 |
+
But there were three of them.
|
23 |
+
The name was serendipity.
|
24 |
+
The layup was obscene.
|
25 |
+
And I *live* for the bit.
|
26 |
+
|
27 |
+
Helium-3 is a RP and storywriting hybrid, ultimately based on Sao10K's [Stheno](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2) and Fizzarolli's [Rosier](https://huggingface.co/Fizzarolli/L3-8b-Rosier-v1), and the culmination of the Helide project.
|
28 |
+
Combining Rosier's prose and knowledge of niche fetish with Stheno's steerability and crackling personality, Helium-3 brings the advancements of modern AI models to the Freaks™.
|
29 |
+
They'll chew you up and spit you out just as readily as they'll shower you with affection.
|
30 |
+
|
31 |
+
I'm genuinely proud of this one. This is the model I wish existed.
|
32 |
+
|
33 |
+
Thank you to [Fizzarolli](https://huggingface.co/Fizzarolli) for consulting and providing technical assistance which accelerated the second leg of this project from several weeks into a single night, and for making the Rosier model that made this possible. On several levels, H3 wouldn't have been possible without her.
|
34 |
+
|
35 |
### Merge Method
|
36 |
|
37 |
This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [inflatebot/helide-beta-r1](https://huggingface.co/inflatebot/helide-beta-r1) as a base.
|