Safeword/Abomination 36B/70B 4.1
A first for Safeword: a 36B size, with an Abomination alternate.
A second attempt at 70B - We learned a lot from the first attempt.
Enjoy :)
https://huggingface.co/ReadyArt/Fallen-Safeword-70B-R1-v4.1
https://huggingface.co/ReadyArt/Fallen-Abomination-70B-R1-v4.1
https://huggingface.co/ReadyArt/Forgotten-Safeword-36B-4.1
https://huggingface.co/ReadyArt/Forgotten-Abomination-36B-v4.1
Thanks a lot for continuing to create such awesome models. I'm really looking forward to giving them a try, and I'm especially excited for the second attempt at your 70B model. I queued them all. As always, you can follow their status at https://hf.tst.eu/status.html
They will appear on the download page under:
- https://hf.tst.eu/model#Fallen-Safeword-70B-R1-v4.1-GGUF
- https://hf.tst.eu/model#Fallen-Abomination-70B-R1-v4.1-GGUF
- https://hf.tst.eu/model#Forgotten-Safeword-36B-4.1-GGUF
- https://hf.tst.eu/model#Forgotten-Abomination-36B-v4.1-GGUF
@mradermacher Just look at https://huggingface.co/ReadyArt/Fallen-Safeword-70B-R1-v4.1 to see a beautiful model card. They not only managed to design the readme like their own website but even put an animated webp on there.
We just changed the tokenizer_config.json
"eos_token": "<|eot_id|>",
to
"eos_token": "<|end▁of▁sentence|>",
That should fix the token leakage.
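For anyone who wants to apply the same change to a local copy, here is a minimal sketch of the edit in Python. The helper name `set_eos_token` and the file path are just for illustration (they are not part of any library), and it assumes `tokenizer_config.json` is plain UTF-8 JSON with a top-level `eos_token` key, as in the snippet above.

```python
import json
from pathlib import Path


def set_eos_token(config_path, new_eos):
    """Rewrite the top-level "eos_token" in a tokenizer_config.json file.

    Hypothetical helper for illustration. Returns the previous value
    (None if the key was missing) so the change can be checked or reverted.
    """
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))

    old_eos = config.get("eos_token")
    config["eos_token"] = new_eos

    # ensure_ascii=False keeps the special characters in the token readable
    # instead of turning them into \uXXXX escapes.
    path.write_text(
        json.dumps(config, indent=2, ensure_ascii=False) + "\n",
        encoding="utf-8",
    )
    return old_eos


if __name__ == "__main__":
    # Assumed location of a local copy of the model repo; adjust as needed.
    target = Path("tokenizer_config.json")
    if target.exists():
        previous = set_eos_token(target, "<|end▁of▁sentence|>")
        print(f"eos_token changed from {previous!r}")
```

Note that this only touches `tokenizer_config.json`; whether other files in the repo also need a matching EOS setting is not something this sketch checks.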
Oh no. I ran nukeall and requeued all of them. Luckily they were imatrix-blocked due to blocked/budget, thanks to DeepSeek-V2.5-236B, so not much work was lost.
Indeed, the 70B one is near the top of my list to try :)
I'm looking forward to hearing how it goes. I don't personally run anything that my own 4090 can't run, and I made the 70B by request for friends, but the llama/deepseek architecture of the Fallen model it's trained on is totally different from what I've been training.