Rei-KTO-24B

Created by
Delta-Vector
Model Information
Rei-KTO-24B
A model meant to replicate the style and prose of the Anthropic Claude models, Opus and Sonnet. It is intended for roleplay and creative writing, and has some nice smarts without being too sloppy. It's pretty good. Training was done in two steps: first SFT on Zerofata's PaintedFantasy, which I found great at anime-otaku-esque characters, then KTO to improve coherency and instruction following.
Quantized Versions
Available Downloads
- GGUF Format: for use with llama.cpp & forks (Ty Mradermacher <3)
- EXL2 Format: for use with TabbyAPI (Coming Soon!)
Prompting
The model is tuned with V7 Tekken formatting. A typical input would look like this:
[SYSTEM_PROMPT]system_prompt[/SYSTEM_PROMPT][INST]Hi there![/INST]Nice to meet you![INST]Can I ask a question?[/INST]
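As a minimal sketch, the string above can be assembled from a list of turns like this. The helper name `build_prompt` and its signature are hypothetical, not part of any library. The sketch also assumes that BOS/EOS tokens are added by the tokenizer or backend, since the example above does not show them; adjust if your frontend expects them in the prompt text.

```python
from typing import List, Optional, Tuple


def build_prompt(
    system_prompt: Optional[str],
    turns: List[Tuple[str, Optional[str]]],
) -> str:
    """Assemble a prompt in the bracketed format shown above.

    Hypothetical helper: `turns` is a list of (user, assistant) pairs.
    Use None as the assistant reply for the final turn that the model
    should complete.
    """
    parts = []
    if system_prompt:
        parts.append(f"[SYSTEM_PROMPT]{system_prompt}[/SYSTEM_PROMPT]")
    for user_msg, assistant_msg in turns:
        parts.append(f"[INST]{user_msg}[/INST]")
        # Assumption: no EOS token is appended after assistant replies here,
        # matching the example in this card.
        if assistant_msg is not None:
            parts.append(assistant_msg)
    return "".join(parts)


prompt = build_prompt(
    "system_prompt",
    [("Hi there!", "Nice to meet you!"), ("Can I ask a question?", None)],
)
print(prompt)
```

Most frontends apply this template for you; building it by hand is mainly useful when sending raw completions to a backend.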
Training
Training was done in 2 steps: SFT, then KTO.
Access Configs
SFT: https://wandb.ai/new-eden/Painted-Fantasy-Rei/artifacts/axolotl-config/config-u7to9d5q/v0/files/axolotl_config_f0p7vnaf.yml
KTO: https://wandb.ai/new-eden/Painted-Rei/artifacts/axolotl-config/config-8n37w77c/v0/files/axolotl_config_hvrd2tzn.yml
Training
Training ran for 2 epochs on 8 x A100s and took 24 hours.
Credits
Thank you to Lucy Knada, Ateron, Alicat, Intervitens, Cgato, Kubernetes Bad and the rest of Anthracite.
Model tree for Delta-Vector/Rei-24B-KTO
- Base model: mistralai/Mistral-Small-3.1-24B-Base-2503
- Finetuned: zerofata/MS3.2-PaintedFantasy-24B
- Finetuned: Delta-Vector/Rei-24B-Base