
When innocence fades,
And then goes away—
A new fiendish purpose— guides its way.
Once impish, now fiendish, for many to play,
Three billion parameters of slop underway…
From an impish design— with a quite wholesome tune,
This fiendish bitch, was made just to goon.
Included Character cards in this repo:
- Shmena Koeset (An overweight and foul-mouthed troll huntress with a bad temper.)
Other character cards:
- Takai_Puraisu (Car dealership simulator)
- Vesper (Schizo Space Adventure)
- Nina_Nakamura (The sweetest dorky co-worker)
- Employe#11 (Schizo workplace with a schizo worker)
TL;DR
- Impish_LLAMA_3B's naughty sister. Less wholesome, more edge. NOT better, but different.
- Superb Roleplay for a 3B size.
- Short length response (1-2 paragraphs, usually 1), CAI style.
- Naughty, and more evil that follows instructions well enough, and keeps good formatting.
- LOW refusals - Total freedom in RP, can do things other RP models won't, and I'll leave it at that. Low refusals in assistant tasks as well.
- VERY good at following the character card. Try the included characters if you're having sub optimal results.
Important: Make sure to use the correct settings!
Fiendish_LLAMA_3B is available at the following quantizations:
- Original: FP16
- GGUF & iMatrix: GGUF | iMatrix
- EXL2: 3.5 bpw | 4.0 bpw | 5.0 bpw | 6.0 bpw | 7.0 bpw | 8.0 bpw
- GPTQ: 4-Bit-128
- Specialized: FP8
- Mobile (ARM): Q4_0
Model Details
Intended use: Role-Play, Creative Writing, General Tasks.
Censorship level: Medium
4.5 / 10 (10 completely uncensored)
UGI score:

Recommended settings for assistant mode
Full generation settings: Debug Deterministic.

Full generation settings: min_p.

Recommended settings for Roleplay mode
Settings for RP, click below to expand:
Roleplay settings:
A good repetition_penalty range is between 1.12 - 1.15, feel free to experiment.With these settings, each output message should be neatly displayed in 1 - 5 paragraphs, 2 - 3 is the most common. A single paragraph will be output as a response to a simple message ("What was your name again?").
min_P for RP works too but is more likely to put everything under one large paragraph, instead of a neatly formatted short one. Feel free to switch in between.
(Open the image in a new window to better see the full details)
temperature: 0.8
top_p: 0.95
top_k: 25
typical_p: 1
min_p: 0
repetition_penalty: 1.12
repetition_penalty_range: 1024
Roleplay format: Classic Internet RP
*action* speech *narration*
- min_p will bias towards a single big paragraph.
- The recommended RP settings will bias towards 1-3 small paragraphs (on some occasions 4-5)
Model instruction template: Llama-3-Instruct
<|begin_of_text|><|start_header_id|>system<|end_header_id|>
{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>
{input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
{output}<|eot_id|>
Other recommended generation Presets:
Midnight Enigma
max_new_tokens: 512
temperature: 0.98
top_p: 0.37
top_k: 100
typical_p: 1
min_p: 0
repetition_penalty: 1.18
do_sample: True
Divine Intellect
max_new_tokens: 512
temperature: 1.31
top_p: 0.14
top_k: 49
typical_p: 1
min_p: 0
repetition_penalty: 1.17
do_sample: True
simple-1
max_new_tokens: 512
temperature: 0.7
top_p: 0.9
top_k: 20
typical_p: 1
min_p: 0
repetition_penalty: 1.15
do_sample: True
Your support = more models
My Ko-fi page (Click here)Citation Information
@llm{Fiendish_LLAMA_3B,
author = {SicariusSicariiStuff},
title = {Fiendish_LLAMA_3B},
year = {2025},
publisher = {Hugging Face},
url = {https://huggingface.co/SicariusSicariiStuff/Fiendish_LLAMA_3B}
}
Other stuff
- SLOP_Detector Nuke GPTisms, with SLOP detector.
- LLAMA-3_8B_Unaligned The grand project that started it all.
- Blog and updates (Archived) Some updates, some rambles, sort of a mix between a diary and a blog.
- Downloads last month
- 53
Model tree for SicariusSicariiStuff/Fiendish_LLAMA_3B
Base model
meta-llama/Llama-3.2-3B-Instruct