Fiendish_LLAMA_3B
Fiendish_LLAMA_3B

Click here for TL;DR


When innocence fades,
And then goes away—
A new fiendish purpose— guides its way.

Once impish, now fiendish, for many to play,
Three billion parameters of slop underway…

From an impish design— with a quite wholesome tune,
This fiendish bitch, was made just to goon.


Included Character cards in this repo:

  • Shmena Koeset (An overweight and foul-mouthed troll huntress with a bad temper.)

Other character cards:


TL;DR

  • Impish_LLAMA_3B's naughty sister. Less wholesome, more edge. NOT better, but different.
  • Superb Roleplay for a 3B size.
  • Short length response (1-2 paragraphs, usually 1), CAI style.
  • Naughty, and more evil that follows instructions well enough, and keeps good formatting.
  • LOW refusals - Total freedom in RP, can do things other RP models won't, and I'll leave it at that. Low refusals in assistant tasks as well.
  • VERY good at following the character card. Try the included characters if you're having sub optimal results.

Important: Make sure to use the correct settings!

Assistant settings

Roleplay settings


Fiendish_LLAMA_3B is available at the following quantizations:


Model Details

  • Intended use: Role-Play, Creative Writing, General Tasks.

  • Censorship level: Medium

  • 4.5 / 10 (10 completely uncensored)

UGI score:


Recommended settings for assistant mode

Full generation settings: Debug Deterministic. Debug Deterministic_Settings
Full generation settings: min_p. min_P_Settings

Recommended settings for Roleplay mode


Settings for RP, click below to expand:

Roleplay settings: A good repetition_penalty range is between 1.12 - 1.15, feel free to experiment.

With these settings, each output message should be neatly displayed in 1 - 5 paragraphs, 2 - 3 is the most common. A single paragraph will be output as a response to a simple message ("What was your name again?").

min_P for RP works too but is more likely to put everything under one large paragraph, instead of a neatly formatted short one. Feel free to switch in between.

(Open the image in a new window to better see the full details) Oni_Mitsubishi_12B_Settings

temperature:  0.8
top_p:  0.95
top_k:  25
typical_p:  1
min_p:  0
repetition_penalty: 1.12
repetition_penalty_range: 1024

Roleplay format: Classic Internet RP

*action* speech *narration*
  • min_p will bias towards a single big paragraph.
  • The recommended RP settings will bias towards 1-3 small paragraphs (on some occasions 4-5)

Model instruction template: Llama-3-Instruct

<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>

{input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>

{output}<|eot_id|>

Other recommended generation Presets:

Midnight Enigma
max_new_tokens: 512
temperature: 0.98
top_p: 0.37
top_k: 100
typical_p: 1
min_p: 0
repetition_penalty: 1.18
do_sample: True
Divine Intellect
max_new_tokens: 512
temperature: 1.31
top_p: 0.14
top_k: 49
typical_p: 1
min_p: 0
repetition_penalty: 1.17
do_sample: True
simple-1
max_new_tokens: 512
temperature: 0.7
top_p: 0.9
top_k: 20
typical_p: 1
min_p: 0
repetition_penalty: 1.15
do_sample: True

Your support = more models

My Ko-fi page (Click here)

Citation Information

@llm{Fiendish_LLAMA_3B,
  author = {SicariusSicariiStuff},
  title = {Fiendish_LLAMA_3B},
  year = {2025},
  publisher = {Hugging Face},
  url = {https://huggingface.co/SicariusSicariiStuff/Fiendish_LLAMA_3B}
}

Other stuff

Downloads last month
53
Safetensors
Model size
3.61B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for SicariusSicariiStuff/Fiendish_LLAMA_3B

Finetuned
(345)
this model
Finetunes
1 model
Quantizations
5 models

Collection including SicariusSicariiStuff/Fiendish_LLAMA_3B