How you make a uncensored model?

#6
by RGTails - opened

All is in title. ^^

You take a pre-trained base model like llama-2 (e.g. https://huggingface.co/TheBloke/Llama-2-7B-fp16), then run instruction/conversation fine-tuning on a dataset that does not include alignment and refusals, e.g. https://huggingface.co/datasets/ehartford/wizard_vicuna_70k_unfiltered

To actually run the fine-tuning you can take a look at https://georgesung.github.io/ai/qlora-ift/

georgesung changed discussion status to closed
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment