darkc0de
/

Llama-3.1-Nemotron-Nano-8B-v1-abliterated-Uncensored-Toxic-DPO-GGUF

text-generation-inference

Model card Files Files and versions

darkc0de commited on Apr 25

Commit

fc8764b

·

verified ·

1 Parent(s): 1f90401

Update README.md

Files changed (1) hide show

README.md +7 -9

README.md CHANGED Viewed

@@ -1,22 +1,20 @@
 ---
-base_model: nvidia/Llama-3.1-Nemotron-Nano-8B-v1
 tags:
 - text-generation-inference
 - transformers
 - unsloth
 - llama
-- gguf
 license: apache-2.0
 language:
 - en
 ---
-# Uploaded  model
-- **Developed by:** darkc0de
-- **License:** apache-2.0
-- **Finetuned from model :** nvidia/Llama-3.1-Nemotron-Nano-8B-v1
-This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 ---
+base_model:
+- nvidia/Llama-3.1-Nemotron-Nano-8B-v1
+- huihui-ai/Llama-3.1-Nemotron-Nano-8B-v1-abliterated
 tags:
 - text-generation-inference
 - transformers
 - unsloth
 - llama
+- trl
 license: apache-2.0
 language:
 - en
+datasets:
+- Undi95/toxic-dpo-v0.1-NoWarning
 ---
+**huihui-ai/Llama-3.1-Nemotron-Nano-8B-v1-abliterated** trained with **Unsloth ORPO** for 1 **full** epoch on **Undi95/toxic-dpo-v0.1-NoWarning**