Goekdeniz-Guelmez/j.o.s.i.e.v4o-1.5b-dpo-stage1-v1-gguf


Goekdeniz-Guelmez committed on Oct 7, 2024

Commit fe3b6f1 (verified) · 1 parent: 7fd62d5

Update README.md

Files changed (1)

README.md +1 -21


@@ -1,27 +1,7 @@
 ---
-base_model: Goekdeniz-Guelmez/Josiefied-Qwen2.5-1.5B-Instruct-abliterated-v1
-language:
-- en
-license: apache-2.0
-tags:
-- text-generation-inference
-- transformers
-- unsloth
-- qwen2
-- trl
-- dpo
 ---
-# Uploaded  model
-- **Developed by:** Goekdeniz-Guelmez
-- **License:** apache-2.0
-- **Finetuned from model :** Goekdeniz-Guelmez/Josiefied-Qwen2.5-1.5B-Instruct-abliterated-v1
-This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 ## A experimental DPO training with a custom dataset.

 ---
+base_model: Goekdeniz-Guelmez/j.o.s.i.e.v4o-1.5b-dpo-stage1-v1
 ---
 ## A experimental DPO training with a custom dataset.