Goekdeniz-Guelmez commited on
Commit
fe3b6f1
·
verified ·
1 Parent(s): 7fd62d5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -21
README.md CHANGED
@@ -1,27 +1,7 @@
1
  ---
2
- base_model: Goekdeniz-Guelmez/Josiefied-Qwen2.5-1.5B-Instruct-abliterated-v1
3
- language:
4
- - en
5
- license: apache-2.0
6
- tags:
7
- - text-generation-inference
8
- - transformers
9
- - unsloth
10
- - qwen2
11
- - trl
12
- - dpo
13
  ---
14
 
15
- # Uploaded model
16
-
17
- - **Developed by:** Goekdeniz-Guelmez
18
- - **License:** apache-2.0
19
- - **Finetuned from model :** Goekdeniz-Guelmez/Josiefied-Qwen2.5-1.5B-Instruct-abliterated-v1
20
-
21
- This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
22
-
23
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
24
-
25
  ## A experimental DPO training with a custom dataset.
26
 
27
 
 
1
  ---
2
+ base_model: Goekdeniz-Guelmez/j.o.s.i.e.v4o-1.5b-dpo-stage1-v1
 
 
 
 
 
 
 
 
 
 
3
  ---
4
 
 
 
 
 
 
 
 
 
 
 
5
  ## A experimental DPO training with a custom dataset.
6
 
7