Active filters: cpo, trl
NBA55/Experiment_with_trained_model_Final_CPO_for_all_3_issues-epoch-2 • Updated
smohammadi/llama2-lora-aligned-cpo • Updated • 28
NBA55/Final_Experiment_with_trained_model_Final_CPO_for_all_3_issues-epoch-2 • Updated
jbjeong91/llama3.1-cpo-full • Text Generation • 8B • Updated • 15
jbjeong91/llama3.1-cpo_j-full-0911 • Text Generation • 8B • Updated • 12
jbjeong91/llama3.1-cpo-full-0911 • Text Generation • 8B • Updated • 14
jbjeong91/llama3.1-cpo_j-full-0912 • Text Generation • 8B • Updated • 13
jbjeong91/llama3.1-cpo-full-0912 • Text Generation • 8B • Updated • 17
jbjeong91/llama3.1-cpo-full-0913 • Text Generation • 8B • Updated • 19
Siddartha10/outputs_cpo • Text Generation • 0.1B • Updated • 10
ravithejads/test_model_sft • Text Generation • 0.1B • Updated
maxmyn/c4ai-takehome-model-simpo • Text Generation • 0.1B • Updated • 13
twigs/smolm-cposimpo • Text Generation • 0.1B • Updated • 15
sarthakrw/cpo_model • Text Generation • 0.1B • Updated • 10
CharlesLi/OpenELM-1_1B-SimPO • Text Generation • 1B • Updated • 9
CharlesLi/OpenELM-1_1B-CPO • Text Generation • 1B • Updated • 7
NBA55/CPO_with_baseline_modalh • Text Generation • 4B • Updated • 11
NBA55/CPO_with_trained_model_for_all_3_issues-epoch-2 • Updated
rawsh/mirrorqwen2.5-0.5b-SimPO • Text Generation • 0.5B • Updated • 9
rawsh/simpo-math-model • Text Generation • 0.5B • Updated • 11
rawsh/mirrorqwen2.5-0.5b-SimPO-0 • Text Generation • 0.5B • Updated • 12
mradermacher/mirrorqwen2.5-0.5b-SimPO-GGUF • 0.5B • Updated • 466
mradermacher/mirrorqwen2.5-0.5b-SimPO-0-GGUF • 0.5B • Updated • 176
rawsh/mirrorqwen2.5-0.5b-SimPO-1 • Text Generation • 0.5B • Updated • 9
rawsh/mirrorqwen2.5-0.5b-SimPO-2 • Text Generation • 0.5B • Updated • 9
rawsh/mirrorqwen2.5-0.5b-SimPO-3 • Text Generation • 0.5B • Updated • 10
mradermacher/mirrorqwen2.5-0.5b-SimPO-1-GGUF • 0.5B • Updated • 246
mradermacher/mirrorqwen2.5-0.5b-SimPO-2-GGUF • 0.5B • Updated • 297
mradermacher/mirrorqwen2.5-0.5b-SimPO-3-GGUF • 0.5B • Updated • 88
botways/llama-CPO • Updated