Active filters: kto, trl
DatPySci/zephyr-7b-kto-iter0 • Text Generation • 42
eperim/mistral-QLORA-KTO
lewtun/kto-aligned-model-lora
Weni/WeniGPT-QA-Zephyr-7B-4.0.1-KTO
Weni/WeniGPT-Agents-Zephyr-1.0.22-KTO
Weni/WeniGPT-Agents-Zephyr-1.0.23-KTO
DatPySci/zephyr-7b-kto-iter0-des133 • Text Generation • 19
superemohot/llama3-8b_cp-p1_tv-llama3-emb_ft-b8.3patch1e1_spin-kto-b8.3p3b1-nft • Text Generation • 9
superemohot/llama3-8b_cp-p1_tv-llama3-emb_spin-kto-b8.3p3b1 • Text Generation • 8
mipo57/kto-aligned-model-lora
NBA55/Experiment_with_trained_model_Final_KTO_for_all_3_issues-epoch-2
stojchet/kto-python-6k-bad-ds
sfulay/kto-aligned-model-lora
stojchet/test_kto
stojchet/big_kto1
stojchet/lr_kto1
stojchet/lr_kto2
stojchet/kto1
stojchet/w_kto1
asmith26/kto_model • Text Generation • 15
stojchet/kto2
TheEighthDay/kto-aligned-model-lora
TheEighthDay/kto-aligned-model • Text Generation • 63
stojchet/kto_test
stojchet/py_base1_kto1
stojchet/py_base1_kto1_no_ref
stojchet/kto5k1 • Text Generation • 23
stojchet/kto5k2 • Text Generation • 137
stojchet/kto5k3 • Text Generation • 128
stojchet/kto5k4 • Text Generation • 125