Active filters: dpo, trl
HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407 • Text Generation • 12B • Updated • 191 • 24
HumanLLMs/Human-Like-LLama3-8B-Instruct • Text Generation • 8B • Updated • 11 • 23
HumanLLMs/Human-Like-Qwen2.5-7B-Instruct • Text Generation • 8B • Updated • 26 • 12
bartowski/Human-Like-Mistral-Nemo-Instruct-2407-GGUF • Text Generation • 12B • Updated • 2.44k • 4
shisa-ai/shisa-v2.1c-lfm2-350m • Text Generation • 0.4B • Updated • 41 • 1
lewtun/zephyr-7b-dpo-full • Text Generation • 7B • Updated • 2
alignment-handbook/zephyr-7b-dpo-full • Text Generation • 7B • Updated • 757 • 3
alignment-handbook/zephyr-7b-dpo-qlora • Updated • 9 • 9
amirali1985/gpt-neo-125m_hh_reward • Text Generation • 0.1B • Updated • 15
lewtun/zephyr-7b-dpo-qlora
sambar/zephyr-7b-ipo-lora • Text Generation • Updated
nlee282/moai-dpo-1.0
nikkoyabut/merged_model_dpo
sambar/zephyr-7b-ipo-lora-5ep • Text Generation • Updated
alexredna/TinyLlama-1.1B-Chat-v1.0-reasoning-v2-dpo • Text Generation • 1B • Updated • 153 • 2
AlbelTec/mistral-dpo-old
Yaxin1992/mixtral-dpo-1000
adhi29/openhermes-mistral-dpo-gptq • Updated
ybelkada/test-tags-model • Text Generation • 1.03M • Updated • 2
ybelkada/test-tags-model-2 • Text Generation • 1.03M • Updated • 3
justinj92/dpoplatypus-phi2 • Text Generation • 3B • Updated
Belred/mistral-dpo
lewtun/zephyr-7b-dpo-qlora-8e0975a
mecoaoge2/results
mecoaoge2/fununun
akashkumarbtc/openhermes-mistral-dpo-gptq • Updated
darshan8950/openhermes-mistral-dpo-gptq • Updated
sonu2023/mistral-dpo • Updated
ondevicellm/zephyr-7b-dpo-full • Text Generation • 7B • Updated • 1
jdang/openhermes-mistral-dpo-gptq • Updated