iqwiki-kor/Qwen2.5-7B-distill-SFT-DPO-beta0.01-Iter1-v2-Self-seed192-w-score • 60.9k • 6
iqwiki-kor/Qwen2.5-7B-distill-SFT-DPO-beta0.01-Iter1-v2-Self-seed42 • 60.9k • 3
iqwiki-kor/Qwen2.5-7B-distill-SFT-DPO-beta0.01-Iter1-v2-Self-seed192 • 60.9k • 3
iqwiki-kor/wDPO-it-final1 • 10k • 6
10k • 5
iqwiki-kor/uf-g4o_translated-Qwen2.5-7B-distill-SFT-DPO-beta0.1-seed8049 • 56.8k • 1
iqwiki-kor/khs-Qwen2.5-7B-distill-SFT-DPO-beta0.1-seed6247 • 10.2k
iqwiki-kor/khs-Qwen2.5-7B-distill-SFT-DPO-beta0.1-seed1903 • 10.2k • 5
iqwiki-kor/Qwen2.5-7B-distill-SFT-DPO-beta0.1-op-samp4-seed6247 • 10.2k
iqwiki-kor/Q2.5-7B-dist-op-pref-seed2938 • 56.8k • 1
iqwiki-kor/Qwen2.5-7B-distill-SFT-DPO-beta0.1-op-samp4-seed42 • 10.2k
iqwiki-kor/uf-g4o_translated-Qwen2.5-7B-distill-SFT-DPO-beta0.1-seed2938 • 56.8k
iqwiki-kor/OMI2-Q25M-72B-It-shard2-to-kor-ko • 100k • 1
iqwiki-kor/OpenMathInstruct-2-Qwen2.5-Math-72B-Instruct • 200k • 6
iqwiki-kor/dpo-prompts-Qwen2.5-7B-distill-SFT-n1-seed6247 • 67k • 1
86.4k • 4 • 2
10k • 3
iqwiki-kor/ufc-prompt-eng-translated-es-Qwen2.5-7B-Instruct-add • 10k • 1
iqwiki-kor/ufc-prompt-eng-translated-es-Qwen2.5-7B-Instruct-Q25-3B-It-E80-DPO • 10k • 2
iqwiki-kor/ufc-prompt-eng-translated-zh • 10k • 2
iqwiki-kor/ufc-prompt-eng-translated-it • 10k • 4
iqwiki-kor/ufc-prompt-eng-translated-es • 10k • 2
iqwiki-kor/rm-training-data-40k-translated-it • 40.9k • 3
iqwiki-kor/uf-g4o_translated-gemma-2-2b-it-rm-train-n1 • 56.8k • 1
iqwiki-kor/uf-g4o_translated • 56.8k • 1
iqwiki-kor/ultrafeedback-binarized-preferences-cleaned-translated-GPT4o • 30.5k • 1
iqwiki-kor/Infinity-Instruct-0625-prompt-translated • 660k
iqwiki-kor/llama-3.1-onpolicy-5samples-rank • 61.6k • 2