koutch/paper_qwen_3.json_train_dpo_v1_train_no_think Text Generation • 4B • Updated about 10 hours ago • 14
koutch/paper_llama_llama3.1-8b_train_sft_all_train_think Text Generation • 8B • Updated about 13 hours ago • 20
koutch/paper_llama_llama3.1-8b_train_sft_train_think Text Generation • 8B • Updated about 14 hours ago • 15
koutch/paper_llama_llama3.1-8b_train_sft_train_no_think Text Generation • 8B • Updated about 14 hours ago • 19
koutch/paper_llama_llama3.1-8b_train_sft_train_para Text Generation • 8B • Updated about 14 hours ago • 13
koutch/paper_smol_3.json_train_dpo_v1_train_no_think Text Generation • 3B • Updated about 14 hours ago • 4
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_think Text Generation • 4B • Updated about 15 hours ago • 9
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_no_think Text Generation • 4B • Updated about 15 hours ago • 16
koutch/paper_qwen_qwen3-instruct-4b_train_sft_all_train_think Text Generation • 4B • Updated about 15 hours ago • 15
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_para Text Generation • 4B • Updated about 15 hours ago • 19