neural-interactive-proofs/finetune_dpo_cv_test_lm_server_30_0_iter_0_provers_group_2025-06-18_15-40-03_Qwen_Qwen2.5-0.5B-I Updated Jun 18
neural-interactive-proofs/finetune_dpo_cv_test_lm_server_34_0_iter_0_provers_group_2025-06-18_17-02-34_Qwen_Qwen2.5-0.5B-I Updated Jun 18
neural-interactive-proofs/finetune_dpo_cv_test_lm_server_45_0_iter_0_provers_group_2025-06-19_11-38-10_Qwen_Qwen2.5-0.5B-I Updated Jun 19
neural-interactive-proofs/finetune_dpo_cv_test_lm_server_47_0_iter_0_provers_group_2025-06-19_12-35-00_Qwen_Qwen2.5-0.5B-I Updated Jun 19
neural-interactive-proofs/finetune_dpo_cv_test_lm_server_47_0_iter_0_provers_group_2025-06-19_12-43-38_Qwen_Qwen2.5-0.5B-I Updated Jun 19
neural-interactive-proofs/finetune_dpo_cv_test_lm_server_47_0_iter_0_provers_group_2025-06-19_14-40-50_Qwen_Qwen2.5-0.5B-I Updated Jun 19
BennyWang/Qwen2.5-0.5B-Instruct-Curriculum-5stage-v4-lr_adj Summarization • 0.5B • Updated 25 days ago • 10