RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4_OpenMathIt2_iter1-gguf 4B • Updated Oct 26, 2024 • 6
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter5_lr1e-7-gguf 4B • Updated Oct 26, 2024 • 119
RichardErkhov/RyanYr_-_self-correct_Llama-3.2-3B-Instruct_metaMathQA_dpo_iter4_metaMathQA_dpo_iter5-gguf 4B • Updated Oct 30, 2024 • 6
RichardErkhov/RyanYr_-_self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter1-gguf Updated Oct 30, 2024 • 5
RichardErkhov/RyanYr_-_self-correct_Ministral-8B-Instruct-2410_metaMathQA_dpo_iter2-gguf 8B • Updated Oct 31, 2024 • 71
AlekseyKorshuk/ai-detection-gutenberg-human-v2-formatted-ai-sft-qwen-7b-dpo-3epochs Text Generation • 8B • Updated Nov 6, 2024 • 3
cluebbers/Llama-3.1-8B-paraphrase-type-generation-apty-sigmoid Text Generation • 8B • Updated Jun 4 • 3