Boko99/Llama-3-8B-Instruct-Base-DPO-wo-prompt-QLoRA-r16-llama3-ultrafeedback Text Generation • 8B • Updated May 17 • 2
Boko99/Llama-3-8B-Instruct-Base-DPO-wo-prompt-QLoRA-r16-llama3-ultrafeedback-1e5 Text Generation • 8B • Updated May 17 • 2
Boko99/Llama-3-8B-Instruct-Base-DPO-wo-prompt-QLoRA-r16-llama3-ultrafeedback-5e7 Text Generation • 8B • Updated May 17 • 2
Boko99/Llama-3-8B-Instruct-Base-DPO-wo-prompt-QLoRA-r16-llama3-ultrafeedback-1e6 Text Generation • 8B • Updated May 17 • 2