library_name: transformers datasets: - HuggingFaceH4/ultrafeedback_binarized base_model: - meta-llama/Llama-3.2-3B-Instruct
CLIP with DPO, lr=5e-6