datasets: | |
- HuggingFaceH4/ultrafeedback_binarized | |
base_model: | |
- OpenRLHF/Llama-3-8b-sft-mixture | |
Base model: [OpenRLHF/Llama-3-8b-sft-mixture](https://huggingface.co/OpenRLHF/Llama-3-8b-sft-mixture) | |
Preference dataset: [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized) |