Model Card for free-llama-dpo-v0.2
Developed by : Freewheelin AI Technical Team
Hardware and Software
- Training Factors: We fine-tuned this model using the HuggingFace TRL Trainer
Method
- This model was trained using the learning method introduced in the SOLAR paper.
- Downloads last month
- 10,619
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support