freewheelin
/

free-llama3-dpo-v0.2

Text Generation

text-generation-inference

Model card Files Files and versions Community

Model Card for free-llama-dpo-v0.2

Developed by : Freewheelin AI Technical Team

Hardware and Software

Training Factors: We fine-tuned this model using the HuggingFace TRL Trainer

Method

This model was trained using the learning method introduced in the SOLAR paper.

Downloads last month: 10,619

Safetensors

Model size

8.03B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for freewheelin/free-llama3-dpo-v0.2

Quantizations

Spaces using freewheelin/free-llama3-dpo-v0.2 8