ahmedabdelwahed
/

Mojiz-DPO-1e-5-4000-steps-beta-1e-1

Summarization

Transformers

Safetensors

Model card Files Files and versions Community

This model was aligned using DPO with a 1e-5 learning rate for 4000 steps

Downloads last month: 14

Safetensors

Model size

582M params

Tensor type

F32

Inference Providers NEW

Summarization

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.