DUAL-GPO/phi-2-gpo-new-i0

Text Generation

text-generation-inference


phi-2-gpo-newSFT-b0.001-i0

This model is a fine-tuned version of DUAL-GPO/phi-2-sft-lora-ultrachat-merged on the HuggingFaceH4/ultrafeedback_binarized dataset.
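The card does not include usage instructions, so the following is only a minimal sketch of how the checkpoint might be loaded with the `transformers` library. It assumes that the repository id `DUAL-GPO/phi-2-gpo-new-i0` holds full (merged) weights loadable through `AutoModelForCausalLM`, and that `torch` and a recent `transformers` release are installed. The helper names `load_model` and `generate` are hypothetical, and no prompt template is documented on the card, so the prompt is passed through as plain text.

```python
# Repository id as shown on the card.
REPO_ID = "DUAL-GPO/phi-2-gpo-new-i0"


def load_model(repo_id=REPO_ID):
    """Hypothetical helper: load tokenizer and model in BF16 (assumes merged weights)."""
    # Imported lazily so that defining the helper does not require the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    # BF16 matches the tensor type listed on the card.
    model = AutoModelForCausalLM.from_pretrained(repo_id, torch_dtype=torch.bfloat16)
    return tokenizer, model


def generate(prompt, max_new_tokens=64):
    """Hypothetical helper: run plain-text generation; the prompt format is an assumption."""
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Explain what a binary search does.")` would download the weights on first use. If the repository turns out to contain only adapter weights, loading would instead go through the base model plus an adapter library, which this sketch does not cover.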

Downloads last month: 6

Safetensors

Model size: 2.78B params

Tensor type: BF16
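As a rough guide to what these two figures imply, the weight footprint can be estimated by multiplying the parameter count by the bytes per parameter. The sketch below uses only the numbers on the card (2.78B parameters, BF16 at 2 bytes each). It covers the weights alone; activations, KV cache, and framework overhead are not included.

```python
# Back-of-the-envelope weight footprint from the card's metadata.
PARAMS = 2.78e9        # "2.78B params" from the card
BYTES_PER_PARAM = 2    # BF16 stores each value in 16 bits


def weight_bytes(n_params, bytes_per_param):
    """Return the storage needed for the weights alone, in bytes."""
    return n_params * bytes_per_param


total = weight_bytes(PARAMS, BYTES_PER_PARAM)
print(f"{total / 1e9:.2f} GB ({total / 2**30:.2f} GiB)")  # prints "5.56 GB (5.18 GiB)"
```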

Model tree for DUAL-GPO/phi-2-gpo-new-i0

Adapters