abacusai
/

MetaMath-Bagel-DPO-34B

Text Generation

text-generation-inference

Model card Files Files and versions Community

DPO finetune of our MetaMath SFT Model on the Truthy DPO dataset

Evaluation Results

Average	ARC	HellaSwag	MMLU	TruthfulQA	Winogrande	GSM8K
75.54	69.20	84.34	76.46	67.58	82.87	72.78

Downloads last month: 14

Safetensors

Model size

34.4B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for abacusai/MetaMath-Bagel-DPO-34B

Merges

1 model

Quantizations

1 model

Dataset used to train abacusai/MetaMath-Bagel-DPO-34B

Space using abacusai/MetaMath-Bagel-DPO-34B 1