llama3.1-cc-8B

mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated finetuned on flammenai/casual-conversation-DPO.

This is an experimental finetune that formats the conversation data sequentially with the Llama 3 template.

Method

Finetuned using an A100 on Google Colab for 3 epochs.

Fine-tune Llama 3 with ORPO

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 20.13
IFEval (0-Shot) 50.68
BBH (3-Shot) 26.48
MATH Lvl 5 (4-Shot) 6.34
GPQA (0-shot) 4.70
MuSR (0-shot) 6.50
MMLU-PRO (5-shot) 26.08
Downloads last month
21
Safetensors
Model size
8.03B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for nbeerbower/llama3.1-cc-8B

Finetuned
(7)
this model
Merges
2 models
Quantizations
10 models

Dataset used to train nbeerbower/llama3.1-cc-8B

Evaluation results