Introduction

The model is trained with Masked thought Fine-Tuning (MFT), a simple variant of standard Supervised Fine-Tuning (SFT). You can refer to our code and paper below.

Results

We test it with the scripts provided in our code.

Model	GSM8K
adalaw/Llama2-7B-GSM8K-SFT	42.8
adalaw/Llama2-7B-GSM8K-MFT	47.3

Downloads last month: 12

Safetensors

Model size

6.74B params

Tensor type

F32

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

adalaw
/

Llama2-7B-GSM8K-MFT

Introduction

Links

Results

Dataset used to train adalaw/Llama2-7B-GSM8K-MFT