Model Card for Llama2-7b-HF-NF4-QLORA

Model Details

This model is a NF4 quantized version of the meta-llama/Llama-2-7b-hf model. The model was fine-tuned on the dataset timdettmers/openassistant-guanaco using the QLoRA technique

  • Developed by: Ted Whooley
  • Library: Transformers, NF4
  • Model type: llama
  • Model name: Llama2-7b-HF-NF4-QLORA
  • Pipeline tag: text-generation
  • Qunatized by: twhoool02
  • Language(s) (NLP): en
  • License: other
Downloads last month
9
Safetensors
Model size
6.74B params
Tensor type
FP16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for twhoool02/Llama2-7b-HF-NF4-QLORA

Finetuned
(720)
this model

Dataset used to train twhoool02/Llama2-7b-HF-NF4-QLORA