unsloth/Llama-3.2-3B-Instruct-bnb-4bit Fine-tuned with GRIT and QLoRA (Unsloth)

This model is a fine-tuned version of unsloth/Llama-3.2-3B-Instruct-bnb-4bit using the GRIT (Geometric Reprojection Instruction Tuning) algorithm and QLoRA on the tatsu-lab/alpaca dataset.

The base model is quantized to 4-bit (NF4) and optimized with Unsloth to enable efficient fine-tuning.

🚀 Training Details

GRIT Algorithm

K-FAC Updates: Every 100 steps (adaptive) for second-order preconditioning.
Neural Reprojection: Every 100 steps (adaptive) for rank optimization.
Rank Adaptation: Enabled (Threshold: 0.99, Min Rank: 4).
Optimized LoRA Modules: ['q_proj', 'o_proj', 'v_proj']

Fine-tuning Configuration

Base Model: unsloth/Llama-3.2-3B-Instruct-bnb-4bit
Quantization: 4-bit (NF4) with fp16 compute.
LoRA Rank: 4
LoRA Alpha: 8
Batch Size: 8 (per device)
Gradient Accumulation: 4 (Effective batch = 32)
Learning Rate: 2.0e-05
Precision: fp16 mixed precision
Sequence Length: 256 tokens
Gradient Checkpointing: Enabled

Performance Improvements

✅ Faster Convergence: K-FAC preconditioning aligns updates with curvature.
✅ Memory-Efficient: 4-bit quantization (QLoRA) and gradient checkpointing used.
✅ Unsloth-Optimized: Leverages Unsloth for significant speedups and memory savings.
✅ Adaptive Rank: Dynamically prunes LoRA rank to improve parameter efficiency.

📊 Training Metrics

Total Steps: 1463
Final Loss: N/A
BLEU (val): None
Trainable Params: 524,288

📝 Algorithm Details

K-FAC Preconditioning (Natural Gradient) and Neural Reprojection as per GRIT method.
Memory Efficient: Covariance matrices on CPU to reduce GPU load.

🏆 Results

In benchmark comparisons, GRIT has shown faster convergence and better stability than standard LoRA or fine-tuning, making it well-suited for efficient single-epoch training. The use of Unsloth further accelerates this process.

📝 Citation

If you use this model, please cite the original GRIT paper and:

@misc{grit-lora-Llama-3.2-3B-Instruct-bnb-4bit-alpaca},
  title={ unsloth/Llama-3.2-3B-Instruct-bnb-4bit Fine-tuned with GRIT on tatsu-lab/alpaca },
  author={te4bag},
  year={2024},
  publisher={Hugging Face},
  url={https://huggingface.co/te4bag/GRIT-Full-llama-3.2-3B}
}

⚖️ License

This model inherits the Apache 2.0 license.

te4bag
/

GRIT-Full-llama-3.2-3B