Purpose

Used for generating high quality commit messages for a given git difference

Model Description

Generated by fine tuning Qwen2.5-Coder-1.5B-Instruct on bigcode/commitpackft dataset for 2 epochs Trained on a total of 277 Languages Achieved a final training loss in the range of 1- 1.7 (due to data set not containing equal data rows for each language) For common languages(python, java ,javascripts,c etc) loss went for a minimum of 1.0335

Environmental Impact

  • Hardware Type: geforce RTX 4060 TI - 16GB]
  • Hours used: 10 Hours
  • Cloud Provider: local

Results

Logo Logo

Downloads last month
3
GGUF
Model size
1.54B params
Architecture
qwen2
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for seniruk/commitGen-gguf

Base model

Qwen/Qwen2.5-1.5B
Quantized
(67)
this model

Dataset used to train seniruk/commitGen-gguf