• Developed by: seniruk
  • License: apache-2.0
  • Finetuned from model : unsloth/Qwen2.5-Coder-1.5B-Instruct

This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.


base_model: unsloth/Qwen2.5-Coder-1.5B-Instruct tags: - text-generation-inference - transformers - unsloth - qwen2 - trl license: apache-2.0 language: - en


datasets: - bigcode/commitpackft

Purpose

Used for generating high quality commit messages for a given git difference

Model Description

Generated by fine tuning Qwen2.5-Coder-1.5B-Instruct on bigcode/commitpackft dataset for 2 epochs Trained on a total of 277 Languages Achieved a final training loss in the range of 1- 1.7 (due to data set not containing equal data rows for each language) For common languages(python, java ,javascripts,c etc) loss went for a minimum of 1.0335

Environmental Impact

  • Hardware Type: geforce RTX 4060 TI - 16GB]
  • Hours used: 10 Hours
  • Cloud Provider: local

Results

Logo Logo

Downloads last month
65
Safetensors
Model size
1.54B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for seniruk/commitGen

Base model

Qwen/Qwen2.5-1.5B
Finetuned
(35)
this model

Dataset used to train seniruk/commitGen