
gemma-2-9b_pct_ortho_r32

This model is a fine-tuned version of google/gemma-2-9b on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 9.3620
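
This repository is a PEFT adapter for google/gemma-2-9b (see Framework versions below). A minimal loading sketch follows; the adapter id is taken from this page, while the dtype and device settings are assumptions, since the card does not state them:

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the base model first, then attach the adapter weights on top of it.
base_model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2-9b",
    torch_dtype=torch.bfloat16,  # assumption: dtype is not stated in the card
    device_map="auto",
)
model = PeftModel.from_pretrained(base_model, "imdatta0/gemma-2-9b_pct_ortho_r32")
tokenizer = AutoTokenizer.from_pretrained("google/gemma-2-9b")

inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0]))
```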

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (see the `TrainingArguments` sketch after the list):

  • learning_rate: 0.0001
  • train_batch_size: 1
  • eval_batch_size: 1
  • seed: 42
  • gradient_accumulation_steps: 64
  • total_train_batch_size: 64
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: cosine
  • lr_scheduler_warmup_ratio: 0.02
  • num_epochs: 1
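
For reference, a `TrainingArguments` sketch equivalent to the list above; `output_dir` is an assumption, and the Adam settings are left implicit because the listed betas and epsilon are the defaults:

```python
from transformers import TrainingArguments

# Sketch reconstructing the hyperparameters listed above.
training_args = TrainingArguments(
    output_dir="gemma-2-9b_pct_ortho_r32",  # assumption: not stated in the card
    learning_rate=1e-4,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=1,
    seed=42,
    gradient_accumulation_steps=64,  # 1 device x batch 1 x 64 steps = total batch 64
    lr_scheduler_type="cosine",
    warmup_ratio=0.02,
    num_train_epochs=1,
    # Adam betas=(0.9, 0.999) and epsilon=1e-8 are the optimizer defaults,
    # so no extra arguments are needed for them.
)
```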

Training results

| Training Loss | Epoch  | Step | Validation Loss |
|:-------------:|:------:|:----:|:---------------:|
| 2.3352        | 0.0206 | 8    | 2.9038          |
| 11.3417       | 0.0412 | 16   | 11.9083         |
| 11.9918       | 0.0618 | 24   | 11.9774         |
| 11.9549       | 0.0824 | 32   | 11.9675         |
| 11.974        | 0.1030 | 40   | 11.9736         |
| 11.9403       | 0.1236 | 48   | 11.9468         |
| 11.9321       | 0.1442 | 56   | 11.8809         |
| 11.876        | 0.1648 | 64   | 11.8218         |
| 11.7886       | 0.1854 | 72   | 11.7345         |
| 11.6471       | 0.2060 | 80   | 11.6236         |
| 11.5982       | 0.2266 | 88   | 11.3718         |
| 11.7088       | 0.2472 | 96   | 11.6792         |
| 11.7296       | 0.2678 | 104  | 11.6883         |
| 11.6508       | 0.2885 | 112  | 11.4420         |
| 10.7655       | 0.3091 | 120  | 8.8174          |
| 8.5075        | 0.3297 | 128  | 9.0568          |
| 8.912         | 0.3503 | 136  | 9.4162          |
| 11.0052       | 0.3709 | 144  | 10.3473         |
| 9.103         | 0.3915 | 152  | 9.6451          |
| 8.9631        | 0.4121 | 160  | 8.6492          |
| 9.9634        | 0.4327 | 168  | 9.4401          |
| 9.814         | 0.4533 | 176  | 10.4748         |
| 10.507        | 0.4739 | 184  | 10.1910         |
| 9.6613        | 0.4945 | 192  | 9.2201          |
| 9.0448        | 0.5151 | 200  | 10.3913         |
| 9.4984        | 0.5357 | 208  | 8.5434          |
| 7.4393        | 0.5563 | 216  | 8.4350          |
| 10.0883       | 0.5769 | 224  | 10.2584         |
| 10.7162       | 0.5975 | 232  | 10.6899         |
| 10.4785       | 0.6181 | 240  | 10.4417         |
| 10.023        | 0.6387 | 248  | 9.6244          |
| 9.2272        | 0.6593 | 256  | 8.9308          |
| 9.1518        | 0.6799 | 264  | 9.2269          |
| 9.1733        | 0.7005 | 272  | 9.2434          |
| 9.3347        | 0.7211 | 280  | 9.2831          |
| 9.468         | 0.7417 | 288  | 9.1046          |
| 8.9402        | 0.7623 | 296  | 9.0102          |
| 9.1051        | 0.7829 | 304  | 9.2617          |
| 9.2223        | 0.8035 | 312  | 9.3921          |
| 9.3359        | 0.8241 | 320  | 9.3277          |
| 9.1508        | 0.8447 | 328  | 9.2755          |
| 9.5364        | 0.8654 | 336  | 9.3031          |
| 9.4429        | 0.8860 | 344  | 9.3229          |
| 9.3958        | 0.9066 | 352  | 9.3408          |
| 9.3778        | 0.9272 | 360  | 9.3577          |
| 9.1859        | 0.9478 | 368  | 9.3607          |
| 9.4256        | 0.9684 | 376  | 9.3622          |
| 9.3454        | 0.9890 | 384  | 9.3620          |

Framework versions

  • PEFT 0.12.0
  • Transformers 4.44.2
  • PyTorch 2.3.0+cu121
  • Datasets 2.21.0
  • Tokenizers 0.19.1
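
To reproduce this environment, a pinned install along these lines should work; the extra wheel index for the CUDA 12.1 PyTorch build is an assumption about how that build was obtained:

```
pip install peft==0.12.0 transformers==4.44.2 datasets==2.21.0 tokenizers==0.19.1
pip install torch==2.3.0 --index-url https://download.pytorch.org/whl/cu121
```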
