xzuyn committed on
Commit e7d3081 · verified · 1 Parent(s): e590b35

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -52,7 +52,7 @@ datasets:
  ---
  # Granite-3.1-Earthen-v0.3-3B-A800M-QLoRA
 
- [`ibm-granite/granite-3.1-3b-a800m-instruct`](https://huggingface.co/ibm-granite/granite-3.1-3b-a800m-instruct) was trained at 8K with batch size 2 gradient accumulation 8, so each step was 131,072 tokens (including any padding tokens). It was trained for 400 steps, adding up to a total of 52,428,800 unique tokens seen.
+ [`ibm-granite/granite-3.1-3b-a800m-instruct`](https://huggingface.co/ibm-granite/granite-3.1-3b-a800m-instruct) was trained at 8K with batch size 2 gradient accumulation 8, so each step was 131,072 tokens (including any padding tokens). It was trained for 500 steps, adding up to a total of 65,536,000 unique tokens seen.
 
  This is a small test run. A larger version is planned.
 
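
The token counts in the changed line can be reproduced with a short calculation. This is a minimal sketch, not part of the commit: it assumes "8K" means a sequence length of 8,192 tokens (which matches the stated 131,072 tokens per step), and the names `tokens_per_step` and `total_tokens` are hypothetical helpers introduced for illustration.

```python
# Assumption: "8K" in the README means 8,192 tokens per sequence.
SEQ_LEN = 8 * 1024
BATCH_SIZE = 2   # sequences per micro-batch, as stated in the README
GRAD_ACCUM = 8   # micro-batches accumulated per optimizer step


def tokens_per_step(seq_len=SEQ_LEN, batch_size=BATCH_SIZE, grad_accum=GRAD_ACCUM):
    """Token positions processed per optimizer step, padding included."""
    return seq_len * batch_size * grad_accum


def total_tokens(steps):
    """Token positions processed over the given number of optimizer steps."""
    return steps * tokens_per_step()


if __name__ == "__main__":
    # Compare the step count before (400) and after (500) this commit.
    for steps in (400, 500):
        print(f"{steps} steps: {total_tokens(steps):,} tokens")
```

Because every sequence is counted at its full length, these totals include padding, as the README itself notes.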