Upload README.md with huggingface_hub
README.md
CHANGED
@@ -28,11 +28,11 @@ All models in the series achieve **HellaSwag benchmark scores that surpass those
 **ChronoGPT** has the following features:
 - Type: Causal Language Models
 - Training Stage: Pretraining
-- Number of Parameters: ~
-- Encoder & Decoder Partitioning:
+- Number of Parameters: ~1,552 Million
+- Encoder & Decoder Partitioning: 26 encoder and 26 decoder layers
 - Tokenizer: GPT2Tokenizer from HuggingFace
 - Context Length: 1,792
-- Embedding Dimension:
+- Embedding Dimension: 1,536
 
 ## 🚀 Quickstart
 