Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -28,11 +28,11 @@ All models in the series achieve **HellaSwag benchmark scores that surpass those
|
|
| 28 |
**ChronoGPT** has the following features:
|
| 29 |
- Type: Causal Language Models
|
| 30 |
- Training Stage: Pretraining
|
| 31 |
-
- Number of Parameters: ~
|
| 32 |
-
- Encoder & Decoder Partitioning:
|
| 33 |
- Tokenizer: GPT2Tokenizer from HuggingFace
|
| 34 |
- Context Length: 1,792
|
| 35 |
-
- Embedding Dimension:
|
| 36 |
|
| 37 |
## 🚀 Quickstart
|
| 38 |
|
|
|
|
| 28 |
**ChronoGPT** has the following features:
|
| 29 |
- Type: Causal Language Models
|
| 30 |
- Training Stage: Pretraining
|
| 31 |
+
- Number of Parameters: ~1,552 Million
|
| 32 |
+
- Encoder & Decoder Partitioning: 26 encoder and 26 decoder layers
|
| 33 |
- Tokenizer: GPT2Tokenizer from HuggingFace
|
| 34 |
- Context Length: 1,792
|
| 35 |
+
- Embedding Dimension: 1,536
|
| 36 |
|
| 37 |
## 🚀 Quickstart
|
| 38 |
|