Update README.md
Browse files
README.md
CHANGED
@@ -11,6 +11,20 @@ widget:
|
|
11 |
---
|
12 |
|
13 |
# LASSL bert-ko-small
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
14 |
Evaulation results will be released soon.
|
15 |
|
16 |
## How to use
|
|
|
11 |
---
|
12 |
|
13 |
# LASSL bert-ko-small
|
14 |
+
This model was trained from 702,437 examples (whose have 3,596,465,664 tokens). 702,437 examples are extracted from below corpora. If you want to get information for training, you should see `config.json`.
|
15 |
+
|
16 |
+
```bash
|
17 |
+
corpora/
|
18 |
+
├── [707M] kowiki_latest.txt
|
19 |
+
├── [ 26M] modu_dialogue_v1.2.txt
|
20 |
+
├── [1.3G] modu_news_v1.1.txt
|
21 |
+
├── [9.7G] modu_news_v2.0.txt
|
22 |
+
├── [ 15M] modu_np_v1.1.txt
|
23 |
+
├── [1008M] modu_spoken_v1.2.txt
|
24 |
+
├── [6.5G] modu_written_v1.0.txt
|
25 |
+
└── [413M] petition.txt
|
26 |
+
```
|
27 |
+
|
28 |
Evaulation results will be released soon.
|
29 |
|
30 |
## How to use
|