heegyu's Collections
Korean Pretraining Dataset
heegyu/namuwiki-extracted • 565k • 295 • 15
1.33M • 261 • 5
4.42M • 2.23k • 109
437k • 182 • 7
hac541309/basic_korean_dict • 74.9k • 116 • 4
3.68M • 426 • 3
7.18B • 25.2k • 500
Note
mC4 + OSCAR +
301k • 1.41k • 13
HAERAE-HUB/KOREAN-WEBTEXT • 1.28M • 275 • 32
HAERAE-HUB/KOREAN-SyntheticText-1.5B • 1.55M • 268 • 14