Open Datasets
updated
Updated
•
365
•
86
fka/awesome-chatgpt-prompts
Viewer
•
Updated
•
922
•
17.8k
•
9.53k
Viewer
•
Updated
•
470M
•
41.3k
•
321
Viewer
•
Updated
•
2.2M
•
5.6k
•
386
Matthijs/cmu-arctic-xvectors
Viewer
•
Updated
•
7.93k
•
17.8k
•
62
parler-tts/libritts-r-filtered-speaker-descriptions
Viewer
•
Updated
•
359k
•
216
•
7
Viewer
•
Updated
•
860k
•
10.2k
•
524
alpindale/two-million-bluesky-posts
Viewer
•
Updated
•
2.11M
•
551
•
200
arimalabs/2.3-million-bluesky-posts
Viewer
•
Updated
•
2.37M
•
70
•
5
Viewer
•
Updated
•
70k
•
79.1k
•
218
Viewer
•
Updated
•
1.34M
•
5.28k
•
30
Viewer
•
Updated
•
1.12M
•
4.46k
•
4
parler-tts/libritts_r_filtered
Viewer
•
Updated
•
359k
•
2.12k
•
21
opendiffusionai/cc12m-cleaned
Viewer
•
Updated
•
8.53M
•
286
•
10
Viewer
•
Updated
•
31.4k
•
284
•
22
Preview
•
Updated
•
756
•
7
Viewer
•
Updated
•
61.6M
•
70.4k
•
1.1k
parler-tts/mls-eng-speaker-descriptions
Viewer
•
Updated
•
10.8M
•
200
•
10
Viewer
•
Updated
•
110M
•
1.46k
•
97
Updated
•
95
•
2
Viewer
•
Updated
•
602k
•
9.22k
•
144
Viewer
•
Updated
•
4.48B
•
60.2k
•
710
Viewer
•
Updated
•
1.55k
•
32
•
4
Updated
•
7.95k
•
138
Viewer
•
Updated
•
59.1k
•
1.63k
•
12
keremberke/license-plate-object-detection
Viewer
•
Updated
•
8.83k
•
822
•
33
Updated
•
28
•
8
Viewer
•
Updated
•
98.6k
•
1.32k
•
100
nebius/SWE-agent-trajectories
Viewer
•
Updated
•
80k
•
775
•
66
Viewer
•
Updated
•
3.4k
•
4.38k
•
53
cfahlgren1/react-code-instructions
Viewer
•
Updated
•
74.4k
•
328
•
154
DAMO-NLP-SG/multimodal_textbook
Updated
•
3.81k
•
156
NovaSky-AI/Sky-T1_data_17k
Viewer
•
Updated
•
16.4k
•
263
•
186
Viewer
•
Updated
•
5.45B
•
7.94k
•
438
Viewer
•
Updated
•
546M
•
21.5k
•
901
hoskinson-center/proof-pile
Viewer
•
Updated
•
363k
•
4.84k
•
63
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
•
3.5B
•
299k
•
893
EleutherAI/the_pile_deduplicated
Viewer
•
Updated
•
134M
•
11.6k
•
107
MohamedRashad/multilingual-tts
Viewer
•
Updated
•
25.5k
•
198
•
45
Viewer
•
Updated
•
16.4k
•
51
•
4
facebook/multilingual_librispeech
Viewer
•
Updated
•
1.49M
•
19.7k
•
167
Viewer
•
Updated
•
1.25M
•
14.4k
•
85
Viewer
•
Updated
•
2.77M
•
5.49k
•
113
Fumika/Wikinews-multilingual
Viewer
•
Updated
•
15.2k
•
64
•
7
ayymen/Weblate-Translations
Viewer
•
Updated
•
11.7M
•
2.88k
•
16
Updated
•
58.9k
•
153
Helsinki-NLP/opus_wikipedia
Viewer
•
Updated
•
1.75M
•
172
•
10
Viewer
•
Updated
•
3.59M
•
108
•
1
MLCommons/unsupervised_peoples_speech
Updated
•
21.9k
•
69
HKUSTAudio/Llasa_opensource_speech_data_160k_hours_tokenized
Updated
•
225
•
30
Viewer
•
Updated
•
10k
•
2.98k
•
521
Viewer
•
Updated
•
68.1k
•
150k
•
20
allenai/RLVR-GSM-MATH-IF-Mixed-Constraints
Viewer
•
Updated
•
29.9k
•
999
•
30
allenai/olmo-2-0325-32b-preference-mix
Updated
•
220
•
15
allenai/tulu-3-sft-olmo-2-mixture-0225
Viewer
•
Updated
•
866k
•
802
•
22
Viewer
•
Updated
•
170M
•
54.2k
•
88
Viewer
•
Updated
•
621M
•
36k
•
84
Viewer
•
Updated
•
932
•
16k
•
583
Congliu/Chinese-DeepSeek-R1-Distill-data-110k
Viewer
•
Updated
•
110k
•
385
•
717
Viewer
•
Updated
•
102k
•
224
•
46
Viewer
•
Updated
•
450k
•
12.3k
•
688
Viewer
•
Updated
•
167M
•
1.97k
•
60