DistAya
's Collections
Datasets
updated
shayekh/perplexity__aya_dataset__train
Updated
•
56
Viewer
•
Updated
•
540k
•
75
•
1
argilla/magpie-ultra-v0.1
Viewer
•
Updated
•
50k
•
925
•
221
Magpie-Align/Magpie-Qwen2-Pro-1M-v0.1
Viewer
•
Updated
•
1M
•
280
•
14
HuggingFaceTB/smollm-corpus
Viewer
•
Updated
•
237M
•
12.4k
•
309
Viewer
•
Updated
•
100k
•
13.5k
•
172
BanglaLLM/bangla-alpaca-orca
Viewer
•
Updated
•
172k
•
121
•
3
AhmadMustafa/Urdu-Instruct-News-Article-Generation
Viewer
•
Updated
•
112k
•
214
•
4
AhmadMustafa/Urdu-Instruct-News-Headline-Generation
Viewer
•
Updated
•
112k
•
117
AhmadMustafa/Urdu-Instruct-News-Category-Classification
Viewer
•
Updated
•
112k
•
310
Viewer
•
Updated
•
10k
•
269
•
40
akbargherbal/six_millions_instruction_dataset_for_arabic_llm_ft
Viewer
•
Updated
•
6.37M
•
148
•
1
CohereForAI/aya_collection_language_split
Viewer
•
Updated
•
514M
•
33.1k
•
95
Viewer
•
Updated
•
63k
•
843
•
34
Viewer
•
Updated
•
20.4M
•
5.34k
•
598
convaiinnovations/Nadi_Indic466k_Instruct
Viewer
•
Updated
•
466k
•
125
•
2
ai4bharat/indic-instruct-data-v0.1
Viewer
•
Updated
•
404k
•
675
•
24
Viewer
•
Updated
•
9.97k
•
56
•
2
HAERAE-HUB/qarv-instruct-ko
Viewer
•
Updated
•
10.2k
•
83
•
20
MarkrAI/KoCommercial-Dataset
Viewer
•
Updated
•
175k
•
1.24k
•
142