whr94621's Collections
multilingual_domain_datasets
Updated Feb 17
Multilingual datasets, excluding those that are just cleaned versions of Common Crawl (CC).
nyuuzyou/edutexts • Preview • Updated Feb 12 • 14 • 3
llm-jp/AnswerCarefully • Viewer • Updated 6 days ago • 4.54k • 213 • 12
sander-wood/m4-rag • Viewer • Updated Feb 25 • 1.04M • 23 • 9