Hugging Face
Models 48
Sort: Trending
Active filters: togethercomputer/RedPajama-Data-V2
LSX-UniWue/LLaMmlein_120M • Text Generation • Updated Apr 7 • 447 • 4
UNSAFE/Mixtress-135M • Updated Oct 13, 2024 • 6
LSX-UniWue/LLaMmlein_1B • Text Generation • Updated Nov 22, 2024 • 641 • 14
mradermacher/dynamo-8B-v0.1-GGUF • Updated Nov 3, 2024 • 35
mradermacher/dynamo-8B-v0.1-i1-GGUF • Updated Nov 3, 2024 • 25
mradermacher/KAI-7B-Instruct-GGUF • Updated Dec 15, 2024 • 109
mradermacher/KAI-7B-Instruct-i1-GGUF • Updated Dec 15, 2024 • 138
dotwee/LLaMmlein_1B_CoreML • Text Generation • Updated Nov 28, 2024 • 4 • 1
bekrus/LLaMmlein_1B-Q4-mlx • Text Generation • Updated Feb 4 • 6
bekrus/LLaMmlein_1B-Q8-mlx • Text Generation • Updated Feb 4 • 12
bekrus/LLaMmlein_120M-Q8-mlx • Text Generation • Updated Feb 4 • 10
mradermacher/Mixtress-135M-GGUF • Updated Feb 9 • 86
mradermacher/titulm-mpt-1b-v2.0-GGUF • Updated Feb 23 • 66
mradermacher/titulm-mpt-1b-v2.0-i1-GGUF • Updated Feb 23 • 57
almanach/moderncamembert-base • Fill-Mask • Updated Apr 14 • 242 • 4
almanach/moderncamembert-cv2-base • Fill-Mask • Updated Apr 14 • 94 • 3
LSX-UniWue/ModernGBERT_1B • Feature Extraction • Updated 3 days ago • 248 • 3
LSX-UniWue/ModernGBERT_134M • Feature Extraction • Updated 4 days ago • 389 • 4