Models: 307
Active filters: argilla/dpo-mix-7k
lewtun/gemma-7b-dpo-full-mix1-beta-0.05-epoch-3 • Text Generation • Updated Mar 1, 2024 • 1
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.1 • Text Generation • Updated Mar 1, 2024
HuggingFaceH4/zephyr-7b-gemma-v0.1 • Text Generation • Updated Mar 3, 2024 • 504 • 124
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.05 • Text Generation • Updated Mar 1, 2024
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.2 • Text Generation • Updated Mar 1, 2024
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.4 • Text Generation • Updated Mar 1, 2024
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.01 • Text Generation • Updated Mar 1, 2024
bartowski/zephyr-7b-gemma-v0.1-exl2 • Text Generation • Updated Mar 1, 2024
LoneStriker/zephyr-7b-gemma-v0.1-GGUF • Text Generation • Updated Mar 8, 2024 • 46 • 7
LoneStriker/zephyr-7b-gemma-v0.1-3.0bpw-h6-exl2 • Text Generation • Updated Mar 2, 2024
LoneStriker/zephyr-7b-gemma-v0.1-4.0bpw-h6-exl2 • Text Generation • Updated Mar 2, 2024
LoneStriker/zephyr-7b-gemma-v0.1-5.0bpw-h6-exl2 • Text Generation • Updated Mar 2, 2024
LoneStriker/zephyr-7b-gemma-v0.1-6.0bpw-h6-exl2 • Text Generation • Updated Mar 2, 2024
LoneStriker/zephyr-7b-gemma-v0.1-8.0bpw-h8-exl2 • Text Generation • Updated Mar 2, 2024
mlx-community/zephyr-7b-gemma-v0.1-4bit • Text Generation • Updated Mar 2, 2024
abgoswam/zephyr-7b-gemma-dpo • Text Generation • Updated Mar 5, 2024
MaziyarPanahi/zephyr-7b-gemma-v0.1-GGUF • Text Generation • Updated Mar 7, 2024 • 135 • 6
eren23/ogno-monarch-jaskier-merge-7b-OH-PREF-DPO-v4-test • Text Generation • Updated Mar 9, 2024
wandb/mistral-7b-zephyr-dpo • Text Generation • Updated Mar 12, 2024 • 3 • 4
Syed-Hasan-8503/phi-2-ORPO • Text Generation • Updated Mar 17, 2024 • 2 • 6
abideen/phi2-pro • Text Generation • Updated Mar 17, 2024 • 8
alvarobartt/Mistral-7B-v0.1-ORPO • Text Generation • Updated Mar 23, 2024 • 3 • 14
alvarobartt/Mistral-7B-v0.1-ORPO-PEFT • Text Generation • Updated Mar 23, 2024 • 2 • 1
bartowski/Mistral-7B-v0.1-ORPO-exl2 • Text Generation • Updated Mar 24, 2024
bartowski/Mistral-7B-v0.1-ORPO-GGUF • Text Generation • Updated Mar 24, 2024 • 443 • 1
KeyonZeng/lion-gemma-7b • Text Generation • Updated Mar 26, 2024 • 6
alvarobartt/mistral-7b-orpo-alignment-handbook • Text Generation • Updated Mar 27, 2024 • 3
Trelis/TinyLlama-chat-ORPO-beta0.2 • Text Generation • Updated Mar 28, 2024
Trelis/Mistral-7B-v0.1-chat-ORPO • Text Generation • Updated Mar 28, 2024
Trelis/TinyLlama-chat-SFT • Text Generation • Updated Apr 3, 2024