HuggingFaceH4/orca_dpo_pairs
Viewer
•
Updated
•
12.9k
•
511
•
29
A collection of datasets and models used for the Aligning LLMs with Direct Preference Optimization Methods blogpost