trl-lib's Collections
Prompt-only datasets
Updated 21 days ago
trl-lib/ultrafeedback-prompt • Updated Jan 8 • 39.8k • 4.19k • 9
trl-lib/DeepMath-103K • Updated 21 days ago • 103k • 1.89k