esfrankel17/TEST_RUN_baseline-qwen2.5-7b-helpsteer-average_rating-rm-20250321-225205 Updated 4 days ago • 3
esfrankel17/TEST_RUN_baseline-qwen2.5-7b-helpsteer2-average_rating-rm-20250320-195616 Updated 5 days ago • 2
esfrankel17/TEST_RUN_baseline-qwen2.5-7b-helpsteer2-average_rating-rm-20250319-211531 Updated 6 days ago • 8
esfrankel17/ppi-rm-HelpSteer2-Qwen2.5-7B-type0-bs128-lr1e-5-ep1-gold0.1-lbda0.5-pseudo-Qwen2.5-72B Viewer • Updated about 5 hours ago • 1
esfrankel17/ppi-rm-HelpSteer2-Qwen2.5-7B-type0-bs128-lr1e-5-ep1-gold0.1-lbda0.5-pseudo-Qwen2.5-3B Viewer • Updated about 5 hours ago • 1
esfrankel17/ppi-rm-HelpSteer2-Qwen2.5-7B-type0-bs128-lr1e-5-ep1-gold0.1-lbda0.5-pseudo-Qwen2.5-0.5B Viewer • Updated about 6 hours ago • 1 • 2
esfrankel17/Nectar_10_pct_subsample_binarized_w_weak_preferences_cleaned Viewer • Updated about 20 hours ago • 18.3k • 8
esfrankel17/UltraFeedback_binarized_w_weak_preferences_cleaned Viewer • Updated 1 day ago • 119k • 96
esfrankel17/ChatbotArena55k_binarized_w_weak_preferences_cleaned Viewer • Updated 1 day ago • 37k • 9