yakazimir
·
AI & ML interests
NLP, ML
Organizations
yakazimir/on_policy_human_ai
Viewer
•
Updated
•
92.5k
•
26
yakazimir/preference_mixture
Viewer
•
Updated
•
81.8k
•
15
yakazimir/ultrafeedback_olmo1b_ref
Viewer
•
Updated
•
62.5k
•
9
yakazimir/mistral-instruct-ultrafeedback
Viewer
•
Updated
•
62.7k
•
9
yakazimir/ultrafeedback_binarized
Viewer
•
Updated
•
63.1k
•
121
•
1
yakazimir/llama3-ultrafeedback-armorm
Viewer
•
Updated
•
61.8k
•
7
Preview
•
Updated
•
13
yakazimir/preference_alignment_ultra_cut
Viewer
•
Updated
•
39.5k
•
4
yakazimir/preference_tuning_hh_ultra
Viewer
•
Updated
•
133k
•
4
•
1
yakazimir/preference_alignment_total
Viewer
•
Updated
•
153k
•
15
yakazimir/preference_tuning_full
yakazimir/preference_alignment_oasst
Viewer
•
Updated
•
22.3k
•
7
yakazimir/preference_tuning_hh
Viewer
•
Updated
•
72.9k
•
5
•
1
yakazimir/preference_tuning
Viewer
•
Updated
•
63.1k
•
8
•
1