YYYYYYibo
AI & ML interests
None yet
Organizations
None yet
models
122
YYYYYYibo/simple_online_epoch_2_dpo_iter_6
7B
•
Updated
•
5
YYYYYYibo/simple_online_epoch_2_dpo_iter_5
7B
•
Updated
•
6
YYYYYYibo/simple_online_epoch_2_dpo_iter_4
Updated
•
28
YYYYYYibo/gshf_ours_1_iter_3
7B
•
Updated
•
4
YYYYYYibo/gshf_ours_1_iter_2
7B
•
Updated
•
4
YYYYYYibo/two_agent_1_epoch_2_dpo_iter_6
7B
•
Updated
•
6
YYYYYYibo/two_agent_1_epoch_2_rdpo_iter_6
7B
•
Updated
•
5
YYYYYYibo/approx_nash_again_1_iter_3
7B
•
Updated
•
4
YYYYYYibo/approx_nash_again_1_iter_2
7B
•
Updated
•
6
YYYYYYibo/approx_nash_again_iter_3
7B
•
Updated
•
7
YYYYYYibo/ultrafeedback_binarized_with_response_full_part2_mini
Viewer
•
Updated
•
2k
•
27
YYYYYYibo/ultrafeedback_binarized_with_response_full_part2
Viewer
•
Updated
•
21.1k
•
28
YYYYYYibo/ultrafeedback_binarized_with_response_full_part1_mini
Viewer
•
Updated
•
2k
•
32
YYYYYYibo/ultrafeedback_binarized_with_response_full_part1
Viewer
•
Updated
•
20k
•
29
YYYYYYibo/ultrafeedback_binarized_with_response_full_part0_mini
Viewer
•
Updated
•
2k
•
27
YYYYYYibo/ultrafeedback_binarized_with_response_full_part0
Viewer
•
Updated
•
20k
•
36
YYYYYYibo/ultrafeedback_binarized_gshf_lora_train_part_3
Viewer
•
Updated
•
21.1k
•
26
YYYYYYibo/ultrafeedback_binarized_gshf_lora_vllm_2_part_3
Viewer
•
Updated
•
21.1k
•
4
YYYYYYibo/ultrafeedback_binarized_gshf_lora_vllm_1_part_3
Viewer
•
Updated
•
21.1k
•
25
YYYYYYibo/ultrafeedback_binarized_gshf_lora_train_part_2
Viewer
•
Updated
•
20k
•
28