-
koreankiwi99/MNLP_M3_dpo_model
0.6B • Updated • 4 -
koreankiwi99/0_predpo_lower_beta_balanced_lower_beta_mnlp_aggregate
0.6B • Updated • 1 -
koreankiwi99/1_predpo_tuned_balanced_lower_beta_mnlp_aggregate
0.6B • Updated • 1 -
koreankiwi99/3_predpo_base_curriculum_lower_beta_mnlp_aggregate
0.6B • Updated • 1
KyuheeKim
koreankiwi99
AI & ML interests
None yet
Recent Activity
updated
a dataset
3 days ago
koreankiwi99/Nunchi-Bench
published
a dataset
3 days ago
koreankiwi99/Nunchi-Bench
liked
a dataset
6 days ago
gaotang/RM-R1-Entire-RLVR-Train
Organizations
2025_MNLP_M3_DPO
-
koreankiwi99/MNLP_M3_dpo_model
0.6B • Updated • 4 -
koreankiwi99/0_predpo_lower_beta_balanced_lower_beta_mnlp_aggregate
0.6B • Updated • 1 -
koreankiwi99/1_predpo_tuned_balanced_lower_beta_mnlp_aggregate
0.6B • Updated • 1 -
koreankiwi99/3_predpo_base_curriculum_lower_beta_mnlp_aggregate
0.6B • Updated • 1
epfl-lighteval-dpo-datasets
models
85
koreankiwi99/0_predpo_lower_beta_balanced_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
1
koreankiwi99/1_predpo_tuned_balanced_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
1
koreankiwi99/2_predpo_base_balanced_plus_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
1
koreankiwi99/3_predpo_base_curriculum_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
1
koreankiwi99/4_dpo_curriculum_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
1
koreankiwi99/5_dpo_balanced_plus_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
1
koreankiwi99/dpo_model_predpo_config_mnlp_aggregate
0.6B
•
Updated
•
1
koreankiwi99/sft_model_sft_base_mnlp_stem_curriculum
0.6B
•
Updated
•
1
koreankiwi99/sft_model_sft_base_mnlp_stem_balanced_plus
0.6B
•
Updated
•
1
koreankiwi99/6_predpo_base_lightweight_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
1
datasets
18
koreankiwi99/Nunchi-Bench
Preview
•
Updated
•
26
•
1
koreankiwi99/MNLP_M3_dpo_dataset
Viewer
•
Updated
•
135k
•
8
koreankiwi99/helpsteer3-dpo-general
Viewer
•
Updated
•
915
•
8
koreankiwi99/helpsteer3-dpo-stem
Viewer
•
Updated
•
243
•
16
koreankiwi99/helpsteer3-dpo-code
Viewer
•
Updated
•
432
•
11
koreankiwi99/mtbench-dpo-turn1-gpt4_pair
Viewer
•
Updated
•
882
•
10
koreankiwi99/mtbench-dpo-turn1-human
Viewer
•
Updated
•
1.28k
•
10
koreankiwi99/hh-dpo-eval
Viewer
•
Updated
•
8.53k
•
9
koreankiwi99/mnlp_stem_curriculum
Viewer
•
Updated
•
31.8k
•
16
koreankiwi99/mnlp_stem_balanced_plus
Viewer
•
Updated
•
40.8k
•
12