WPRM
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
52

WPRM/llama-3.1-8b-ar-rm-mtl
8B
•
Updated

WPRM/qwen3-8b-ar-reward-cot-mtl-checklist-enhanced
8B
•
Updated
•
1

WPRM/qwen-3b-ar-reward-cot-mtl-checklist-enhanced
3B
•
Updated
•
35

WPRM/qwen3-8b-checklist-enhanced
8B
•
Updated
•
1

WPRM/qwen3-ar-reward-cot-mtl-same-ratio-epoch2
8B
•
Updated
•
1

WPRM/qwen3-ar-reward-cot-mtl
8B
•
Updated
•
1

WPRM/qwen3-ar-reward-cot-mtl-epoch1
8B
•
Updated
•
1

WPRM/qwen2_5vl-3b_ar_reward_cot_multimodal_mtl
4B
•
Updated
•
1

WPRM/qwen2.5-ar-reward-cot-mtl
3B
•
Updated
•
472

WPRM/qwen2_5vl-3b_ar_reward_cot_multimodal_final_new
4B
•
Updated
•
1
datasets
105
WPRM/checklist_dataset_sharegpt_for_offline_ppo_short_axtree
Viewer
•
Updated
•
4.62k
•
82
WPRM/checklist_dataset_sharegpt_for_offline_ppo_tiny_axtree
Viewer
•
Updated
•
8
•
44
WPRM/checklist_dataset_sharegpt_for_offline_ppo_axtree
Viewer
•
Updated
•
9.46k
•
43
WPRM/ours_8b_mtl_enhanced_annotated_walite_combined_checklist
Viewer
•
Updated
•
812
•
10
WPRM/ours_3b_mtl_enhanced_annotated_walite_combined_checklist
Viewer
•
Updated
•
812
•
7
WPRM/ours_8b_enhanced_annotated_walite_combined_checklist
Viewer
•
Updated
•
812
•
5
WPRM/WebShepherd_train_multimodal_final_0513_checklist_only
Viewer
•
Updated
•
3.63k
•
8
WPRM/WebShepherd_train_text_only_final_0513_checklist_only
Viewer
•
Updated
•
3.63k
•
12
WPRM/WebShepherd_train_multimodal_final_0513
Viewer
•
Updated
•
43.2k
•
7
WPRM/WebShepherd_train_text_only_final_0513
Viewer
•
Updated
•
43.2k
•
11