Qwen/WorldPM-72B-RLHFLow
Text Classification
Transformers
Safetensors
RLHFlow/pair_data_v2_80K_wsafety
English
qwen2
feature-extraction
Modeling World Preference
WorldPM
reward model
preference model
preference model pretraining
PMP
custom_code
text-generation-inference
arxiv: 2505.10527
License: apache-2.0
Commit History: WorldPM-72B-RLHFLow (branch: main)
da08c94 (verified): Update modeling_qwen2_rm.py, littlebird13 committed 28 days ago
52a42ed (verified): Update README.md, refrain-wbh committed 28 days ago
cd66e7f (verified): Update README.md, littlebird13 committed 29 days ago
f946fa0 (verified): Create LICENSE, littlebird13 committed 29 days ago
c6b46bf (verified): Delete configuration.json, littlebird13 committed 29 days ago
5b325a4 (verified): Create README.md, littlebird13 committed 29 days ago
71f18f9 (verified): Delete .mv, littlebird13 committed 29 days ago
c1e50ef (verified): Delete .msc, littlebird13 committed 29 days ago
b50cb5b (verified): Add files using upload-large-folder tool, littlebird13 committed 29 days ago
168ebe0 (verified): initial commit, clonefy committed 29 days ago