llm-blender/PairRM
Text Generation
Transformers
Safetensors
6 datasets
English
deberta
reward_model
reward-model
RLHF
evaluation
llm
instruction
reranking
Inference Endpoints
arxiv:2306.02561
arxiv:2112.09332
License: mit
Commit History: PairRM/README.md (revision 94512ba)
Update README.md (94512ba), yuchenlin committed on Nov 13, 2023
Update README.md (345b1ee), yuchenlin committed on Nov 12, 2023
Update README.md (7333fb2), Dongfu Jiang committed on Nov 11, 2023
Update README.md (14d4a72), Dongfu Jiang committed on Nov 11, 2023
Update README.md (a2f8211), Dongfu Jiang committed on Nov 11, 2023
Update README.md (edac579), Dongfu Jiang committed on Nov 11, 2023
Update README.md (8d9ead8), Dongfu Jiang committed on Nov 11, 2023
Update README.md (00b7e60), Dongfu Jiang committed on Nov 11, 2023
Update README.md (80230fd), Dongfu Jiang committed on Nov 11, 2023
Update README.md (0ef6e21), Dongfu Jiang committed on Nov 11, 2023
Update README.md (bb45a4c), Dongfu Jiang committed on Nov 11, 2023
initial commit (d09aaea), DongfuJiang committed on Nov 6, 2023