Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
reward model
Inference Endpoints
AutoTrain Compatible
text-generation-inference
4-bit precision
custom_code
Carbon Emissions
8-bit precision
Eval Results
Mixture of Experts
Misc with no match
Merge
text-embeddings-inference
Apply filters
Models
102
Full-text search
Edit filters
Sort: Trending
Active filters:
reward model
Clear all
Qwen/Qwen2.5-Math-PRM-72B
Text Classification
•
Updated
about 23 hours ago
•
358
•
51
Qwen/Qwen2.5-Math-PRM-7B
Text Classification
•
Updated
about 23 hours ago
•
943
•
36
Qwen/Qwen2.5-Math-7B-PRM800K
Text Classification
•
Updated
about 23 hours ago
•
103
•
7
Qwen/Qwen2.5-Math-RM-72B
Text Classification
•
Updated
Oct 31, 2024
•
11.7k
•
71
berkeley-nest/Starling-LM-7B-alpha
Text Generation
•
Updated
Mar 20, 2024
•
17.6k
•
558
internlm/internlm2-1_8b-reward
Text Classification
•
Updated
Jul 15, 2024
•
4.87k
•
11
internlm/internlm2-20b-reward
Text Classification
•
Updated
Oct 9, 2024
•
519
•
23
nvidia/Llama-3.1-Nemotron-70B-Reward-HF
Updated
Oct 15, 2024
•
13.7k
•
78
nicholasKluge/RewardModelPT
Text Classification
•
Updated
Jun 18, 2024
•
160
nicholasKluge/RewardModel
Text Classification
•
Updated
Jun 18, 2024
•
117
Ablustrund/moss-rlhf-reward-model-7B-zh
Updated
Jul 13, 2023
•
2
•
23
fnlp/moss-rlhf-reward-model-7B-en
Updated
Jul 13, 2023
•
9
berkeley-nest/Starling-RM-7B-alpha
Updated
Jul 30, 2024
•
45
•
102
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2
Text Generation
•
Updated
Nov 27, 2023
•
11
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2
Text Generation
•
Updated
Nov 27, 2023
•
14
•
1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2
Text Generation
•
Updated
Nov 27, 2023
•
12
•
2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2
Text Generation
•
Updated
Nov 27, 2023
•
13
•
1
LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2
Text Generation
•
Updated
Nov 27, 2023
•
12
•
2
TheBloke/Starling-LM-7B-alpha-GGUF
Updated
Nov 28, 2023
•
669
•
94
TheBloke/Starling-LM-7B-alpha-AWQ
Text Generation
•
Updated
Nov 28, 2023
•
26
•
9
second-state/Starling-LM-7B-alpha-GGUF
Text Generation
•
Updated
Mar 20, 2024
•
57
•
3
TheBloke/Starling-LM-7B-alpha-GPTQ
Text Generation
•
Updated
Nov 28, 2023
•
26
•
9
bartowski/Starling-LM-7B-alpha-old-exl2
Text Generation
•
Updated
Nov 28, 2023
CallComply/Starling-LM-11B-alpha
Text Generation
•
Updated
Mar 4, 2024
•
719
•
12
TheBloke/Starling-LM-alpha-8x7B-MoE-GGUF
Updated
Dec 16, 2023
•
101
•
9
TheBloke/Starling-LM-alpha-8x7B-MoE-GPTQ
Text Generation
•
Updated
Dec 17, 2023
•
13
•
2
bartowski/Starling-LM-7B-alpha-exl2
Text Generation
•
Updated
Dec 27, 2023
gizmo-ai/Starling-LM-7B-alpha
Text Generation
•
Updated
Jan 12, 2024
•
13
gizmo-ai/Starling-LM-7B-alpha-AWQ
Text Generation
•
Updated
Jan 12, 2024
•
10
MaziyarPanahi/Starling-LM-7B-alpha-GGUF
Text Generation
•
Updated
Feb 4, 2024
•
52
•
1
Previous
1
2
3
4
Next