Hugging Face
Models
109
Sort: Trending
Active filters:
reward model
bunnycore/Starling-LM-7B-beta-laser-dpo-Q5_K_M-GGUF • Updated Apr 6, 2024 • 1 • 1
QuantFactory/Starling-LM-7B-alpha-GGUF • Text Generation • Updated Apr 10, 2024 • 35
RaincloudAi/Starling-LM-7B-alpha-Q4_K_M-GGUF • Updated Apr 10, 2024 • 1
ManniX-ITA/Starling-LM-7B-beta-LaserRMT-v1 • Text Generation • Updated Apr 12, 2024 • 8 • 2
johnsnowlabs/JSL-MedMNX-7B • Text Generation • Updated Apr 18, 2024 • 2.71k • 4
johnsnowlabs/JSL-MedMNX-7B-SFT • Text Generation • Updated Apr 18, 2024 • 2.69k • 3
mradermacher/JSL-MedMNX-7B-GGUF • Updated May 6, 2024 • 135 • 1
codeIA/GuIA-v2 • Text Generation • Updated Apr 22, 2024 • 6 • 1
johnsnowlabs/JSL-MedMNX-7B-v2.0 • Text Generation • Updated Apr 22, 2024 • 2.69k • 3
jieliu/Storm-7B • Text Generation • Updated Jun 18, 2024 • 18 • 41
newsletter/Starling-LM-7B-beta-Q6_K-GGUF • Updated May 14, 2024 • 8
nicholasKluge/Aux-RewardModel • Text Classification • Updated Jun 18, 2024 • 126
nicholasKluge/Aux-RewardModelPT • Text Classification • Updated Jun 18, 2024 • 112
nvidia/Llama3-70B-SteerLM-RM • Updated Jun 19, 2024 • 16 • 42
EmbeddedLLM/Starling-LM-7b-beta-onnx • Text Generation • Updated Jun 17, 2024
mradermacher/Storm-7B-GGUF • Updated Jun 18, 2024 • 5
mradermacher/Storm-7B-i1-GGUF • Updated Aug 2, 2024 • 151 • 1
internlm/internlm2-7b-reward • Text Classification • Updated Jul 15, 2024 • 858 • 17
NikolayKozloff/Storm-7B-Q8_0-GGUF • Updated Jul 9, 2024 • 2 • 2
SteveTran/internlm2-20b-4bit-RM • Text Classification • Updated Jul 25, 2024 • 2
ttkciar/Starling-LM-11B-alpha-Q4_K_M-GGUF • Updated Aug 2, 2024 • 41
wangclnlp/robust_visual_reward_model • Updated Aug 23, 2024 • 2
hellork/Starling-LM-7B-beta-IQ4_NL-GGUF • Updated Sep 17, 2024
Qwen/Qwen2-Math-RM-72B • Text Classification • Updated Sep 18, 2024 • 6 • 3
mlx-community/nvidia-Llama-3.1-Nemotron-70B-Reward-HF-AQ41 • Updated Oct 2, 2024 • 5
second-state/Llama-3.1-Nemotron-70B-Reward-HF-GGUF • Text Generation • Updated Oct 19, 2024 • 179 • 1
gaianet/Llama-3.1-Nemotron-70B-Reward-HF-GGUF • Text Generation • Updated Oct 19, 2024 • 83 • 1
yale-nlp/MDCureRM • Updated Nov 22, 2024 • 30 • 3
mradermacher/Starling-LM-7B-alpha-GGUF • Updated Nov 4, 2024 • 85 • 1
tensorblock/Starling-LM-7B-alpha-GGUF • Updated Nov 16, 2024 • 50