Papers
AI & ML interests
R3 Model is all you need
Recent Activity
View all activity
models
66

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-14B-LoRA-4k
Text Generation
•
Updated
•
14

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-8B-14k
Text Generation
•
Updated
•
13

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-4B-14k
Text Generation
•
Updated
•
13

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-LoRA-4k
15B
•
Updated
•
7

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-LoRA-14k
15B
•
Updated
•
10

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-14k
Text Generation
•
15B
•
Updated
•
10

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-4k
Text Generation
•
15B
•
Updated
•
11

rubricreward/R3-Phi-4-reasoning-plus-LoRA-14k
15B
•
Updated
•
13

rubricreward/R3-Qwen3-14B-LoRA-14k
15B
•
Updated
•
13

rubricreward/R3-Qwen3-8B-LoRA-14k
Text Generation
•
8B
•
Updated
•
8
•
2
datasets
86
rubricreward/HumanEval-XL-Python
Viewer
•
Updated
•
3.68k
rubricreward/MMMLU
Viewer
•
Updated
•
197k
•
6
rubricreward/HelpSteer3
Viewer
•
Updated
•
40.5k
•
11
rubricreward/PolyGuardMix
Viewer
•
Updated
•
2.99M
•
22
rubricreward/arena-human-preference
Viewer
•
Updated
•
120k
•
16
rubricreward/R3-eval-XSUM-new
Viewer
•
Updated
•
5.36k
•
150
rubricreward/R3-eval-MMLU-STEM
Viewer
•
Updated
•
6.31k
•
164
rubricreward/R3-eval-BBH
Viewer
•
Updated
•
13.5k
•
155
rubricreward/R3-eval-RM-Bench-new
Viewer
•
Updated
•
11.9k
•
134
rubricreward/R3-eval-reward-bench-new
Viewer
•
Updated
•
2.99k
•
178