Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published 4 days ago • 24
Skywork/Skywork-Reward-V2-Llama-3.1-8B-40M Text Classification • 8B • Updated about 8 hours ago • 240 • 6
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy Paper • 2507.01352 • Published 5 days ago • 38
KnutJaegersberg/Llama-3_3-Nemotron-Super-49B-GenRM-Multilingual-Q6_K-GGUF 50B • Updated 4 days ago • 24
KnutJaegersberg/Llama-3_3-Nemotron-Super-49B-GenRM-Multilingual-Q6_K-GGUF 50B • Updated 4 days ago • 24
nvidia/Llama-3_3-Nemotron-Super-49B-GenRM-Multilingual Text Generation • 50B • Updated 10 days ago • 43 • 6
Reward Models Collection Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 4 days ago • 11