POLAR - a internlm Collection

internlm 's Collections

POLAR

InternLM-XComposer2.5

OREAL

InternLM2-Reward

InternLM-XComposer2

POLAR

updated 7 days ago

internlm/POLAR-1_8B

Text Classification • Updated about 19 hours ago • 160 • 5
internlm/POLAR-1_8B-Base

Text Classification • Updated about 19 hours ago • 27
internlm/POLAR-7B

Text Classification • Updated about 19 hours ago • 526 • 19
internlm/POLAR-7B-Base

Text Classification • Updated about 19 hours ago • 37 • 3
Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published 8 days ago • 34