facebook/dinov3-vit7b16-pretrain-lvd1689m Image Feature Extraction • 7B • Updated 4 days ago • 9.16k • 108
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published 15 days ago • 154