arxiv:2605.31159
Boris Shaposhnikov
borisshapa
AI & ML interests
NLP
Recent Activity
authored a paper 3 days ago
Trust-Region Behavior Blending for On-Policy Distillation
upvoted a paper 12 days ago
Trust-Region Behavior Blending for On-Policy Distillation
upvoted a paper 4 months ago
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare
Organizations
None yet