Michael Barry
MichaelBarryUK
AI & ML interests
None yet
Recent Activity
commented on a paper about 6 hours ago
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
commented on a paper about 23 hours ago
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
Organizations
None yet