arxiv:2504.17950
Isadora White
izzcw
AI & ML interests
LLMs, Reinforcement Learning, agents, embodiment, multi-agent collaboration
Recent Activity
upvoted
a
paper
27 days ago
Steering Autoregressive Music Generation with Recursive Feature Machines
upvoted
a
paper
4 months ago
Group Sequence Policy Optimization
published
a model
6 months ago
izzcw/dpo_model_3.1_8k