Datasets and trained checkpoints of Composition-RL
xuxin
xx18
AI & ML interests
None yet
Recent Activity
upvoted a paper 5 days ago
Progressive Residual Warmup for Language Model Pretraining authored
a paper
5 days ago
Progressive Residual Warmup for Language Model Pretraining authored
a paper
29 days ago
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models