Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 1 day ago • 60
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Paper • 2504.11536 • Published 22 days ago • 60
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published 26 days ago • 123
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction Paper • 2504.01014 • Published Apr 1 • 68
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization Paper • 2503.19901 • Published Mar 25 • 41
Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k Paper • 2503.09642 • Published Mar 12 • 18
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Apr 3 • 146