P1: Mastering Physics Olympiads with Reinforcement Learning Paper • 2511.13612 • Published Nov 17, 2025 • 134
weizechen/RL-Compositionality-Stage2-RL-Level8-TestData Viewer • Updated Oct 17, 2025 • 2.05k • 6 • 1
weizechen/RL-Compositionality-Stage2-RL-Level2-TrainData Viewer • Updated Oct 17, 2025 • 500k • 9 • 1
weizechen/RL-Compositionality-Stage2-RL-Level1-TrainData Viewer • Updated Oct 17, 2025 • 500k • 18 • 1
RL Compositionality Collection From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones. https://huggingface.co/papers/2509.25123 • 5 items • Updated Oct 17, 2025 • 1
From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones Paper • 2509.25123 • Published Sep 29, 2025 • 22