caiyuchen

AI & ML interests

None yet

Organizations

None yet

Recent Activity

authored a paper 10 days ago

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Paper • 2605.11739 • Published 15 days ago • 58

upvoted a paper 10 days ago

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Paper • 2605.11739 • Published 15 days ago • 58

submitted a paper to Daily Papers 10 days ago

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Paper • 2605.11739 • Published 15 days ago • 58

upvoted a paper 3 months ago

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

Paper • 2602.12036 • Published Feb 12 • 94

updated 16 models 6 months ago

caiyuchen/Spiral-step-14

Text Generation • 4B • Updated Nov 15, 2025 • 1

caiyuchen/Spiral-step-22

Text Generation • 4B • Updated Nov 15, 2025 • 2

caiyuchen/Spiral-step-13

Text Generation • 4B • Updated Nov 15, 2025

caiyuchen/Spiral-step-21

Text Generation • 4B • Updated Nov 15, 2025 • 2

caiyuchen/Spiral-step-12

Text Generation • 4B • Updated Nov 15, 2025

caiyuchen/Spiral-step-20

Text Generation • 4B • Updated Nov 15, 2025 • 1

caiyuchen/Spiral-step-11

Text Generation • 4B • Updated Nov 15, 2025 • 1

caiyuchen/Spiral-step-19

Text Generation • 4B • Updated Nov 15, 2025

caiyuchen/Spiral-step-10

Text Generation • 4B • Updated Nov 15, 2025

caiyuchen/Spiral-step-18

Text Generation • 4B • Updated Nov 15, 2025

caiyuchen/Spiral-step-9

Text Generation • 4B • Updated Nov 15, 2025 • 2

caiyuchen/Spiral-step-17

Text Generation • 4B • Updated Nov 15, 2025 • 2

caiyuchen/Spiral-step-16

Text Generation • 4B • Updated Nov 15, 2025 • 1

caiyuchen/Spiral-step-15

Text Generation • 4B • Updated Nov 15, 2025 • 3

caiyuchen/Spiral-step-8

Text Generation • 4B • Updated Nov 15, 2025 • 1

caiyuchen/Spiral-step-7

Text Generation • 4B • Updated Nov 15, 2025