Learning Dynamics in Continual Pre-Training for Large Language Models Paper • 2505.07796 • Published 2 days ago • 16