Towards Understanding the Cognitive Habits of Large Reasoning Models Paper • 2506.21571 • Published Jun 13 • 1
Course-Correction: Safety Alignment Using Synthetic Preferences Paper • 2407.16637 • Published Jul 23, 2024 • 27