Understanding Likelihood Over-optimisation in Direct Alignment Algorithms Paper • 2410.11677 • Published Oct 15, 2024 • 1
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models Paper • 2405.05417 • Published May 8, 2024 • 1