From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models Paper • 2406.16838 • Published Jun 24, 2024 • 2
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? Paper • 2406.04391 • Published Jun 6, 2024 • 9
Suppressing Pink Elephants with Direct Principle Feedback Paper • 2402.07896 • Published Feb 12, 2024 • 11
Llemma: An Open Language Model For Mathematics Paper • 2310.10631 • Published Oct 16, 2023 • 56
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling Paper • 2304.01373 • Published Apr 3, 2023 • 9