Exploring the Latent Capacity of LLMs for One-Step Text Generation • arXiv:2505.21189 • Published May 27
Time Transfer: On Optimal Learning Rate and Batch Size In The Infinite Data Limit • arXiv:2410.05838 • Published Oct 8, 2024