Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo Paper • 2503.09799 • Published 3 days ago • 10
FAX: Scalable and Differentiable Federated Primitives in JAX Paper • 2403.07128 • Published Mar 11, 2024 • 13