Running 244 244 HF's Missing Inference Widget 💻 Interact with advanced AI models to get text responses
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale Paper • 2505.03005 • Published May 5 • 31