Performance Prediction for Large Systems via Text-to-Text Regression Paper • 2506.21718 • Published 17 days ago • 5
TokenButler Collection TokenButler -- Predict token importance for all heads across the transformer in the first layer itself. Enable fine-grained token sparsity! • 6 items • Updated Mar 11 • 3