TokenButler -- Predict token importance for all heads across the transformer in the first layer itself. Enable fine-grained token sparsity!
YASH AKHAURI
akhauriyash
AI & ML interests
None yet
Recent Activity
updated
a model
about 20 hours ago
akhauriyash/E2EGRPO_bm14B_32Gen_8GAcc_2K_2xAccF
upvoted
a
paper
2 days ago
Performance Prediction for Large Systems via Text-to-Text Regression
published
a model
9 days ago
akhauriyash/E2EGRPO_bm14B_32Gen_8GAcc_2K_2xAccF
Organizations
None yet