Speed Always Wins: A Survey on Efficient Architectures for Large Language Models Paper • 2508.09834 • Published 11 days ago • 46
Efficient Attention Mechanisms for Large Language Models: A Survey Paper • 2507.19595 • Published 30 days ago • 6