WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference Paper • 2505.19427 • Published May 26 • 10 • 2
DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs Paper • 2503.07067 • Published Mar 10 • 32 • 2