Resonance RoPE: Improving Context Length Generalization of Large Language Models Paper • 2403.00071 • Published Feb 29, 2024 • 25
Mutual Adversarial Training: Learning together is better than going alone Paper • 2112.05005 • Published Dec 9, 2021 • 1
A2Q: Accumulator-Aware Quantization with Guaranteed Overflow Avoidance Paper • 2308.13504 • Published Aug 25, 2023