Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities Paper • 2505.01043 • Published May 2, 2025
GhostNetV3: Exploring the Training Strategies for Compact Models Paper • 2404.11202 • Published Apr 17, 2024
ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning Paper • 2410.17779 • Published Oct 23, 2024
SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution Paper • 2402.17133 • Published Feb 27, 2024
Data-efficient Large Vision Models through Sequential Autoregression Paper • 2402.04841 • Published Feb 7, 2024
One-for-All: Bridge the Gap Between Heterogeneous Architectures in Knowledge Distillation Paper • 2310.19444 • Published Oct 30, 2023