A Comprehensive Survey on Long Context Language Modeling • Paper • 2503.17407 • Published 5 days ago • 36
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs • Paper • 2503.01307 • Published 22 days ago • 33
microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • Updated 4 days ago • 767k • 1.23k
SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference • Paper • 2502.18137 • Published 28 days ago • 54
CodeDPO/qwen25-coder-inst-7b-reinforce-plus_v2_mini_processed_r1_cold_start • Updated 27 days ago • 35