CommVQ: Commutative Vector Quantization for KV Cache Compression Paper • 2506.18879 • Published 4 days ago • 5
Enhancing Step-by-Step and Verifiable Medical Reasoning in MLLMs Paper • 2506.16962 • Published 8 days ago • 9
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation Paper • 2506.18349 • Published 5 days ago • 9
OAgents: An Empirical Study of Building Effective Agents Paper • 2506.15741 • Published 10 days ago • 31
Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models Paper • 2506.18945 • Published 5 days ago • 33
iiiorg/piiranha-v1-detect-personal-information Token Classification • 0.3B • Updated Sep 13, 2024 • 116k • 195
mistralai/Mistral-Small-3.2-24B-Instruct-2506 Image-Text-to-Text • 24B • Updated 5 days ago • 19.2k • 270
Inherently Faithful Attention Maps for Vision Transformers Paper • 2506.08915 • Published 17 days ago • 4
AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions Paper • 2506.09038 • Published 17 days ago • 7
Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards Paper • 2506.11474 • Published 15 days ago • 16
Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback Paper • 2506.11930 • Published 14 days ago • 53
Language Surgery in Multilingual Large Language Models Paper • 2506.12450 • Published 14 days ago • 16
AR-RAG: Autoregressive Retrieval Augmentation for Image Generation Paper • 2506.06962 • Published 20 days ago • 28
Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning Paper • 2506.13654 • Published 11 days ago • 42
Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency Paper • 2506.08343 • Published 18 days ago • 46
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents Paper • 2506.11763 • Published 15 days ago • 58