Taming the Titans: A Survey of Efficient LLM Inference Serving Paper • 2504.19720 • Published 6 days ago • 9
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning Paper • 2504.18904 • Published 8 days ago • 8
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax Paper • 2504.20966 • Published 5 days ago • 22
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published 4 days ago • 33
I-Con: A Unifying Framework for Representation Learning Paper • 2504.16929 • Published 11 days ago • 29
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models Paper • 2504.15279 • Published 13 days ago • 72
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference Paper • 2504.05897 • Published 26 days ago • 14
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 26 days ago • 156
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization Paper • 2503.19901 • Published Mar 25 • 40
Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion Paper • 2503.15851 • Published Mar 20 • 10
CLS-RL: Image Classification with Rule-Based Reinforcement Learning Paper • 2503.16188 • Published Mar 20 • 9
NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes Paper • 2503.16375 • Published Mar 20 • 9
LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds Paper • 2503.10625 • Published Mar 13 • 32
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity Paper • 2503.16418 • Published Mar 20 • 35
Unleashing Vecset Diffusion Model for Fast Shape Generation Paper • 2503.16302 • Published Mar 20 • 44
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published Mar 17 • 96