SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning Paper • 2504.07891 • Published 6 days ago • 3 • 2
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Paper • 2504.07866 • Published 6 days ago • 7 • 3
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model Paper • 2504.07615 • Published 7 days ago • 20 • 2
MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft Paper • 2504.08388 • Published 6 days ago • 37 • 3
BlenderGym: Benchmarking Foundational Model Systems for Graphics Editing Paper • 2504.01786 • Published 15 days ago • 4 • 2
Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images Paper • 2504.08727 • Published 5 days ago • 8 • 2
ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration Paper • 2504.08591 • Published 6 days ago • 14 • 2
Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging Paper • 2504.08635 • Published 5 days ago • 3 • 2
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization Paper • 2504.08641 • Published 5 days ago • 4 • 2
CoRAG: Collaborative Retrieval-Augmented Generation Paper • 2504.01883 • Published 14 days ago • 8 • 2
UKBOB: One Billion MRI Labeled Masks for Generalizable 3D Medical Image Segmentation Paper • 2504.06908 • Published 8 days ago • 4 • 2
PixelFlow: Pixel-Space Generative Models with Flow Paper • 2504.07963 • Published 6 days ago • 14 • 6
Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models Paper • 2504.05262 • Published 9 days ago • 8 • 4