VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning • Paper • 2504.08837 • Published 10 days ago • 40
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model • Paper • 2504.10068 • Published 6 days ago • 29
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations • Paper • 2504.10481 • Published 6 days ago • 79
Efficient Generative Model Training via Embedded Representation Warmup • Paper • 2504.10188 • Published 6 days ago • 11
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce • Paper • 2504.11343 • Published 5 days ago • 12
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors • Paper • 2504.11427 • Published 5 days ago • 16
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs • Paper • 2504.11536 • Published 5 days ago • 50
D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation • Paper • 2504.09454 • Published 8 days ago • 11
Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning • Paper • 2504.08672 • Published 9 days ago • 52
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling • Paper • 2504.13169 • Published 3 days ago • 36
DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging • Paper • 2504.12364 • Published 4 days ago • 15
Iterative Self-Training for Code Generation via Reinforced Re-Ranking • Paper • 2504.09643 • Published 7 days ago • 33
DataDecide: How to Predict Best Pretraining Data with Small Experiments • Paper • 2504.11393 • Published 5 days ago • 14
Syzygy of Thoughts: Improving LLM CoT with the Minimal Free Resolution • Paper • 2504.09566 • Published 7 days ago • 8
AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference • Paper • 2504.10326 • Published 6 days ago • 24
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework • Paper • 2504.12395 • Published 4 days ago • 13
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation • Paper • 2504.13055 • Published 3 days ago • 16
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models • Paper • 2504.10449 • Published 6 days ago • 8
InteractVLM: 3D Interaction Reasoning from 2D Foundational Models • Paper • 2504.05303 • Published 13 days ago • 4
IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments • Paper • 2504.06827 • Published 11 days ago