ViStoryBench: Comprehensive Benchmark Suite for Story Visualization • Paper • 2505.24862 • Published 11 days ago • 31
Time Blindness: Why Video-Language Models Can't See What Humans Can? • Paper • 2505.24867 • Published 11 days ago • 75
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time • Paper • 2505.24863 • Published 11 days ago • 90
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models • Paper • 2505.24864 • Published 11 days ago • 118
Taming LLMs by Scaling Learning Rates with Gradient Grouping • Paper • 2506.01049 • Published 9 days ago • 36
From Guidelines to Practice: A New Paradigm for Arabic Language Model Evaluation • Paper • 2506.01920 • Published 8 days ago • 4
MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling • Paper • 2505.15772 • Published 20 days ago • 2
ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement • Paper • 2504.01934 • Published Apr 2 • 23
PaperBench: Evaluating AI's Ability to Replicate AI Research • Paper • 2504.01848 • Published Apr 2 • 36
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization • Paper • 2504.00999 • Published Apr 1 • 92