DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation Paper • 2506.20639 • Published 2 days ago • 1
Pre-trained Large Language Models Learn Hidden Markov Models In-context Paper • 2506.07298 • Published 19 days ago • 23
Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models Paper • 2506.06751 • Published 21 days ago • 72
Seedance 1.0: Exploring the Boundaries of Video Generation Models Paper • 2506.09113 • Published 17 days ago • 90
DiffusionBlocks: Blockwise Training for Generative Models via Score-Based Diffusion Paper • 2506.14202 • Published 11 days ago • 3
Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression Paper • 2506.09482 • Published 17 days ago • 46
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs Paper • 2506.14245 • Published 11 days ago • 35
VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models Paper • 2505.23656 • Published 29 days ago • 24
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding Paper • 2505.22618 • Published about 1 month ago • 42
Time Blindness: Why Video-Language Models Can't See What Humans Can? Paper • 2505.24867 • Published 28 days ago • 77
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Paper • 2505.22617 • Published about 1 month ago • 122
B-score: Detecting biases in large language models using response history Paper • 2505.18545 • Published May 24 • 30
LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models Paper • 2505.19223 • Published May 25 • 8