From Bytes to Ideas: Language Modeling with Autoregressive U-Nets Paper • 2506.14761 • Published 7 days ago • 13
Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression Paper • 2506.09482 • Published 14 days ago • 46
Discrete Diffusion in Large Language and Multimodal Models: A Survey Paper • 2506.13759 • Published 8 days ago • 40