RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 10 days ago • 131
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published 16 days ago • 62
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 16 days ago • 351