RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 10 days ago • 131
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published 16 days ago • 62
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 16 days ago • 351
Q-Filters Collection Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated 24 days ago • 6
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published about 1 month ago • 71
Text2World: Benchmarking Large Language Models for Symbolic World Model Generation Paper • 2502.13092 • Published Feb 18 • 13
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment Paper • 2502.10391 • Published Feb 14 • 33
Teuken-7B-v0.4 Collection OpenGPT-X Teuken 7B models trained on 4 trillion tokens • 4 items • Updated Dec 6, 2024 • 3
MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents Paper • 2502.05957 • Published Feb 9 • 16
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published Feb 10 • 148
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 271
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control Feb 4 • 123
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems Paper • 2411.02959 • Published Nov 5, 2024 • 70