R-Zero: Self-Evolving Reasoning LLM from Zero Data Paper • 2508.05004 • Published 21 days ago • 122
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published 19 days ago • 163
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL Paper • 2508.13167 • Published 21 days ago • 115
Post 3523 Qwen3-30B-A3B-Thinking-2507 🔥 The latest step in scaling thinking capabilities from the Alibaba Qwen team. Qwen/Qwen3-30B-A3B-Thinking-2507-FP8 ✨ 30B total / 3B active - Apache 2.0 ✨ Native 256K context ✨ SOTA coding, alignment, agentic reasoning
OpenReasoning-Nemotron Collection Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. • 6 items • Updated 13 days ago • 42
Article Welcome Llama 4 Maverick & Scout on Hugging Face! By burtenshaw and 6 others • Apr 5 • 146
Bielik-7B-v0.1 Collection A collection of models based on Bielik-7B-v0.1 - base model, instructional and quantized versions, and MLX (Apple). • 9 items • Updated Jun 6 • 5
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free Paper • 2410.10814 • Published Oct 14, 2024 • 52