TeleChat3-36B-Thinking: ✨ Native support for the Ascend + MindSpore ecosystem ✨ Inspired by DeepSeek’s architecture design, bringing training stability and efficiency gains.
✨ Hybrid Architecture: combined autoregressive + diffusion design delivers strong semantic alignment with high-fidelity details ✨ Strong performance in long, dense, and multilingual text rendering ✨ MIT licensed (VQ tokenizer & ViT weights under Apache 2.0) ✨ Now live on Hugging Face inference provider 🤗
AgentCPM-Explore🔥 on device agent foundation model released by OpenBMB openbmb/AgentCPM-Explore ✨ 4B - Apache2.0 ✨ Supports 100+ multi-turn environment interactions with search + verification ✨ Full training/inference stack is openly shared as well
✨ Big wave of foundation models: still scaling, but efficiency, reasoning, and deployment now matter more than size - DeepSeek-V3.2 - Z.ai GLM-4.7 - MiniMax-M2.1 - Xiaomi: MiMo-V2-Flash
✨ Multimodal reasoning is now default - Z.ai GLM-4.6V - Z.ai AutoGLM-Phone 9B - Bytedance: Dolphin-v2