AI & ML interests

A large-scale, real-robot benchmark platform for embodied intelligence

Recent Activity

AdinaY 
posted an update about 6 hours ago
Chinese open source AI in December 2025 was about the stack coming together: open, end to end, and ready to ship 🔥

https://huggingface.co/collections/zh-ai-community/december-2025-china-open-source-highlights

✨ Big wave of foundation models: still scaling, but efficiency, reasoning, and deployment now matter more than size
- DeepSeek-V3.2
- Z.ai GLM-4.7
- MiniMax-M2.1
- Xiaomi: MiMo-V2-Flash

✨ Multimodal reasoning is now the default
- Z.ai GLM-4.6V
- Z.ai AutoGLM-Phone 9B
- ByteDance: Dolphin-v2

✨ Image & video: editable assets and real workflows
- Qwen-Image-Layered / Image-2512
- Meituan: LongCat-Image & Image Edit
- AIDC: Ovis-Image-7B
- Live-Avatar / LongCat-Video-Avatar
- HY-WorldPlay / RealVideo

✨ Audio goes edge-ready
- GLM-ASR-Nano / Fun-ASR-Nano
- GLM-TTS / VoxCPM1.5
- CosyVoice 0.5B

✨ The quiet backbone: data & infrastructure
- Finch (FinWorkBench)
- Tencent ARC: TimeLens-100K
- BIGAI: TongSIM-Asset
- MiniMax: VTP-Base

✨ Also, congrats to MiniMax and Z.ai on announcing their IPOs, and to Moonshot on its new $500M funding round 🔥

Like everyone else, I was OOO at the end of December, so feel free to share (in the comments or via PR) any releases I missed in this list!
AdinaY 
posted an update 1 day ago
2025.1 - DeepSeek entered the scene, backed by quant fund High-Flyer
2026.1 - IQuest enters the game, backed by quant fund Ubiquant 📈 and launching IQuest-Coder on Hugging Face
https://huggingface.co/collections/IQuestLab/iquest-coder

✨ 40B models: Instruct / Thinking / Loop
✨ Loop = MoE-level performance with only ~5% extra training cost
✨ Native 128K context
AdinaY 
posted an update 20 days ago
Finch 💰 an enterprise-grade benchmark that measures whether AI agents can truly handle real-world finance & accounting work.

FinWorkBench/Finch

✨ Built from real enterprise data (Enron + financial institutions), not synthetic tasks
✨ Tests end-to-end finance workflows
✨ Multimodal & cross-file reasoning
✨ Expert-annotated (700+ hours) and genuinely hard
AdinaY 
posted an update 2 months ago
Kimi K2 Thinking is now live on the hub 🔥

moonshotai/Kimi-K2-Thinking

✨ 1T MoE for deep reasoning & tool use
✨ Native INT4 quantization = 2× faster inference
✨ 256K context window
✨ Modified MIT license
AdinaY 
posted an update 2 months ago
Chinese open source AI in October wasn’t about bigger models, it was about real world impact 🔥

https://huggingface.co/collections/zh-ai-community/october-2025-china-open-source-highlights

✨ Vision-Language & OCR wave 🌊
- DeepSeek-OCR: 3B
- PaddleOCR-VL: 0.9B
- Qwen3-VL: 2B / 4B / 8B / 32B / 30B-A3B
- Open-Bee: Bee-8B-RL
- Z.ai Glyph: 10B

OCR is industrializing; the real game now is understanding the (long-context) document, not just reading it.

✨ Text generation: scale or innovation?
- MiniMax-M2: 229B
- Ant Group Ling-1T & Ring-1T
- Moonshot Kimi-Linear: linear-attention challenger
- Kwaipilot KAT-Dev

Efficiency is the key.

✨ Any-to-Any & World Model: one step closer to the real world
- BAAI Emu 3.5
- Ant Group Ming-flash-omni
- HunyuanWorld-Mirror: 3D

Aligning with the global “world model” trend.

✨ Audio & Speech + Video & Visual: releases from everyone, from entertainment labs to delivery platforms
- SoulX-Podcast TTS
- LongCat-Audio-Codec & LongCat-Video by Meituan, a delivery platform
- xiabs DreamOmni 2

Looking forward to what's next 🚀
AdinaY 
posted an update 2 months ago
Ming-flash-omni Preview 🚀 Multimodal foundation model from Ant Group

inclusionAI/Ming-flash-omni-Preview

✨ Built on Ling-Flash-2.0: 100B total / 6B active
✨ Generative segmentation-as-editing
✨ SOTA contextual & dialect ASR
✨ High-fidelity image generation
AdinaY 
posted an update 2 months ago
Glyph 🔥 a framework that scales context length by compressing text into images and processing them with vision–language models, released by Z.ai.

Paper: https://huggingface.co/papers/2510.17800
Model: https://huggingface.co/zai-org/Glyph

✨ Compresses long sequences visually to bypass token limits
✨ Reduces computational and memory costs
✨ Preserves meaning through multimodal encoding
✨ Built on GLM-4.1V-9B-Base
AdinaY 
posted an update 3 months ago
HunyuanWorld-Mirror 🔥 a versatile feed-forward model for universal 3D world reconstruction by Tencent

tencent/HunyuanWorld-Mirror

✨ Any prior in → 3D world out
✨ Mix camera poses, intrinsics, and depth as priors
✨ Predict point clouds, normals, Gaussians & more in one pass
✨ Unified architecture for all 3D tasks