20 56 92

Yu Zhang

yzhangcs

https://yzhang.site

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

Welcome Gemma 4: Frontier multimodal intelligence on device

liked a model about 2 months ago

meituan-longcat/LongCat-Flash-Lite

commentedon a paper 2 months ago

Attention Residuals

View all activity

Organizations

upvoted an article about 1 month ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 899

liked a model about 2 months ago

meituan-longcat/LongCat-Flash-Lite

Text Generation • Updated Feb 6 • 3.78k • 185

commented a paper 2 months ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 185 •

authored a paper 2 months ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 185

upvoted a paper 2 months ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 185

New activity in moonshotai/Kimi-Linear-48B-A3B-Instruct 5 months ago

Fix deprecated import for Transformers v5 compatibility

#19 opened 5 months ago by

hmellor

updated 2 models 6 months ago

moonshotai/Kimi-Linear-48B-A3B-Base

Text Generation • 49B • Updated Jan 30 • 926 • 73

moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated Dec 16, 2025 • 65.3k • • 560

liked a model 6 months ago

XiaomiMiMo/MiMo-7B-MTPs

Feature Extraction • Updated Nov 14, 2025 • 60 • 7

upvoted a paper 6 months ago

Kimi K2: Open Agentic Intelligence

Paper • 2507.20534 • Published Jul 28, 2025 • 15

updated a collection 6 months ago

Kimi-K2

Collection

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 5 items • Updated Jan 27 • 173

New activity in moonshotai/Kimi-Linear-48B-A3B-Instruct 6 months ago

Need assistance in running on Mac

#16 opened 6 months ago by

x-polyglot-x

fla-core is not enough

#13 opened 7 months ago by

amarinference

upvoted an article 6 months ago

Article

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

moonshotai

•

Jun 21, 2025

• 77

liked a model 6 months ago

cerebras/Kimi-Linear-REAP-35B-A3B-Instruct

Text Generation • 35B • Updated Nov 6, 2025 • 40 • 68

New activity in moonshotai/Kimi-Linear-48B-A3B-Instruct 7 months ago

Podcast

🚀 1

#12 opened 7 months ago by

dcaustin33

liked a model 7 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • 1.1T • Updated Jan 30 • 196k • • 1.7k

commented a paper 7 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 133 •

upvoted an article 7 months ago

Article

Ring-flash-linear-2.0: A Highly Efficient Hybrid Architecture for Test-Time Scaling

RichardBian

•

Oct 9, 2025

• 11

authored a paper 7 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 133

Yu Zhang

AI & ML interests

Recent Activity

Organizations

yzhangcs's activity

Welcome Gemma 4: Frontier multimodal intelligence on device

Fix deprecated import for Transformers v5 compatibility

Need assistance in running on Mac

fla-core is not enough

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

Podcast

Ring-flash-linear-2.0: A Highly Efficient Hybrid Architecture for Test-Time Scaling