FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation • Paper 2506.04956 • Published 6 days ago • 3 upvotes
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs • Paper 2504.07866 • Published Apr 10 • 11 upvotes
Absolute Zero: Reinforced Self-play Reasoning with Zero Data • Paper 2505.03335 • Published May 6 • 170 upvotes
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale • Paper 2505.03005 • Published May 5 • 32 upvotes
Volume estimates for unions of convex sets, and the Kakeya set conjecture in three dimensions • Paper 2502.17655 • Published Feb 24 • 1 upvote
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention • Paper 2504.06261 • Published Apr 8 • 110 upvotes
People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text • Paper 2501.15654 • Published Jan 26 • 15 upvotes
xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference • Paper 2503.13427 • Published Mar 17 • 3 upvotes
RWKV-7 "Goose" with Expressive Dynamic State Evolution • Paper 2503.14456 • Published Mar 18 • 149 upvotes
Qwen2.5 Collection • Qwen2.5 language models, pretrained and instruction-tuned, in 7 sizes (0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B) • 46 items • Updated Apr 28 • 618 upvotes
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning • Paper 2502.06060 • Published Feb 9 • 37 upvotes
Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets • Paper 2410.01779 • Published Oct 2, 2024 • 2 upvotes