Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
9
78
liuheng
heng
Follow
21world's profile picture
ltim's profile picture
kirch's profile picture
5 followers
·
17 following
liuheng2cqupt
AI & ML interests
None yet
Recent Activity
upvoted
an
article
2 days ago
KV Cache from scratch in nanoVLM
upvoted
an
article
2 days ago
🕳️ Attention Sinks in LLMs for endless fluency
upvoted
an
article
2 days ago
Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What’s Really Changing in Transformers?
View all activity
Organizations
None yet
spaces
1
No application file
Code Gen
💻
models
2
Sort: Recently updated
heng/Qwen2.5-1.5B-Open-R1-GRPO
Updated
Feb 19
heng/distilgpt2-finetuned-wikitext2
0.1B
•
Updated
Aug 17, 2024
•
2
datasets
0
None public yet