Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
2
yuxuanxie
yuxuan99
Follow
0 followers
·
6 following
AI & ML interests
None yet
Recent Activity
reacted
to
Jaward
's
post
with 🤗
about 2 months ago
nice clean GRPO implementation: - no transformers - no vllm - has improved grpo (DAPO) - under 300 lines - runs on 24GB (RTX 4090 GPU) Code: https://github.com/policy-gradient/GRPO-Zero
replied
to
Jaward
's
post
about 2 months ago
nice clean GRPO implementation: - no transformers - no vllm - has improved grpo (DAPO) - under 300 lines - runs on 24GB (RTX 4090 GPU) Code: https://github.com/policy-gradient/GRPO-Zero
upvoted
an
article
3 months ago
Open R1: Update #3
View all activity
Organizations
yuxuan99
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
3 months ago
PrimeIntellect/verifiable-coding-problems
Viewer
•
Updated
Feb 6
•
144k
•
592
•
32
liked
a dataset
5 months ago
HuggingFaceFW/fineweb
Viewer
•
Updated
Jan 31
•
25B
•
356k
•
2.19k