yuxuanxie

yuxuan99

AI & ML interests

None yet

Recent Activity

reacted to Jaward's post with 🤗 about 2 months ago

nice clean GRPO implementation:
- no transformers
- no vllm
- has improved grpo (DAPO)
- under 300 lines
- runs on 24GB (RTX 4090 GPU)

Code: https://github.com/policy-gradient/GRPO-Zero

replied to Jaward's post about 2 months ago

upvoted an article 3 months ago

Open R1: Update #3

Organizations

yuxuan99's activity

liked a dataset 3 months ago

PrimeIntellect/verifiable-coding-problems

Updated Feb 6 • 144k • 592 • 32

liked a dataset 5 months ago

HuggingFaceFW/fineweb

Updated Jan 31 • 25B • 356k • 2.19k