wangpeiyi
peiyi9979
22 followers · 2 following
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago: Reinforcement Pre-Training
upvoted a paper 2 months ago: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
liked a model 5 months ago: deepseek-ai/DeepSeek-R1
Organizations
Papers
1
arxiv:2306.04387
spaces
1
My Metric 💻 (Runtime error)
models
3
peiyi9979/math-shepherd-mistral-7b-rl • Text Generation • Updated Jan 15, 2024 • 1.54k • 6
peiyi9979/mistral-7b-sft • Text Generation • Updated Jan 15, 2024 • 1.24k • 7
peiyi9979/math-shepherd-mistral-7b-prm • Text Generation • Updated Jan 15, 2024 • 2.92k • 47
datasets
1
peiyi9979/Math-Shepherd • Viewer • Updated Jan 3, 2024 • 445k • 321 • 97