wangpeiyi's picture

3 6 7

wangpeiyi

peiyi9979

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Reinforcement Pre-Training

upvoted a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

liked a model 6 months ago

deepseek-ai/DeepSeek-R1

View all activity

Organizations

Papers 1

arxiv:2306.04387

spaces 1

My Metric

models 3

peiyi9979/math-shepherd-mistral-7b-rl

Text Generation • Updated Jan 15, 2024 • 1.12k • 6

peiyi9979/mistral-7b-sft

Text Generation • Updated Jan 15, 2024 • 1.15k • 7

peiyi9979/math-shepherd-mistral-7b-prm

Text Generation • Updated Jan 15, 2024 • 3.11k • 47

datasets 1

peiyi9979/Math-Shepherd

Viewer • Updated Jan 3, 2024 • 445k • 203 • 98