PRM-CoT

community

AI & ML interests

None defined yet.

Recent Activity

FlippyDora submitted a paper about 12 hours ago

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

FlippyDora authored a paper 2 months ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

FlippyDora submitted a paper 2 months ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

View all activity

PRM-CoT 's datasets

None public yet