metadata
license: apache-2.0
base_model:
- Qwen/Qwen2.5-Math-7B
pipeline_tag: text-generation
🚨 This repo does not include the Process Reward Model (PRM). For access to the PRM, please refer to here.
This repository hosts a fine-tuned LLM optimized for better mathematical reasoning capabilities via only process rewards.