princepride commited on
Commit
d94bc85
·
verified ·
1 Parent(s): 7c85f99

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -1,8 +1,8 @@
1
  ---
2
- base_model: Qwen-2.5-7B-Simple-RL-Test
3
  datasets: DigitalLearningGmbH/MATH-lighteval
4
  library_name: transformers
5
- model_name: Qwen-2.5-7B_Base_Math_smalllr
6
  tags:
7
  - generated_from_trainer
8
  - open-r1
@@ -11,7 +11,7 @@ tags:
11
  licence: license
12
  ---
13
 
14
- # Model Card for Qwen-2.5-7B_Base_Math_smalllr
15
 
16
  This model is a fine-tuned version of [Qwen/Qwen2.5-Math-7B](https://huggingface.co/Qwen/Qwen2.5-Math-7B) on the [DigitalLearningGmbH/MATH-lighteval](https://huggingface.co/datasets/DigitalLearningGmbH/MATH-lighteval) dataset.
17
  It has been trained using [TRL](https://github.com/huggingface/trl).
 
1
  ---
2
+ base_model: Qwen-2.5-7B-Math
3
  datasets: DigitalLearningGmbH/MATH-lighteval
4
  library_name: transformers
5
+ model_name: Qwen-2.5-7B-Simple-RL-Test
6
  tags:
7
  - generated_from_trainer
8
  - open-r1
 
11
  licence: license
12
  ---
13
 
14
+ # Model Card for Qwen-2.5-7B-Simple-RL-Test
15
 
16
  This model is a fine-tuned version of [Qwen/Qwen2.5-Math-7B](https://huggingface.co/Qwen/Qwen2.5-Math-7B) on the [DigitalLearningGmbH/MATH-lighteval](https://huggingface.co/datasets/DigitalLearningGmbH/MATH-lighteval) dataset.
17
  It has been trained using [TRL](https://github.com/huggingface/trl).