princepride
/

Qwen-2.5-7B-Simple-RL-Test

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Community

princepride commited on Feb 12

Commit

d94bc85

·

verified ·

1 Parent(s): 7c85f99

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -1,8 +1,8 @@
 ---
-base_model: Qwen-2.5-7B-Simple-RL-Test
 datasets: DigitalLearningGmbH/MATH-lighteval
 library_name: transformers
-model_name: Qwen-2.5-7B_Base_Math_smalllr
 tags:
 - generated_from_trainer
 - open-r1
@@ -11,7 +11,7 @@ tags:
 licence: license
 ---
-# Model Card for Qwen-2.5-7B_Base_Math_smalllr
 This model is a fine-tuned version of [Qwen/Qwen2.5-Math-7B](https://huggingface.co/Qwen/Qwen2.5-Math-7B) on the [DigitalLearningGmbH/MATH-lighteval](https://huggingface.co/datasets/DigitalLearningGmbH/MATH-lighteval) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).

 ---
+base_model: Qwen-2.5-7B-Math
 datasets: DigitalLearningGmbH/MATH-lighteval
 library_name: transformers
+model_name: Qwen-2.5-7B-Simple-RL-Test
 tags:
 - generated_from_trainer
 - open-r1
 licence: license
 ---
+# Model Card for Qwen-2.5-7B-Simple-RL-Test
 This model is a fine-tuned version of [Qwen/Qwen2.5-Math-7B](https://huggingface.co/Qwen/Qwen2.5-Math-7B) on the [DigitalLearningGmbH/MATH-lighteval](https://huggingface.co/datasets/DigitalLearningGmbH/MATH-lighteval) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).