back-prop commited on
Commit
451c63c
·
verified ·
1 Parent(s): 2972ae6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -22
README.md CHANGED
@@ -13,29 +13,8 @@ language:
13
  # Qwen2.5-3B-GRPO-MATH-1EPOCH
14
 
15
  **Description:**
16
- A GRPO-fine-tuned version of Qwen2.5-3B-Instruct trained on the MATH dataset. It is optimized to produce more accurate contest-style math solutions.
17
 
18
- ---
19
-
20
- ## Usage
21
-
22
- ```python
23
- from transformers import pipeline
24
-
25
- generator = pipeline(
26
- "text-generation",
27
- model="USERNAME/Qwen-2.5-3B-GRPO-Math",
28
- device="cuda"
29
- )
30
-
31
- prompt = "Evaluate the integral ∫₀¹ x² dx."
32
- result = generator(
33
- [{"role": "user", "content": prompt}],
34
- max_new_tokens=50,
35
- return_full_text=False
36
- )[0]
37
- print(result["generated_text"])
38
- ````
39
 
40
  ---
41
 
 
13
  # Qwen2.5-3B-GRPO-MATH-1EPOCH
14
 
15
  **Description:**
 
16
 
17
+ A GRPO-fine-tuned version of Qwen2.5-3B trained on the MATH dataset.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
 
19
  ---
20