Update README.md
Browse files
README.md
CHANGED
@@ -4,7 +4,7 @@ pipeline_tag: text-generation
|
|
4 |
license: cc-by-nc-4.0
|
5 |
---
|
6 |
|
7 |
-
This repository contains the Guru-
|
8 |
|
9 |
The leaderboard is evaluated with our evaluation [code](https://github.com/LLM360/Reasoning360/tree/main/scripts/offline_eval). The parameters we set in evaluation for all models: temperature=1.0, top_p=0.7.
|
10 |
|
|
|
4 |
license: cc-by-nc-4.0
|
5 |
---
|
6 |
|
7 |
+
This repository contains the Guru-7B (base Qwen2.5-7B) model presented in [Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective](https://huggingface.co/papers/2506.14965).
|
8 |
|
9 |
The leaderboard is evaluated with our evaluation [code](https://github.com/LLM360/Reasoning360/tree/main/scripts/offline_eval). The parameters we set in evaluation for all models: temperature=1.0, top_p=0.7.
|
10 |
|