Update README.md
Browse files
README.md
CHANGED
|
@@ -107,17 +107,37 @@ This model is a fine-tuned version of [meta-llama/Llama-3.1-8B-Instruct](https:/
|
|
| 107 |
It achieves the following results on the evaluation set:
|
| 108 |
- Loss: 1.1173
|
| 109 |
|
| 110 |
-
## Model description
|
| 111 |
-
|
| 112 |
-
More information needed
|
| 113 |
|
| 114 |
## Intended uses & limitations
|
| 115 |
|
| 116 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 117 |
|
| 118 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 119 |
|
| 120 |
-
More information needed
|
| 121 |
|
| 122 |
## Training procedure
|
| 123 |
|
|
|
|
| 107 |
It achieves the following results on the evaluation set:
|
| 108 |
- Loss: 1.1173
|
| 109 |
|
|
|
|
|
|
|
|
|
|
| 110 |
|
| 111 |
## Intended uses & limitations
|
| 112 |
|
| 113 |
+
The model performs best in summarization tasks, specifically in English and maybe Chinese. The model provides reasoning ON/OFF via system prompt trigger, all instructions should be contained within the user prompt.
|
| 114 |
+
|
| 115 |
+
Reasoning on example:
|
| 116 |
+
```json
|
| 117 |
+
messages = [
|
| 118 |
+
{"role": "system", "content": "reasoning on"},
|
| 119 |
+
{"role": "user", "content": "Summarize the following into 5 bullet points, each with 20 words max.\n\nMarch 28 (Reuters) -..."}
|
| 120 |
+
]
|
| 121 |
+
|
| 122 |
+
# output
|
| 123 |
+
- Elon Musk's xAI acquires X ...
|
| 124 |
+
```
|
| 125 |
|
| 126 |
+
Reasoning off example:
|
| 127 |
+
```json
|
| 128 |
+
messages = [
|
| 129 |
+
{"role": "system", "content": "reasoning off"},
|
| 130 |
+
{"role": "user", "content": "Summarize the following into 5 bullet points, each with 20 words max.\n\nMarch 28 (Reuters) -..."}
|
| 131 |
+
]
|
| 132 |
+
|
| 133 |
+
# output
|
| 134 |
+
<think>
|
| 135 |
+
Okay, I need to summarize this article into 5 bullet points, each with a maximum of 20 words. ...
|
| 136 |
+
</think>
|
| 137 |
+
|
| 138 |
+
- Musk's xAI acquires X ...
|
| 139 |
+
```
|
| 140 |
|
|
|
|
| 141 |
|
| 142 |
## Training procedure
|
| 143 |
|