Update README.md
Browse files
README.md
CHANGED
|
@@ -1,7 +1,3 @@
|
|
| 1 |
-
# DeepSeek-R1-Distill-SRE-Qwen-32B-INT8
|
| 2 |
-
|
| 3 |
-
## Model Introduction
|
| 4 |
-
|
| 5 |
---
|
| 6 |
license: apache-2.0
|
| 7 |
datasets:
|
|
@@ -20,6 +16,10 @@ tags:
|
|
| 20 |
- deepseek
|
| 21 |
---
|
| 22 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 23 |
`DeepSeek-R1-Distill-SRE-Qwen-32B-INT8` is the industry's first publicly available operations large model. It is a specialized mixed-precision 8-bit quantized large language model fine-tuned from the `DeepSeek-R1-Distill-Qwen-32B` model, optimized specifically for **operations** and **Site Reliability Engineering (SRE)** scenarios. This model inherits the powerful reasoning capabilities of the DeepSeek-R1 series and has been further fine-tuned using the [ahmedgongi/Devops_LLM](https://huggingface.co/datasets/ahmedgongi/Devops_LLM) dataset, significantly enhancing its utility in the following tasks:
|
| 24 |
|
| 25 |
- Automated script generation
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
datasets:
|
|
|
|
| 16 |
- deepseek
|
| 17 |
---
|
| 18 |
|
| 19 |
+
# DeepSeek-R1-Distill-SRE-Qwen-32B-INT8
|
| 20 |
+
|
| 21 |
+
## Model Introduction
|
| 22 |
+
|
| 23 |
`DeepSeek-R1-Distill-SRE-Qwen-32B-INT8` is the industry's first publicly available operations large model. It is a specialized mixed-precision 8-bit quantized large language model fine-tuned from the `DeepSeek-R1-Distill-Qwen-32B` model, optimized specifically for **operations** and **Site Reliability Engineering (SRE)** scenarios. This model inherits the powerful reasoning capabilities of the DeepSeek-R1 series and has been further fine-tuned using the [ahmedgongi/Devops_LLM](https://huggingface.co/datasets/ahmedgongi/Devops_LLM) dataset, significantly enhancing its utility in the following tasks:
|
| 24 |
|
| 25 |
- Automated script generation
|