mwitiderrick commited on
Commit
68dc6a4
·
1 Parent(s): f4d957f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -2
README.md CHANGED
@@ -11,8 +11,15 @@ tags:
11
  - deepsparse
12
  ---
13
  # MiniChat-3B - DeepSparse
14
- ## Usage
15
 
 
 
 
 
 
 
 
16
  ```python
17
  from deepsparse import TextGeneration
18
 
@@ -47,7 +54,15 @@ To become a great software engineer, you need to have a strong foundation in com
47
  By following these tips, you can become a great software engineer and develop software applications that can benefit society.
48
  """
49
  ```
50
- ## One-shot and Export
 
 
 
 
 
 
 
 
51
 
52
  ```bash
53
  git clone https://github.com/neuralmagic/sparseml
 
11
  - deepsparse
12
  ---
13
  # MiniChat-3B - DeepSparse
14
+ This repo contains model files for [MiniChat-3B ](https://huggingface.co/GeneZC/MiniChat-3B) optimized for [DeepSparse](https://github.com/neuralmagic/deepsparse), a CPU inference runtime for sparse models.
15
 
16
+ This model was quantized and pruned with [SparseGPT](https://arxiv.org/abs/2301.00774), using [SparseML](https://github.com/neuralmagic/sparseml).
17
+ ## Inference
18
+ Install [DeepSparse LLM](https://github.com/neuralmagic/deepsparse) for fast inference on CPUs:
19
+ ```bash
20
+ pip install deepsparse-nightly[llm]
21
+ ```
22
+ Run in a [Python pipeline](https://github.com/neuralmagic/deepsparse/blob/main/docs/llms/text-generation-pipeline.md):
23
  ```python
24
  from deepsparse import TextGeneration
25
 
 
54
  By following these tips, you can become a great software engineer and develop software applications that can benefit society.
55
  """
56
  ```
57
+ ## Prompt template
58
+ ```
59
+
60
+ <s> [|User|]\n
61
+ {prompt}
62
+ </s>[|Assistant|]\n
63
+ ```
64
+ ## Sparsification
65
+ For details on how this model was sparsified, see the `recipe.yaml` in this repo and follow the instructions below.
66
 
67
  ```bash
68
  git clone https://github.com/neuralmagic/sparseml