apepkuss79 commited on
Commit
746f9d4
·
verified ·
1 Parent(s): 482b2ea

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -50,7 +50,7 @@ tags:
50
  {{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
51
  ```
52
 
53
- - Context size: `4096`
54
 
55
  - Run as LlamaEdge service
56
 
@@ -58,7 +58,7 @@ tags:
58
  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Llama3.1-8B-Chinese-Chat-Q5_K_M.gguf \
59
  llama-api-server.wasm \
60
  --prompt-template llama-3-chat \
61
- --ctx-size 4096 \
62
  --model-name Llama-3-8B-Chinese-Chat \
63
  ```
64
 
@@ -68,7 +68,7 @@ tags:
68
  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Llama3.1-8B-Chinese-Chat-Q5_K_M.gguf \
69
  llama-chat.wasm \
70
  --prompt-template llama-3-chat \
71
- --ctx-size 4096
72
  ```
73
 
74
  ## Quantized GGUF Models
 
50
  {{ user_message_2 }}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
51
  ```
52
 
53
+ - Context size: `128000`
54
 
55
  - Run as LlamaEdge service
56
 
 
58
  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Llama3.1-8B-Chinese-Chat-Q5_K_M.gguf \
59
  llama-api-server.wasm \
60
  --prompt-template llama-3-chat \
61
+ --ctx-size 128000 \
62
  --model-name Llama-3-8B-Chinese-Chat \
63
  ```
64
 
 
68
  wasmedge --dir .:. --nn-preload default:GGML:AUTO:Llama3.1-8B-Chinese-Chat-Q5_K_M.gguf \
69
  llama-chat.wasm \
70
  --prompt-template llama-3-chat \
71
+ --ctx-size 128000
72
  ```
73
 
74
  ## Quantized GGUF Models