prudant committed on
Commit e4f23fb · verified · 1 Parent(s): c11530c

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -14,6 +14,8 @@ pipeline_tag: feature-extraction
 
 This is a compressed version of Qwen/Qwen3-Embedding-0.6B using llm-compressor with the following scheme: W8A8
 
+**Important**: You MUST read the following guide for correct usage of this model here [Guide](https://github.com/vllm-project/vllm/pull/19260)
+
 ## Model Details
 
 - **Original Model**: Qwen/Qwen3-Embedding-0.6B