geoffmunn
/

Qwen3-Coder-30B-A3B-Instruct-f16

@@ -3,6 +3,10 @@ license: apache-2.0
 tags:
   - gguf
   - qwen
   - llama.cpp
   - quantized
   - text-generation
@@ -14,7 +18,7 @@ base_model: Qwen/Qwen3-Coder-30B-A3B-Instruct
 author: geoffmunn
 ---
-# Qwen3-Coder-30B-A3B-Instruct:Q6_K
 Quantized version of [Qwen/Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct) at **Q6_K** level, derived from **f16** base weights.
@@ -28,11 +32,11 @@ Quantized version of [Qwen/Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/
 ## Quality & Performance
-| Metric | Value |
-|-------|-------|
-| **Quality** | Near-FP16 |
-| **Speed** | 🐌 Slow |
-| **RAM Required** | ~37.5 GB |
 | **Recommendation** | Near-lossless. Minor gains. Use only if RAM allows. |
 ## Prompt Template (ChatML)
@@ -53,13 +57,13 @@ Set this in your app (LM Studio, OpenWebUI, etc.) for best results.
 Recommended defaults:
-| Parameter | Value |
-|---------|-------|
-| Temperature | 0.6 |
-| Top-P | 0.95 |
-| Top-K | 20 |
-| Min-P | 0.0 |
-| Repeat Penalty | 1.1 |
 Stop sequences: `<|im_end|>`, `<|im_start|>`
@@ -69,7 +73,7 @@ Here’s how you can query this model via API using `curl` and `jq`. Replace the
 ```bash
 curl http://localhost:11434/api/generate -s -N -d '{
-  "model": "hf.co/geoffmunn/Qwen3-Coder-30B-A3B-Instruct:Q6_K",
   "prompt": "Respond exactly as follows: Explain how photosynthesis converts sunlight into chemical energy in plants.",
   "temperature": 0.5,
   "top_p": 0.95,

 tags:
   - gguf
   - qwen
+  - qwen3-coder
+  - qwen3-coder-30b-q6
+  - qwen3-coder-30b-q6_k
+  - qwen3-coder-30b-q6_k-gguf
   - llama.cpp
   - quantized
   - text-generation
 author: geoffmunn
 ---
+# Qwen3-Coder-30B-A3B-Instruct-f16:Q6_K
 Quantized version of [Qwen/Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct) at **Q6_K** level, derived from **f16** base weights.
 ## Quality & Performance
+| Metric             | Value                                               |
+|--------------------|-----------------------------------------------------|
+| **Quality**        | Near-FP16                                           |
+| **Speed**          | 🐌 Slow                                             |
+| **RAM Required**   | ~37.5 GB                                            |
 | **Recommendation** | Near-lossless. Minor gains. Use only if RAM allows. |
 ## Prompt Template (ChatML)
 Recommended defaults:
+| Parameter      | Value |
+|----------------|-------|
+| Temperature    | 0.6   |
+| Top-P          | 0.95  |
+| Top-K          | 20    |
+| Min-P          | 0.0   |
+| Repeat Penalty | 1.1   |
 Stop sequences: `<|im_end|>`, `<|im_start|>`
 ```bash
 curl http://localhost:11434/api/generate -s -N -d '{
+  "model": "hf.co/geoffmunn/Qwen3-Coder-30B-A3B-Instruct-f16:Q6_K",
   "prompt": "Respond exactly as follows: Explain how photosynthesis converts sunlight into chemical energy in plants.",
   "temperature": 0.5,
   "top_p": 0.95,