drkameleon Mungert commited on
Commit
c8e078b
·
verified ·
0 Parent(s):

Duplicate from Mungert/Phi-4-mini-instruct.gguf

Browse files

Co-authored-by: Mungert <[email protected]>

.gitattributes ADDED
@@ -0,0 +1,40 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ phi-4-mini-q4_k_l.gguf filter=lfs diff=lfs merge=lfs -text
37
+ phi-4-mini-bf16-q8.gguf filter=lfs diff=lfs merge=lfs -text
38
+ phi-4-mini-bf16.gguf filter=lfs diff=lfs merge=lfs -text
39
+ phi-4-mini-q6_k.gguf filter=lfs diff=lfs merge=lfs -text
40
+ phi-4-mini-q8.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+
5
+ # **Phi-4-mini-instruct GGUF Models**
6
+
7
+ This repository contains the **Phi-4-mini-instruct** model quantized using a specialized branch of **llama.cpp**:
8
+ 🔗 [ns3284/llama.cpp](https://github.com/ns3284/llama.cpp/tree/master)
9
+
10
+ Special thanks to [@nisparks](https://github.com/nisparks) for adding support for **Phi-4-mini-instruct** in **llama.cpp**.
11
+ This branch is expected to be merged into the master branch soon, so once that happens, it's recommended to use the main **llama.cpp** repository instead.
12
+
13
+ ---
14
+
15
+ ## **Included Files**
16
+
17
+ ### `phi-4-mini-bf16.gguf`
18
+ - Model weights preserved in **BF16**.
19
+ - Use this if you want to **requantize** the model into a different format.
20
+
21
+ ### `phi-4-mini-bf16-q8.gguf`
22
+ - **Output & embeddings** remain in **BF16**.
23
+ - All other layers quantized to **Q8_0**.
24
+
25
+ ### `phi-4-mini-q4_k_l.gguf`
26
+ - **Output & embeddings** quantized to **Q8_0**.
27
+ - All other layers quantized to **Q4_K**.
28
+ - **Note:** No custom matrix quantization applied, so default **llama.cpp** quantization settings are used.
29
+
30
+ ### `phi-4-mini-q6_k.gguf`
31
+ - All layers quantized to **Q6_K**, using **default quantization settings**.
phi-4-mini-bf16-q8.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5ac768c8e2f3853955ab6378de171f1c286a15d9b31ecccb598a580eaba70105
3
+ size 4660795552
phi-4-mini-bf16.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8f9253d91a880739f770f987fa61ffc886c82ea82416f1323d1173d851046e43
3
+ size 7680694432
phi-4-mini-q4_k_l.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:290e7a9d82926ff260ec537cd24f7223d630d9c5a3027a7d8068050b035d76c2
3
+ size 2640722080
phi-4-mini-q6_k.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cc50079a86c0a53de80c863c825c7903b618ade3840d6d387335c46c7c231953
3
+ size 3155623072
phi-4-mini-q8.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fdf04d4a2556e3271dc6e2d7187d5ebf7b4b61a42a4acb559930e3aecdd2cc41
3
+ size 4084611232