morriszms commited on
Commit
11e3031
·
verified ·
1 Parent(s): b0bc40b

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ gemma-3-1b-it-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ gemma-3-1b-it-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ gemma-3-1b-it-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ gemma-3-1b-it-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ gemma-3-1b-it-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ gemma-3-1b-it-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
42
+ gemma-3-1b-it-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
43
+ gemma-3-1b-it-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
44
+ gemma-3-1b-it-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
45
+ gemma-3-1b-it-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
46
+ gemma-3-1b-it-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
47
+ gemma-3-1b-it-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,123 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: gemma
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ extra_gated_heading: Access Gemma on Hugging Face
6
+ extra_gated_prompt: To access Gemma on Hugging Face, you’re required to review and
7
+ agree to Google’s usage license. To do this, please ensure you’re logged in to Hugging
8
+ Face and click below. Requests are processed immediately.
9
+ extra_gated_button_content: Acknowledge license
10
+ base_model: google/gemma-3-1b-it
11
+ tags:
12
+ - TensorBlock
13
+ - GGUF
14
+ ---
15
+
16
+ <div style="width: auto; margin-left: auto; margin-right: auto">
17
+ <img src="https://i.imgur.com/jC7kdl8.jpeg" alt="TensorBlock" style="width: 100%; min-width: 400px; display: block; margin: auto;">
18
+ </div>
19
+ <div style="display: flex; justify-content: space-between; width: 100%;">
20
+ <div style="display: flex; flex-direction: column; align-items: flex-start;">
21
+ <p style="margin-top: 0.5em; margin-bottom: 0em;">
22
+ Feedback and support: TensorBlock's <a href="https://x.com/tensorblock_aoi">Twitter/X</a>, <a href="https://t.me/TensorBlock">Telegram Group</a> and <a href="https://x.com/tensorblock_aoi">Discord server</a>
23
+ </p>
24
+ </div>
25
+ </div>
26
+
27
+ ## google/gemma-3-1b-it - GGUF
28
+
29
+ This repo contains GGUF format model files for [google/gemma-3-1b-it](https://huggingface.co/google/gemma-3-1b-it).
30
+
31
+ The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b5165](https://github.com/ggml-org/llama.cpp/commit/1d735c0b4fa0551c51c2f4ac888dd9a01f447985).
32
+
33
+ ## Our projects
34
+ <table border="1" cellspacing="0" cellpadding="10">
35
+ <tr>
36
+ <th style="font-size: 25px;">Awesome MCP Servers</th>
37
+ <th style="font-size: 25px;">TensorBlock Studio</th>
38
+ </tr>
39
+ <tr>
40
+ <th><img src="https://imgur.com/2Xov7B7.jpeg" alt="Project A" width="450"/></th>
41
+ <th><img src="https://imgur.com/pJcmF5u.jpeg" alt="Project B" width="450"/></th>
42
+ </tr>
43
+ <tr>
44
+ <th>A comprehensive collection of Model Context Protocol (MCP) servers.</th>
45
+ <th>A lightweight, open, and extensible multi-LLM interaction studio.</th>
46
+ </tr>
47
+ <tr>
48
+ <th>
49
+ <a href="https://github.com/TensorBlock/awesome-mcp-servers" target="_blank" style="
50
+ display: inline-block;
51
+ padding: 8px 16px;
52
+ background-color: #FF7F50;
53
+ color: white;
54
+ text-decoration: none;
55
+ border-radius: 6px;
56
+ font-weight: bold;
57
+ font-family: sans-serif;
58
+ ">👀 See what we built 👀</a>
59
+ </th>
60
+ <th>
61
+ <a href="https://github.com/TensorBlock/TensorBlock-Studio" target="_blank" style="
62
+ display: inline-block;
63
+ padding: 8px 16px;
64
+ background-color: #FF7F50;
65
+ color: white;
66
+ text-decoration: none;
67
+ border-radius: 6px;
68
+ font-weight: bold;
69
+ font-family: sans-serif;
70
+ ">👀 See what we built 👀</a>
71
+ </th>
72
+ </tr>
73
+ </table>
74
+
75
+ ## Prompt template
76
+
77
+ ```
78
+ <bos><start_of_turn>user
79
+ {system_prompt}
80
+
81
+ {prompt}<end_of_turn>
82
+ <start_of_turn>model
83
+ ```
84
+
85
+ ## Model file specification
86
+
87
+ | Filename | Quant type | File Size | Description |
88
+ | -------- | ---------- | --------- | ----------- |
89
+ | [gemma-3-1b-it-Q2_K.gguf](https://huggingface.co/tensorblock/google_gemma-3-1b-it-GGUF/blob/main/gemma-3-1b-it-Q2_K.gguf) | Q2_K | 0.007 GB | smallest, significant quality loss - not recommended for most purposes |
90
+ | [gemma-3-1b-it-Q3_K_S.gguf](https://huggingface.co/tensorblock/google_gemma-3-1b-it-GGUF/blob/main/gemma-3-1b-it-Q3_K_S.gguf) | Q3_K_S | 0.007 GB | very small, high quality loss |
91
+ | [gemma-3-1b-it-Q3_K_M.gguf](https://huggingface.co/tensorblock/google_gemma-3-1b-it-GGUF/blob/main/gemma-3-1b-it-Q3_K_M.gguf) | Q3_K_M | 0.007 GB | very small, high quality loss |
92
+ | [gemma-3-1b-it-Q3_K_L.gguf](https://huggingface.co/tensorblock/google_gemma-3-1b-it-GGUF/blob/main/gemma-3-1b-it-Q3_K_L.gguf) | Q3_K_L | 0.007 GB | small, substantial quality loss |
93
+ | [gemma-3-1b-it-Q4_0.gguf](https://huggingface.co/tensorblock/google_gemma-3-1b-it-GGUF/blob/main/gemma-3-1b-it-Q4_0.gguf) | Q4_0 | 0.007 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
94
+ | [gemma-3-1b-it-Q4_K_S.gguf](https://huggingface.co/tensorblock/google_gemma-3-1b-it-GGUF/blob/main/gemma-3-1b-it-Q4_K_S.gguf) | Q4_K_S | 0.007 GB | small, greater quality loss |
95
+ | [gemma-3-1b-it-Q4_K_M.gguf](https://huggingface.co/tensorblock/google_gemma-3-1b-it-GGUF/blob/main/gemma-3-1b-it-Q4_K_M.gguf) | Q4_K_M | 0.007 GB | medium, balanced quality - recommended |
96
+ | [gemma-3-1b-it-Q5_0.gguf](https://huggingface.co/tensorblock/google_gemma-3-1b-it-GGUF/blob/main/gemma-3-1b-it-Q5_0.gguf) | Q5_0 | 0.007 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
97
+ | [gemma-3-1b-it-Q5_K_S.gguf](https://huggingface.co/tensorblock/google_gemma-3-1b-it-GGUF/blob/main/gemma-3-1b-it-Q5_K_S.gguf) | Q5_K_S | 0.007 GB | large, low quality loss - recommended |
98
+ | [gemma-3-1b-it-Q5_K_M.gguf](https://huggingface.co/tensorblock/google_gemma-3-1b-it-GGUF/blob/main/gemma-3-1b-it-Q5_K_M.gguf) | Q5_K_M | 0.007 GB | large, very low quality loss - recommended |
99
+ | [gemma-3-1b-it-Q6_K.gguf](https://huggingface.co/tensorblock/google_gemma-3-1b-it-GGUF/blob/main/gemma-3-1b-it-Q6_K.gguf) | Q6_K | 0.007 GB | very large, extremely low quality loss |
100
+ | [gemma-3-1b-it-Q8_0.gguf](https://huggingface.co/tensorblock/google_gemma-3-1b-it-GGUF/blob/main/gemma-3-1b-it-Q8_0.gguf) | Q8_0 | 0.007 GB | very large, extremely low quality loss - not recommended |
101
+
102
+
103
+ ## Downloading instruction
104
+
105
+ ### Command line
106
+
107
+ Firstly, install Huggingface Client
108
+
109
+ ```shell
110
+ pip install -U "huggingface_hub[cli]"
111
+ ```
112
+
113
+ Then, downoad the individual model file the a local directory
114
+
115
+ ```shell
116
+ huggingface-cli download tensorblock/google_gemma-3-1b-it-GGUF --include "gemma-3-1b-it-Q2_K.gguf" --local-dir MY_LOCAL_DIR
117
+ ```
118
+
119
+ If you wanna download multiple model files with a pattern (e.g., `*Q4_K*gguf`), you can try:
120
+
121
+ ```shell
122
+ huggingface-cli download tensorblock/google_gemma-3-1b-it-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
123
+ ```
gemma-3-1b-it-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3b4431b9861ea8cd7adabc16e21e1d56923b7ebc54a18217652f42d2fcc34b8d
3
+ size 6512960
gemma-3-1b-it-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:61402c4bfae82abdf5fe5426fe7641114ff922cd8e2d6a453563124e71f53bb4
3
+ size 6512960
gemma-3-1b-it-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2a7c576e9ff8748c05ed638ef838f56122f89acb16851d1c226d2dfe4aa4b091
3
+ size 6512960
gemma-3-1b-it-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bcd1bfc5003c182a9e66538b254c89aa0401560feee367bbb65cc22ae41f1ee2
3
+ size 6512960
gemma-3-1b-it-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec97c0ceb4dcb2d281fab0bb9c636403708480deea0890d95fe40f97b6c3c791
3
+ size 6512960
gemma-3-1b-it-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9859272cea878b82962865d990f8a9c8fcac5c7b6efb042aa917044dcf3ddfd1
3
+ size 6512960
gemma-3-1b-it-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4d17fdfa2be97d5c2cebbd5d55ffbc778edf394db9b5f8d191284b168a171715
3
+ size 6512960
gemma-3-1b-it-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6f3907f3e77f13e1a8269a0e3ddcb93e9062c5280131b6ef86ef9e2ffedffc3d
3
+ size 6512960
gemma-3-1b-it-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2588ca775396d979b8d0070f62f7a26232a74b3b7bf5e168234b6334bf868401
3
+ size 6512960
gemma-3-1b-it-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3aa5ca5890ccc411878c51681c2403df5bef55a934408404f798d8e52348262f
3
+ size 6512960
gemma-3-1b-it-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:79d4ac30aca411389fa6bd9d7d39f39c234d002c2142f82f703bfa222b4469a6
3
+ size 6512960
gemma-3-1b-it-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e9f1e847b066cc2ad481b5fea7f5963a5c0de536b9d35af96025c86c3c740fce
3
+ size 6512960