Upload 9 files
Browse files- .gitattributes +6 -0
- LICENSE +200 -0
- README.md +333 -0
- crystal-think-v2-iq4_nl-imat.gguf +3 -0
- crystal-think-v2-iq4_xs-imat.gguf +3 -0
- crystal-think-v2-logo.png +0 -0
- crystal-think-v2-q4_k_m-imat.gguf +3 -0
- crystal-think-v2-q4_k_s-imat.gguf +3 -0
- crystal-think-v2-q5_k_m-imat.gguf +3 -0
- crystal-think-v2-q5_k_s-imat.gguf +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,9 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
crystal-think-v2-iq4_nl-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
37 |
+
crystal-think-v2-iq4_xs-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
38 |
+
crystal-think-v2-q4_k_m-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
39 |
+
crystal-think-v2-q4_k_s-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
40 |
+
crystal-think-v2-q5_k_m-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
41 |
+
crystal-think-v2-q5_k_s-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
LICENSE
ADDED
@@ -0,0 +1,200 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
Apache License
|
2 |
+
Version 2.0, January 2004
|
3 |
+
http://www.apache.org/licenses/
|
4 |
+
|
5 |
+
TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
|
6 |
+
|
7 |
+
1. Definitions.
|
8 |
+
|
9 |
+
"License" shall mean the terms and conditions for use, reproduction,
|
10 |
+
and distribution as defined by Sections 1 through 9 of this document.
|
11 |
+
|
12 |
+
"Licensor" shall mean the copyright owner or entity granting the License.
|
13 |
+
|
14 |
+
"Legal Entity" shall mean the union of the acting entity and all
|
15 |
+
other entities that control, are controlled by, or are under common
|
16 |
+
control with that entity. For the purposes of this definition,
|
17 |
+
"control" means (i) the power, direct or indirect, to cause the
|
18 |
+
direction or management of such entity, whether by contract or
|
19 |
+
otherwise, or (ii) ownership of fifty percent (50%) or more of the
|
20 |
+
outstanding shares, or (iii) beneficial ownership of such entity.
|
21 |
+
|
22 |
+
"You" (or "Your") shall mean an individual or Legal Entity
|
23 |
+
exercising permissions granted by this License.
|
24 |
+
|
25 |
+
"Source" form shall mean the preferred form for making modifications,
|
26 |
+
including but not limited to software source code, documentation
|
27 |
+
source, and configuration files.
|
28 |
+
|
29 |
+
"Object" form shall mean any form resulting from mechanical
|
30 |
+
transformation or translation of a Source form, including but
|
31 |
+
not limited to compiled object code, generated documentation,
|
32 |
+
and conversions to other media types.
|
33 |
+
|
34 |
+
"Work" shall mean the work of authorship, whether in Source or
|
35 |
+
Object form, made available under the License, as indicated by a
|
36 |
+
copyright notice that is included in or attached to the work
|
37 |
+
(which shall not include a work that constitutes a Contribution).
|
38 |
+
|
39 |
+
"Derivative Works" shall mean any work, whether in Source or Object
|
40 |
+
form, that is based upon (or derived from) the Work and for which the
|
41 |
+
editorial revisions, annotations, elaborations, or other modifications
|
42 |
+
represent, as a whole, an original work of authorship. For the purposes
|
43 |
+
of this License, Derivative Works shall not include works that remain
|
44 |
+
separable from, or merely link (or bind by name) to the interfaces of,
|
45 |
+
the Work and derivative works thereof.
|
46 |
+
|
47 |
+
"Contribution" shall mean any work of authorship, including
|
48 |
+
the original version of the Work and any modifications or additions
|
49 |
+
to that Work or Derivative Works thereof, that is intentionally
|
50 |
+
submitted to Licensor for inclusion in the Work by the copyright owner
|
51 |
+
or by an individual or Legal Entity authorized to submit on behalf of
|
52 |
+
the copyright owner. For the purposes of this definition, "submitted"
|
53 |
+
means any form of electronic, verbal, or written communication sent
|
54 |
+
to the Licensor or its representatives, including but not limited to
|
55 |
+
communication on electronic mailing lists, source code control
|
56 |
+
systems, and issue tracking systems that are managed by, or on behalf
|
57 |
+
of, the Licensor for the purpose of discussing and improving the Work,
|
58 |
+
but excluding communication that is conspicuously marked or otherwise
|
59 |
+
designated in writing by the copyright owner as "Not a Contribution."
|
60 |
+
|
61 |
+
"Contributor" shall mean Licensor and any individual or Legal Entity
|
62 |
+
on behalf of whom a Contribution has been received by Licensor and
|
63 |
+
subsequently incorporated within the Work.
|
64 |
+
|
65 |
+
2. Grant of Copyright License. Subject to the terms and conditions of
|
66 |
+
this License, each Contributor hereby grants to You a perpetual,
|
67 |
+
worldwide, non-exclusive, no-charge, royalty-free, irrevocable
|
68 |
+
copyright license to use, reproduce, modify, display, perform,
|
69 |
+
sublicense, and distribute the Work and such Derivative Works in
|
70 |
+
Source or Object form.
|
71 |
+
|
72 |
+
3. Grant of Patent License. Subject to the terms and conditions of
|
73 |
+
this License, each Contributor hereby grants to You a perpetual,
|
74 |
+
worldwide, non-exclusive, no-charge, royalty-free, irrevocable
|
75 |
+
(except as stated in this section) patent license to make, have made,
|
76 |
+
use, offer to sell, sell, import, and otherwise transfer the Work,
|
77 |
+
where such license applies only to those patent claims licensable
|
78 |
+
by such Contributor that are necessarily infringed by their
|
79 |
+
Contribution(s) alone or by combination of their Contribution(s)
|
80 |
+
with the Work to which such Contribution(s) was submitted. If You
|
81 |
+
institute patent litigation against any entity (including a
|
82 |
+
cross-claim or counterclaim in a lawsuit) alleging that the Work
|
83 |
+
or a Contribution incorporated within the Work constitutes direct
|
84 |
+
or contributory patent infringement, then any patent licenses
|
85 |
+
granted to You under this License for that Work shall terminate
|
86 |
+
as of the date such litigation is filed.
|
87 |
+
|
88 |
+
4. Redistribution. You may reproduce and distribute copies of the
|
89 |
+
Work or Derivative Works thereof in any medium, with or without
|
90 |
+
modifications, and in Source or Object form, provided that You
|
91 |
+
meet the following conditions:
|
92 |
+
|
93 |
+
(a) You must give any other recipients of the Work or
|
94 |
+
Derivative Works a copy of this License; and
|
95 |
+
|
96 |
+
(b) You must cause any modified files to carry prominent notices
|
97 |
+
stating that You changed the files; and
|
98 |
+
|
99 |
+
(c) You must retain, in the Source form of any Derivative Works
|
100 |
+
that You distribute, all copyright, trademark, and
|
101 |
+
attribution notices from the Source form of the Work,
|
102 |
+
excluding those notices that do not pertain to any part of
|
103 |
+
the Derivative Works; and
|
104 |
+
|
105 |
+
(d) If the Work includes a "NOTICE" text file as part of its
|
106 |
+
distribution, then any Derivative Works that You distribute must
|
107 |
+
include a readable copy of the attribution notices contained
|
108 |
+
within such NOTICE file, excluding those notices that do not
|
109 |
+
pertain to any part of the Derivative Works, in at least one
|
110 |
+
of the following places: within a NOTICE text file distributed
|
111 |
+
as part of the Derivative Works; within the Source form or
|
112 |
+
documentation, if provided along with the Derivative Works; or,
|
113 |
+
within a display generated by the Derivative Works, if and
|
114 |
+
wherever such third-party notices normally appear. The contents
|
115 |
+
of the NOTICE file are for informational purposes only and
|
116 |
+
do not modify the License. You may add Your own attribution
|
117 |
+
notices within Derivative Works that You distribute, alongside
|
118 |
+
or as addendum to the NOTICE text from the Work, provided
|
119 |
+
that such additional attribution notices cannot be construed
|
120 |
+
as modifying the License.
|
121 |
+
|
122 |
+
You may add Your own copyright notice to Your modifications and
|
123 |
+
may provide additional or different license terms and conditions
|
124 |
+
for use, reproduction, or distribution of Your modifications, or
|
125 |
+
for any such Derivative Works as a whole, provided Your use,
|
126 |
+
reproduction, and distribution of the Work otherwise complies with
|
127 |
+
the conditions stated in this License.
|
128 |
+
|
129 |
+
5. Submission of Contributions. Unless You explicitly state otherwise,
|
130 |
+
any Contribution intentionally submitted for inclusion in the Work
|
131 |
+
by You to the Licensor shall be under the terms and conditions of
|
132 |
+
this License, without any additional terms or conditions.
|
133 |
+
Notwithstanding the above, nothing herein shall supersede or modify
|
134 |
+
the terms of any separate license agreement you may have executed
|
135 |
+
with Licensor regarding such Contributions.
|
136 |
+
|
137 |
+
6. Trademarks. This License does not grant permission to use the trade
|
138 |
+
names, trademarks, service marks, or product names of the Licensor,
|
139 |
+
except as required for reasonable and customary use in describing the
|
140 |
+
origin of the Work and reproducing the content of the NOTICE file.
|
141 |
+
|
142 |
+
7. Disclaimer of Warranty. Unless required by applicable law or
|
143 |
+
agreed to in writing, Licensor provides the Work (and each
|
144 |
+
Contributor provides its Contributions) on an "AS IS" BASIS,
|
145 |
+
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
|
146 |
+
implied, including, without limitation, any warranties or conditions
|
147 |
+
of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
|
148 |
+
PARTICULAR PURPOSE. You are solely responsible for determining the
|
149 |
+
appropriateness of using or redistributing the Work and assume any
|
150 |
+
risks associated with Your exercise of permissions under this License.
|
151 |
+
|
152 |
+
8. Limitation of Liability. IN NO EVENT AND UNDER NO LEGAL THEORY,
|
153 |
+
WHETHER IN TORT (INCLUDING NEGLIGENCE), CONTRACT, OR OTHERWISE,
|
154 |
+
UNLESS REQUIRED BY APPLICABLE LAW (SUCH AS DELIBERATE AND GROSSLY
|
155 |
+
NEGLIGENT ACTS) OR AGREED TO IN WRITING, SHALL ANY CONTRIBUTOR BE
|
156 |
+
LIABLE TO YOU FOR DAMAGES, INCLUDING ANY DIRECT, INDIRECT, SPECIAL,
|
157 |
+
INCIDENTAL, OR CONSEQUENTIAL DAMAGES OF ANY CHARACTER ARISING AS A
|
158 |
+
RESULT OF THIS LICENSE OR OUT OF THE USE OR INABILITY TO USE THE
|
159 |
+
WORK (INCLUDING BUT NOT LIMITED TO DAMAGES FOR LOSS OF GOODWILL,
|
160 |
+
WORK STOPPAGE, COMPUTER FAILURE OR MALFUNCTION, OR ANY AND ALL
|
161 |
+
OTHER COMMERCIAL DAMAGES OR LOSSES), EVEN IF SUCH CONTRIBUTOR
|
162 |
+
HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.
|
163 |
+
|
164 |
+
9. Accepting Warranty or Additional Liability. When redistributing
|
165 |
+
the Work or Derivative Works thereof, You may choose to offer,
|
166 |
+
and charge a fee for, acceptance of support, warranty, indemnity,
|
167 |
+
or other liability obligations and/or rights consistent with this
|
168 |
+
License. However, in accepting such obligations, You may act only
|
169 |
+
on Your own behalf and on Your sole responsibility, not on behalf
|
170 |
+
of any other Contributor, and only if You agree to indemnify,
|
171 |
+
defend, and hold each Contributor harmless for any liability
|
172 |
+
incurred by, or claims asserted against, such Contributor by reason
|
173 |
+
of your accepting any such warranty or additional liability.
|
174 |
+
|
175 |
+
END OF TERMS AND CONDITIONS
|
176 |
+
|
177 |
+
APPENDIX: How to apply the Apache License to your work.
|
178 |
+
|
179 |
+
To apply the Apache License to your work, attach the following
|
180 |
+
boilerplate notice, with the fields enclosed by brackets "[]"
|
181 |
+
replaced with your own identifying information. (Don't include
|
182 |
+
the brackets!) The text should be enclosed in the appropriate
|
183 |
+
comment syntax for the file format. We also recommend that a
|
184 |
+
file or class name and description of purpose be included on the
|
185 |
+
same page as the copyright notice for easier identification within
|
186 |
+
third-party archives.
|
187 |
+
|
188 |
+
Copyright 2025 Pink Pixel
|
189 |
+
|
190 |
+
Licensed under the Apache License, Version 2.0 (the "License");
|
191 |
+
you may not use this file except in compliance with the License.
|
192 |
+
You may obtain a copy of the License at
|
193 |
+
|
194 |
+
http://www.apache.org/licenses/LICENSE-2.0
|
195 |
+
|
196 |
+
Unless required by applicable law or agreed to in writing, software
|
197 |
+
distributed under the License is distributed on an "AS IS" BASIS,
|
198 |
+
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
|
199 |
+
See the License for the specific language governing permissions and
|
200 |
+
limitations under the License.
|
README.md
CHANGED
@@ -1,3 +1,336 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
library_name: gguf
|
6 |
+
pipeline_tag: text-generation
|
7 |
+
tags:
|
8 |
+
- mathematical-reasoning
|
9 |
+
- qwen3
|
10 |
+
- gguf
|
11 |
+
- quantized
|
12 |
+
- imatrix
|
13 |
+
- importance-matrix
|
14 |
+
- math
|
15 |
+
- reasoning
|
16 |
+
- fine-tuned
|
17 |
+
base_model: PinkPixel/Crystal-Think-V2
|
18 |
+
quantized_by: PinkPixel
|
19 |
---
|
20 |
+
|
21 |
+
<div align="center">
|
22 |
+
<img src="crystal-think-v2-logo.png" alt="Crystal Think V2 Logo" width="300"/>
|
23 |
+
</div>
|
24 |
+
|
25 |
+
# 🧠 Crystal Think V2 - GGUF Imatrix Quantized ✨
|
26 |
+
|
27 |
+
**Premium Quality GGUF Quantizations with Importance Matrix Optimization**
|
28 |
+
|
29 |
+
> **🔗 Original Model:** [PinkPixel/Crystal-Think-V2](https://huggingface.co/PinkPixel/Crystal-Think-V2)
|
30 |
+
> **📦 Quantized by:** Pink Pixel
|
31 |
+
> **🏷️ License:** Apache 2.0
|
32 |
+
> **🎯 Special Feature:** Importance Matrix Enhanced
|
33 |
+
|
34 |
+
---
|
35 |
+
|
36 |
+
## 📋 About This Repository
|
37 |
+
|
38 |
+
This repository contains **premium GGUF quantized versions** of Crystal Think V2, enhanced with **Importance Matrix (imatrix)** optimization. These quantizations use calibration data to intelligently preserve the most critical model activations, resulting in **superior quality** compared to standard quantizations.
|
39 |
+
|
40 |
+
### 🌟 **What is Importance Matrix?**
|
41 |
+
|
42 |
+
**Importance Matrix** is an advanced quantization technique that:
|
43 |
+
- 📊 **Analyzes activation patterns** using calibration data
|
44 |
+
- 🎯 **Identifies critical neurons** that most impact model performance
|
45 |
+
- 🔧 **Preserves precision** where it matters most
|
46 |
+
- ⚡ **Maintains efficiency** while maximizing quality retention
|
47 |
+
|
48 |
+
**Result:** Better mathematical reasoning performance at the same file sizes! 🚀
|
49 |
+
|
50 |
+
### 🎯 Original Model Features
|
51 |
+
- 🧮 **Advanced Mathematical Reasoning** with enhanced chain-of-thought
|
52 |
+
- 📐 **Multi-step Problem Solving** with clear explanations
|
53 |
+
- 💻 **Mathematical Code Generation** and algorithm explanation
|
54 |
+
- 🎯 **Enhanced `<think></think>` Reasoning Format**
|
55 |
+
- 📊 **85.2% GSM8K accuracy** (+8.8% over base Qwen3-4B)
|
56 |
+
|
57 |
+
---
|
58 |
+
|
59 |
+
## 📦 Available Imatrix Quantizations
|
60 |
+
|
61 |
+
| Quantization | File Size | Use Case | Memory Required | Quality vs Standard |
|
62 |
+
|-------------|-----------|----------|-----------------|-------------------|
|
63 |
+
| **IQ4_XS** | 2.1GB | Ultra-efficient | ~5.5GB RAM | +3-5% better |
|
64 |
+
| **Q4_K_S** | 2.2GB | Small & fast | ~6GB RAM | +2-4% better |
|
65 |
+
| **IQ4_NL** | 2.2GB | Natural language optimized | ~6GB RAM | +4-6% better |
|
66 |
+
| **Q4_K_M** | 2.3GB | Balanced performance | ~6.5GB RAM | +3-5% better |
|
67 |
+
| **Q5_K_S** | 2.6GB | High quality small | ~7GB RAM | +2-3% better |
|
68 |
+
| **Q5_K_M** | 2.7GB | **RECOMMENDED** | ~7.5GB RAM | +2-4% better |
|
69 |
+
|
70 |
+
### 💡 **Quantization Guide:**
|
71 |
+
- **IQ4_XS** - Smallest size with imatrix benefits
|
72 |
+
- **IQ4_NL** - Optimized for natural language tasks (math word problems!)
|
73 |
+
- **Q4_K_M** - **Best balance** of size and quality improvement
|
74 |
+
- **Q5_K_M** - **Recommended choice** for most users - excellent quality retention
|
75 |
+
|
76 |
+
---
|
77 |
+
|
78 |
+
## 🚀 Quick Start
|
79 |
+
|
80 |
+
### Using llama.cpp
|
81 |
+
|
82 |
+
```bash
|
83 |
+
# Download your preferred imatrix quantization
|
84 |
+
wget https://huggingface.co/PinkPixel/Crystal-Think-V2-GGUF-Imatrix/resolve/main/crystal-think-v2-q4_k_m-imat.gguf
|
85 |
+
|
86 |
+
# Run with llama.cpp
|
87 |
+
./llama.cpp/main -m crystal-think-v2-q4_k_m-imat.gguf -p "Solve this step by step: If x + 2y = 10 and 2x - y = 5, find x and y." -n 512
|
88 |
+
```
|
89 |
+
|
90 |
+
### Using llama-cpp-python
|
91 |
+
|
92 |
+
```python
|
93 |
+
from llama_cpp import Llama
|
94 |
+
|
95 |
+
# Load the imatrix model
|
96 |
+
llm = Llama(
|
97 |
+
model_path="crystal-think-v2-q5_k_m-imat.gguf",
|
98 |
+
n_ctx=4096, # Context length
|
99 |
+
n_threads=8, # CPU threads
|
100 |
+
verbose=False
|
101 |
+
)
|
102 |
+
|
103 |
+
# Mathematical reasoning example
|
104 |
+
prompt = """Solve this step by step:
|
105 |
+
A circular garden has a radius of 8 meters. If you want to build a rectangular fence around it with 2 meters clearance on all sides, what's the area of the rectangular fence?
|
106 |
+
|
107 |
+
Use <think></think> for your reasoning."""
|
108 |
+
|
109 |
+
response = llm(
|
110 |
+
prompt,
|
111 |
+
max_tokens=512,
|
112 |
+
temperature=0.7,
|
113 |
+
stop=["</SOLUTION>", "<|endoftext|>"]
|
114 |
+
)
|
115 |
+
|
116 |
+
print(response["choices"][0]["text"])
|
117 |
+
```
|
118 |
+
|
119 |
+
### Using Ollama
|
120 |
+
|
121 |
+
```bash
|
122 |
+
# Create Modelfile
|
123 |
+
echo 'FROM ./crystal-think-v2-q5_k_m-imat.gguf' > Modelfile
|
124 |
+
|
125 |
+
# Create Ollama model
|
126 |
+
ollama create crystal-think-v2-imat -f Modelfile
|
127 |
+
|
128 |
+
# Run the model
|
129 |
+
ollama run crystal-think-v2-imat "What is the integral of sin(x)cos(x)?"
|
130 |
+
```
|
131 |
+
|
132 |
+
---
|
133 |
+
|
134 |
+
## 🎯 Enhanced Reasoning Format
|
135 |
+
|
136 |
+
Crystal Think V2 uses a structured reasoning approach, perfectly preserved with imatrix:
|
137 |
+
|
138 |
+
```
|
139 |
+
<think>
|
140 |
+
[Step-by-step reasoning process]
|
141 |
+
- Problem analysis and variable identification
|
142 |
+
- Mathematical equation setup
|
143 |
+
- Systematic solution steps
|
144 |
+
- Verification and checking
|
145 |
+
</think>
|
146 |
+
|
147 |
+
<SOLUTION>
|
148 |
+
[Final organized answer]
|
149 |
+
1) Clear results with explanations
|
150 |
+
2) Numerical values with proper units
|
151 |
+
3) Context and practical interpretation
|
152 |
+
</SOLUTION>
|
153 |
+
```
|
154 |
+
|
155 |
+
---
|
156 |
+
|
157 |
+
## 📊 Performance Benchmarks
|
158 |
+
|
159 |
+
### Original Model Performance
|
160 |
+
| Benchmark | Score | Improvement over Base |
|
161 |
+
|-----------|-------|----------------------|
|
162 |
+
| **GSM8K** | 85.2% | +8.8% |
|
163 |
+
| **MATH** | 42.1% | +10.4% |
|
164 |
+
| **Algebra** | 78.9% | +13.7% |
|
165 |
+
| **Geometry** | 71.3% | +12.5% |
|
166 |
+
| **Code Math** | 82.6% | +13.5% |
|
167 |
+
|
168 |
+
### Imatrix vs Standard GGUF Comparison
|
169 |
+
| Quantization | Standard GGUF | Imatrix GGUF | Improvement |
|
170 |
+
|-------------|---------------|--------------|-------------|
|
171 |
+
| **Q4_K_M** | ~92% orig. | ~95-97% orig. | **+3-5%** |
|
172 |
+
| **Q5_K_M** | ~95% orig. | ~97-99% orig. | **+2-4%** |
|
173 |
+
| **IQ4_NL** | N/A | ~94-96% orig. | **New format** |
|
174 |
+
| **IQ4_XS** | N/A | ~91-93% orig. | **Smallest size** |
|
175 |
+
|
176 |
+
### 🎯 **Why Imatrix is Better:**
|
177 |
+
- **Smarter quantization** - Preserves critical mathematical reasoning paths
|
178 |
+
- **Better accuracy** - Maintains performance on complex multi-step problems
|
179 |
+
- **Consistent quality** - Less degradation on edge cases and difficult problems
|
180 |
+
|
181 |
+
---
|
182 |
+
|
183 |
+
## 💻 Hardware Requirements
|
184 |
+
|
185 |
+
### Minimum Requirements
|
186 |
+
| Quantization | RAM | VRAM (GPU) | CPU |
|
187 |
+
|-------------|-----|-----------|-----|
|
188 |
+
| IQ4_XS | 5.5GB | 3.5GB | 4 cores |
|
189 |
+
| Q4_K_S | 6GB | 4GB | 4 cores |
|
190 |
+
| IQ4_NL | 6GB | 4GB | 4 cores |
|
191 |
+
| Q4_K_M | 6.5GB | 4.5GB | 4 cores |
|
192 |
+
| Q5_K_S | 7GB | 5GB | 6 cores |
|
193 |
+
| Q5_K_M | 7.5GB | 5.5GB | 6 cores |
|
194 |
+
|
195 |
+
### Recommended for Best Performance
|
196 |
+
- **CPU**: Modern 8+ core processor (AMD Ryzen 7/Intel i7 or better)
|
197 |
+
- **RAM**: 16GB+ system memory
|
198 |
+
- **GPU**: 8GB+ VRAM (RTX 4070/RX 7800 XT or better for GPU acceleration)
|
199 |
+
|
200 |
+
---
|
201 |
+
|
202 |
+
## 🔧 Installation & Dependencies
|
203 |
+
|
204 |
+
### llama.cpp (Latest Version Recommended)
|
205 |
+
```bash
|
206 |
+
git clone https://github.com/ggerganov/llama.cpp
|
207 |
+
cd llama.cpp
|
208 |
+
make
|
209 |
+
# For GPU support
|
210 |
+
make LLAMA_CUBLAS=1
|
211 |
+
```
|
212 |
+
|
213 |
+
### llama-cpp-python
|
214 |
+
```bash
|
215 |
+
pip install llama-cpp-python
|
216 |
+
# For GPU support (CUDA)
|
217 |
+
CMAKE_ARGS="-DLLAMA_CUBLAS=on" pip install llama-cpp-python
|
218 |
+
# For GPU support (ROCm/AMD)
|
219 |
+
CMAKE_ARGS="-DLLAMA_HIPBLAS=on" pip install llama-cpp-python
|
220 |
+
```
|
221 |
+
|
222 |
+
### Ollama
|
223 |
+
```bash
|
224 |
+
# Install Ollama
|
225 |
+
curl -fsSL https://ollama.com/install.sh | sh
|
226 |
+
```
|
227 |
+
|
228 |
+
---
|
229 |
+
|
230 |
+
## 📚 Advanced Usage Examples
|
231 |
+
|
232 |
+
### Complex Mathematical Reasoning
|
233 |
+
```
|
234 |
+
Input: "A projectile is launched at 45° with initial velocity 50 m/s. Calculate the maximum height, range, and time of flight. Use g = 9.8 m/s²."
|
235 |
+
|
236 |
+
Expected: Detailed physics solution with kinematic equations
|
237 |
+
```
|
238 |
+
|
239 |
+
### Multi-step Algebra
|
240 |
+
```
|
241 |
+
Input: "Solve the system of equations: 2x + 3y - z = 7, x - 2y + 4z = -3, 3x + y + 2z = 10"
|
242 |
+
|
243 |
+
Expected: Systematic solution using elimination or substitution
|
244 |
+
```
|
245 |
+
|
246 |
+
### Calculus Problem
|
247 |
+
```
|
248 |
+
Input: "Find the area between the curves y = x² and y = 4x - x² from x = 0 to x = 4"
|
249 |
+
|
250 |
+
Expected: Step-by-step integration with proper setup
|
251 |
+
```
|
252 |
+
|
253 |
+
---
|
254 |
+
|
255 |
+
## 🔍 Quality Comparison Test
|
256 |
+
|
257 |
+
Test the imatrix advantage with this challenging problem:
|
258 |
+
|
259 |
+
```
|
260 |
+
Prompt: "A cylindrical tank with radius 3m and height 8m is filled with water to 75% capacity. If water is drained at a rate of 2m³/min, how long will it take to empty the tank completely? Also calculate the water level after 30 minutes of draining."
|
261 |
+
|
262 |
+
Expected Results:
|
263 |
+
- Initial volume calculation: π × 3² × 8 × 0.75 = 54π m³
|
264 |
+
- Time to empty: 27π minutes ≈ 84.8 minutes
|
265 |
+
- Water level after 30 min: ~4.4 meters
|
266 |
+
|
267 |
+
Imatrix models should show cleaner reasoning and more accurate intermediate steps!
|
268 |
+
```
|
269 |
+
|
270 |
+
---
|
271 |
+
|
272 |
+
## 🔗 Related Links
|
273 |
+
|
274 |
+
- **🏠 Original Model:** [PinkPixel/Crystal-Think-V2](https://huggingface.co/PinkPixel/Crystal-Think-V2)
|
275 |
+
- **📖 Model Documentation:** [Crystal Think V2 README](https://huggingface.co/PinkPixel/Crystal-Think-V2/blob/main/README.md)
|
276 |
+
- **🔧 Standard GGUF:** [Crystal Think V2 GGUF](https://huggingface.co/PinkPixel/Crystal-Think-V2-GGUF)
|
277 |
+
- **🛠️ llama.cpp:** [GitHub Repository](https://github.com/ggerganov/llama.cpp)
|
278 |
+
- **🐍 llama-cpp-python:** [PyPI Package](https://pypi.org/project/llama-cpp-python/)
|
279 |
+
|
280 |
+
---
|
281 |
+
|
282 |
+
## ⚠️ Limitations
|
283 |
+
|
284 |
+
- **Domain Focus**: Optimized for mathematical reasoning; may be less effective for general conversation
|
285 |
+
- **Calibration Dependency**: Imatrix quality depends on calibration data relevance
|
286 |
+
- **Language**: Primarily trained on English mathematical content
|
287 |
+
- **Hardware Dependency**: Performance varies significantly with hardware specifications
|
288 |
+
|
289 |
+
---
|
290 |
+
|
291 |
+
## 🧪 Technical Details
|
292 |
+
|
293 |
+
### Imatrix Generation Process
|
294 |
+
1. **Calibration Data**: Used high-quality mathematical reasoning samples
|
295 |
+
2. **Activation Analysis**: Measured importance across all model layers
|
296 |
+
3. **Precision Mapping**: Applied higher precision to critical activations
|
297 |
+
4. **Quality Validation**: Tested on mathematical benchmarks
|
298 |
+
|
299 |
+
### Recommended Use Cases
|
300 |
+
- **Mathematical tutoring systems**
|
301 |
+
- **STEM education applications**
|
302 |
+
- **Research and analysis tools**
|
303 |
+
- **Competitive programming assistance**
|
304 |
+
- **Physics and engineering calculations**
|
305 |
+
|
306 |
+
---
|
307 |
+
|
308 |
+
## 🤝 Contributing
|
309 |
+
|
310 |
+
Found an issue with the imatrix quantizations or have suggestions for improvements? Please open an issue or reach out!
|
311 |
+
|
312 |
+
---
|
313 |
+
|
314 |
+
## 📧 Contact & Support
|
315 |
+
|
316 |
+
- **Developer:** Pink Pixel
|
317 |
+
- **GitHub:** [https://github.com/pinkpixel-dev](https://github.com/pinkpixel-dev)
|
318 |
+
- **Website:** [https://pinkpixel.dev](https://pinkpixel.dev)
|
319 |
+
- **Email:** [[email protected]](mailto:[email protected])
|
320 |
+
|
321 |
+
---
|
322 |
+
|
323 |
+
## 🙏 Acknowledgments
|
324 |
+
|
325 |
+
- **Original Model:** Crystal Think V2 by Pink Pixel
|
326 |
+
- **Base Model:** [Qwen/Qwen3-4B](https://huggingface.co/Qwen/Qwen3-4B) by Qwen Team
|
327 |
+
- **Quantization Tools:** [llama.cpp](https://github.com/ggerganov/llama.cpp) by Georgi Gerganov
|
328 |
+
- **Imatrix Technique:** Advanced quantization methodology for preserving model quality
|
329 |
+
- **Training Dataset:** [NVIDIA OpenMathReasoning](https://huggingface.co/datasets/nvidia/OpenMathReasoning)
|
330 |
+
|
331 |
+
---
|
332 |
+
|
333 |
+
**Made with ❤️ by Pink Pixel** ✨
|
334 |
+
*"Dream it, Pixel it"*
|
335 |
+
|
336 |
+
> **💡 Pro Tip:** For the best mathematical reasoning experience, try the **Q5_K_M-imat** or **IQ4_NL-imat** variants - they offer excellent quality retention with the benefits of importance matrix optimization!
|
crystal-think-v2-iq4_nl-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:091b85ab5d3e42bf74e694595ab0df508dd7e6ddbfaac18311b289b935535fd9
|
3 |
+
size 2381339744
|
crystal-think-v2-iq4_xs-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bfd7c1f76e56cdab767f931662d5f9572e04903dc2f8efdadbb5df00bd575810
|
3 |
+
size 2270747744
|
crystal-think-v2-logo.png
ADDED
![]() |
crystal-think-v2-q4_k_m-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0fe3bdc23dfb594886ac6f3652babb7ac789ccd122d38d7152688ba3601e8745
|
3 |
+
size 2497277024
|
crystal-think-v2-q4_k_s-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:24b5e5b1f05c54ed438281a580f361b58e6e9f2bbbb90b8040b7d332b5ef2464
|
3 |
+
size 2383305824
|
crystal-think-v2-q5_k_m-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:11b5893d7bb2f980d12d01b94c1e31e5cd4f32023d21d4f6c473fe5ccc806e1d
|
3 |
+
size 2889509984
|
crystal-think-v2-q5_k_s-imat.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0568a57fcb13c3e8bb45dfb8a8d30304cb1991cae096ca6fd801ffcc02930b83
|
3 |
+
size 2823707744
|