Theta-Crucis-0.6B-Turbo1 is a compact, high-performance model designed for code generation, technical reasoning, and structured output tasks. Fine-tuned from Qwen3-0.6B on the Mixture of Thoughts (MoT) dataset with an emphasis on its code expert clusters, the model delivers fast, accurate coding assistance in low-resource environments. At only 0.6B parameters, it offers strong fluency in programming languages, structured syntax, and technical writing.
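Below is a minimal usage sketch with the standard transformers chat API, as is typical for Qwen3-derived models. The repo id is a placeholder (the exact Hugging Face path is not given above), and generation settings are illustrative only.

```python
# Minimal sketch: loading the model with transformers and generating code.
# The repo id below is a placeholder -- substitute the actual Hugging Face path.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "your-org/Theta-Crucis-0.6B-Turbo1"  # placeholder repo id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id, torch_dtype="auto", device_map="auto"
)

messages = [
    {"role": "user",
     "content": "Write a Python function that checks whether a string is a palindrome."}
]
# Build the prompt with the model's chat template and generate a completion.
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```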
File Name | Size | Format | Description |
---|---|---|---|
Theta-Crucis-0.6B-Turbo1.BF16.gguf | 1.2 GB | GGUF (BF16) | BFloat16 precision model file |
Theta-Crucis-0.6B-Turbo1.F32.gguf | 2.39 GB | GGUF (F32) | Float32 precision model file |
Theta-Crucis-0.6B-Turbo1.Q4_K_M.gguf | 397 MB | GGUF (Q4_K_M) | 4-bit quantized model file |
Theta-Crucis-0.6B-Turbo1.Q5_K_M.gguf | 444 MB | GGUF (Q5_K_M) | 5-bit quantized model file |
Theta-Crucis-0.6B-Turbo1.Q8_0.gguf | 639 MB | GGUF (Q8_0) | 8-bit quantized model file |
config.json | 31 B | JSON | Configuration file |
.gitattributes | 1.88 kB | Text | Git attributes configuration |
(Note: file size does not map directly to quality; IQ-quants are often preferable over similarly sized non-IQ quants.)
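For running the quantized GGUF files locally, here is a minimal sketch using llama-cpp-python (one option among several GGUF runners such as the llama.cpp CLI). The local file path and context size are assumptions; adjust them to the quant you download and your memory budget.

```python
# Minimal sketch: running one of the quantized GGUF files with llama-cpp-python.
# Assumes the Q4_K_M file has been downloaded locally; adjust the path as needed.
from llama_cpp import Llama

llm = Llama(
    model_path="./Theta-Crucis-0.6B-Turbo1.Q4_K_M.gguf",  # local path to the downloaded quant
    n_ctx=4096,  # context window; raise or lower to fit your memory budget
)

response = llm.create_chat_completion(
    messages=[
        {"role": "user",
         "content": "Implement binary search in Python with comments."}
    ],
    max_tokens=256,
)
print(response["choices"][0]["message"]["content"])
```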
For choosing between quant types, ikawrakow has published a handy graph comparing some lower-quality quant types across 4-bit to 32-bit precision (lower is better).
Base model: Qwen/Qwen3-0.6B-Base