Vortex Collection • ModelCloud-optimized and validated quants that meet strict quality assurance on multiple benchmarks. • 13 items • Updated 2 days ago • 7
ModelCloud/Falcon3-10B-Instruct-gptqmodel-4bit-vortex-v1 Text Generation • Updated 28 days ago • 95 • 3
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v2 Text Generation • Updated about 1 month ago • 822 • 15
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v1 Text Generation • Updated about 1 month ago • 167 • 51
Article • Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick • By cxdu • Oct 24, 2024 • 10
ModelCloud/Qwen2.5-Coder-32B-Instruct-gptqmodel-4bit-vortex-v1 Text Generation • Updated Nov 14, 2024 • 177 • 12
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v2.5 Text Generation • Updated Nov 11, 2024 • 659 • 5
ModelCloud/Llama-3.2-3B-Instruct-gptqmodel-4bit-vortex-v3 Text Generation • Updated Nov 11, 2024 • 141 • 5
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v2 Text Generation • Updated Nov 11, 2024 • 7 • 3
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v1 Text Generation • Updated Nov 11, 2024 • 135 • 2