ByteDance-Seed/Seed-OSS-36B-Instruct • Text Generation • 36B • Updated about 19 hours ago • 2.15k • 234
unsloth/Qwen3-Coder-30B-A3B-Instruct-1M-GGUF • Text Generation • 0.0B • Updated 18 days ago • 56.9k • 94
MMLU Pro benchmark for GGUFs (1 shot) • Collection • "Not all quantized model perform good". Serving frameworks: Ollama runs on an NVIDIA GPU; llama.cpp runs on CPU with AVX & AMX • 13 items • Updated 8 days ago • 7