IntelligentEstate/Gambit-7B-Q4_K_M-GGUF

This model was converted to GGUF format from nvidia/AceReason-Nemotron-7B.

Use with llama.cpp

Install llama.cpp through Homebrew (works on macOS and Linux):

brew install llama.cpp

Invoke the llama.cpp server or the CLI.
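As a minimal sketch of both invocations, the snippet below wraps the CLI and the server in two small shell functions. It assumes a recent llama.cpp build whose `llama-cli` and `llama-server` binaries accept the `-hf` option for pulling a GGUF file directly from a Hugging Face repo. The function names `run_cli` and `run_server`, the port, and the context size are illustrative choices, not part of the original card.

```shell
# Repo name taken from the title of this card.
REPO="IntelligentEstate/Gambit-7B-Q4_K_M-GGUF"

# One-off text generation from the CLI.
# Assumption: the installed llama-cli supports -hf, which downloads
# and caches the GGUF file on first use.
run_cli() {
  llama-cli -hf "$REPO" -p "$1"
}

# Start the HTTP server with the same model.
# Port 8080 and a 2048-token context are arbitrary example values.
run_server() {
  llama-server -hf "$REPO" --port 8080 -c 2048
}
```

For example, `run_cli "Explain what a GGUF file is."` would run a single prompt, while `run_server` keeps a local server running until it is interrupted. If the build in use does not support `-hf`, the model file has to be downloaded separately and passed with `-m` instead.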

GGUF details

Model size: 7.62B params
Architecture: qwen2
Quantization: 4-bit (Q4_K_M)

