GGUF Quantized Model

This is a GGUF quantized version of iben/medical model, optimized for use with llama.cpp.

Model Details

  • Original Model: iben/medical
  • Quantization Type: Q8_0
  • File Size: 7.54 GB
  • Created: 2025-02-01

Usage

This model can be used with llama.cpp or other GGUF-compatible inference engines.

Downloads last month
1
GGUF
Model size
7.62B params
Architecture
qwen2
Hardware compatibility
Log In to view the estimation

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for iben/DrM-4bit

Base model

Qwen/Qwen2.5-7B
Finetuned
iben/medical
Quantized
(1)
this model