metadata
license: apache-2.0
language:
- en
- multilingual
datasets:
- agentlans/LinguaNova
base_model:
- agentlans/Qwen3-4B-multilingual-sft
tags:
- qwen
- multilingual
- quantized
- llama.cpp
- natural-language-processing
- text-generation
- efficient-inference
- lightweight-model
Qwen 3 4B Multilingual Quantized Models
This repo contains quantized versions of the agentlans/Qwen3-4B-multilingual-sft model, optimized for efficient local use with llama.cpp.
The models were quantized using an unofficial Docker image. Quantization was calibrated on the first 100 rows of the agentlans/LinguaNova dataset to help preserve multilingual performance.
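The exact calibration procedure is not described here. As a rough sketch of the data-preparation step, the snippet below joins the text of the first 100 rows into a single plain-text string, which is the kind of input llama.cpp's calibration tooling usually expects. The function name `build_calibration_text` and the column name `text` are assumptions made for illustration. Check the dataset for its actual field names.

```python
from itertools import islice
from typing import Iterable, Mapping


def build_calibration_text(
    rows: Iterable[Mapping[str, object]],
    text_key: str = "text",  # hypothetical column name, not confirmed for LinguaNova
    limit: int = 100,        # "first 100 rows", as stated above
) -> str:
    """Join the text of the first `limit` rows into one calibration string."""
    texts = []
    for row in islice(rows, limit):
        text = str(row[text_key]).strip()
        if text:  # skip rows whose text is empty after stripping
            texts.append(text)
    # Separate samples with a blank line and end with a newline.
    return "\n\n".join(texts) + "\n"


if __name__ == "__main__":
    # Stand-in rows; in practice these would come from the dataset itself.
    sample_rows = [{"text": "Hello, world."}, {"text": "Bonjour le monde."}]
    print(build_calibration_text(sample_rows), end="")
```

The result can be written to a file and passed to whichever quantization tool is in use. Keeping the row limit as a parameter makes it easy to try a larger calibration set.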
These quantized models share the same strengths and limitations as the original Qwen 3 4B multilingual model. They are smaller and faster at inference, with minor trade-offs in precision.
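For local use with llama.cpp, a minimal sketch of assembling a `llama-cli` invocation is shown below. The GGUF file name is hypothetical, so substitute the name of a file actually downloaded from this repo. The helper `llama_cli_command` is likewise just an illustration, and the token and context sizes are arbitrary defaults rather than recommended settings.

```python
import shlex
from typing import List


def llama_cli_command(
    model_path: str,
    prompt: str,
    n_predict: int = 128,  # maximum number of tokens to generate (-n)
    ctx_size: int = 4096,  # context window size in tokens (-c)
) -> List[str]:
    """Build an argument list for llama.cpp's llama-cli binary."""
    return [
        "llama-cli",
        "-m", model_path,
        "-p", prompt,
        "-n", str(n_predict),
        "-c", str(ctx_size),
    ]


# Hypothetical file name; use the quantized file you downloaded.
cmd = llama_cli_command(
    "Qwen3-4B-multilingual-sft.Q4_K_M.gguf",
    "Translate to French: Good morning.",
)
print(shlex.join(cmd))  # shell-quoted command, ready to copy into a terminal
```

Building the command as a list means it can also be handed directly to `subprocess.run(cmd)` without shell-quoting concerns, provided `llama-cli` is on the `PATH`.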