--- license: apache-2.0 language: - en base_model: - Menlo/Jan-nano pipeline_tag: text-generation library_name: transformers tags: - text-generation-inference - MCP --- # **Jan-nano-GGUF** > Jan-Nano is a compact 4-billion parameter language model specifically designed and trained for deep research tasks. This model has been optimized to work seamlessly with Model Context Protocol (MCP) servers, enabling efficient integration with various research tools and data sources. ## Model Files | File Name | Size | Format | Description | |-----------|------|--------|-------------| | Jan-nano.F32.gguf | 16.1 GB | F32 | Full precision 32-bit floating point | | Jan-nano.F16.gguf | 8.05 GB | F16 | Half precision 16-bit floating point | | Jan-nano.BF16.gguf | 8.05 GB | BF16 | Brain floating point 16-bit | ## Usage These GGUF format files are optimized for use with llama.cpp and compatible inference engines. Choose the appropriate precision level based on your hardware capabilities and quality requirements: - **F32**: Highest quality, requires most memory - **F16/BF16**: Good balance of quality and memory efficiency ## Configuration The model configuration is available in `config.json`. ## Quants Usage (sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better): ![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)