barius committed (verified) · Commit 2ba33ce · 1 parent: a411e68

Update README.md

Files changed (1): README.md (+5 −0)
@@ -106,6 +106,11 @@ print (f"model answer: {answer_content}")
 ```
 > Note: We have included the system prompt in the tokenizer configuration, as it was used during both the SFT and RL stages. To ensure consistent output quality, we recommend including the same system prompt during actual usage; otherwise, the model's responses may be significantly affected.
 
+### Quantized versions for compact devices
+A series of quantized versions of the [AM-Thinking-v1](https://huggingface.co/a-m-team/AM-Thinking-v1-gguf) model is available.
+For use with [llama.cpp](https://github.com/ggml-org/llama.cpp) and [Ollama](https://github.com/ollama/ollama), see [AM-Thinking-v1-gguf](https://huggingface.co/a-m-team/AM-Thinking-v1-gguf).
+
+
 ## 🔧 Post-training pipeline
 
 To achieve its strong reasoning ability, AM‑Thinking‑v1 goes through a carefully designed post-training pipeline.
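The GGUF builds referenced in the added lines can be run locally. A minimal sketch, assuming a typical quant filename such as `AM-Thinking-v1-Q4_K_M.gguf` — the actual filenames are not stated here, so check the file list in the AM-Thinking-v1-gguf repository:

```shell
# Download one quantized build from the Hugging Face repo
# (the filename below is an assumption; pick a real one from the repo's file list)
huggingface-cli download a-m-team/AM-Thinking-v1-gguf AM-Thinking-v1-Q4_K_M.gguf --local-dir .

# Start an interactive chat with llama.cpp's CLI
./llama-cli -m AM-Thinking-v1-Q4_K_M.gguf -cnv

# Or pull the GGUF repo directly into Ollama
ollama run hf.co/a-m-team/AM-Thinking-v1-gguf
```

Note that both paths require the model weights to be downloaded first, and that llama.cpp applies the chat template (including any bundled system prompt) only in conversation mode, which matters given the README's note about keeping the system prompt.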