Update README.md
Browse files
README.md
CHANGED
@@ -35,6 +35,13 @@ extra_gated_description: >-
|
|
35 |
pipeline_tag: text2text-generation
|
36 |
---
|
37 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
38 |
# Devstral-Small-2505
|
39 |
|
40 |
Devstral is an agentic LLM for software engineering tasks built under a collaboration between [Mistral AI](https://mistral.ai/) and [All Hands AI](https://www.all-hands.dev/) 🙌. Devstral excels at using tools to explore codebases, editing multiple files and power software engineering agents. The model achieves remarkable performance on SWE-bench which positionates it as the #1 open source model on this [benchmark](#benchmark-results).
|
|
|
35 |
pipeline_tag: text2text-generation
|
36 |
---
|
37 |
|
38 |
+
Quantized to FP8 with [LLMCompressor](https://github.com/vllm-project/llm-compressor)
|
39 |
+
|
40 |
+
Ideal to run on a dual GPU system like 2x3090 with vLLM or SGlang:
|
41 |
+
|
42 |
+
`vllm serve bullerwins/Devstral-Small-2505-fp8 --max-model-len 16000 --host 0.0.0.0 --port 5000 -tp 2 --tokenizer_mode mistral`
|
43 |
+
|
44 |
+
|
45 |
# Devstral-Small-2505
|
46 |
|
47 |
Devstral is an agentic LLM for software engineering tasks built under a collaboration between [Mistral AI](https://mistral.ai/) and [All Hands AI](https://www.all-hands.dev/) 🙌. Devstral excels at using tools to explore codebases, editing multiple files and power software engineering agents. The model achieves remarkable performance on SWE-bench which positionates it as the #1 open source model on this [benchmark](#benchmark-results).
|