Added evaluation metrics
README.md
CHANGED
@@ -19,6 +19,30 @@ Only the weights of the linear operators within `language_model` transformers bl
Model checkpoint is saved in [compressed_tensors](https://github.com/neuralmagic/compressed-tensors) format.
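Based on the repository name (`GPTQ-4b-128g`: 4-bit weights, quantization group size 128), the weight-storage savings can be estimated with simple arithmetic. A back-of-envelope sketch, assuming one fp16 scale and one 4-bit zero-point per group of 128 weights — the exact per-group metadata depends on the compressed_tensors layout, so treat this as an approximation:

```python
# Back-of-envelope storage estimate for GPTQ 4-bit weights, group size 128.
# Assumption: each group of 128 weights shares one fp16 scale and one 4-bit
# zero-point; the exact on-disk layout depends on the compressed_tensors format.
BITS_INT4 = 4
BITS_FP16 = 16
GROUP_SIZE = 128

# Effective bits per weight: the 4-bit value plus amortized per-group metadata.
bits_per_weight = BITS_INT4 + (BITS_FP16 + BITS_INT4) / GROUP_SIZE

# bf16 also stores 16 bits per weight, so the ratio is straightforward.
compression_vs_bf16 = BITS_FP16 / bits_per_weight

print(f"{bits_per_weight:.4f} effective bits/weight")
print(f"~{compression_vs_bf16:.2f}x smaller than bf16 for the quantized layers")
```

Note that only the quantized linear layers shrink by this factor; embeddings and any layers left in higher precision keep their original size.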

## Evaluation

This model was evaluated on the OpenLLM v1 benchmarks. Model outputs were generated with the `vLLM` engine.

| Model | ArcC | GSM8k | Hellaswag | MMLU | TruthfulQA-mc2 | Winogrande | Average | Recovery |
|----------------------------|:------:|:------:|:---------:|:------:|:--------------:|:----------:|:-------:|:--------:|
| Mistral-Small-3.1-24B-Instruct-2503 | 0.7125 | 0.8848 | 0.8576 | 0.8107 | 0.6409 | 0.8398 | 0.7910 | 1.0000 |
| Mistral-Small-3.1-24B-Instruct-2503-INT4 (this) | 0.7073 | 0.8711 | 0.8530 | 0.8062 | 0.6252 | 0.8256 | 0.7814 | 0.9878 |
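In the table, the Average column is the mean of the six per-task scores, and Recovery is the ratio of the quantized model's average to the baseline's. A short Python check using the scores copied from the rows above:

```python
# Per-task scores copied from the table (ArcC, GSM8k, Hellaswag, MMLU,
# TruthfulQA-mc2, Winogrande).
baseline = [0.7125, 0.8848, 0.8576, 0.8107, 0.6409, 0.8398]
quantized = [0.7073, 0.8711, 0.8530, 0.8062, 0.6252, 0.8256]

avg_baseline = sum(baseline) / len(baseline)
avg_quantized = sum(quantized) / len(quantized)

# Recovery: fraction of the full-precision average retained after quantization.
recovery = avg_quantized / avg_baseline

print(f"baseline avg:  {avg_baseline:.4f}")
print(f"quantized avg: {avg_quantized:.4f}")
print(f"recovery:      {recovery:.4f}")
```

The printed values reproduce the Average and Recovery columns to four decimal places.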

## Reproduction

The results were obtained using the following commands:

```bash
MODEL=ISTA-DASLab/Mistral-Small-3.1-24B-Instruct-2503-GPTQ-4b-128g
MODEL_ARGS="pretrained=$MODEL,max_model_len=4096,tensor_parallel_size=1,dtype=auto,gpu_memory_utilization=0.80"

lm_eval \
  --model vllm \
  --model_args $MODEL_ARGS \
  --tasks openllm \
  --batch_size auto
```

## Usage

* To use the model in `transformers`, update the package to a stable release that supports Mistral-3