base_model: | |
- TheDrummer/Cydonia-24B-v2 | |
- arcee-ai/Arcee-Blitz | |
- unsloth/Mistral-Small-24B-Base-2501 | |
library_name: transformers | |
tags: | |
- mergekit | |
- merge | |
# merge | |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). | |
## Merge Details | |
### Merge Method | |
This model was merged using the [SCE](https://arxiv.org/abs/2408.07990) merge method using [unsloth/Mistral-Small-24B-Base-2501](https://huggingface.co/unsloth/Mistral-Small-24B-Base-2501) as a base. | |
### Models Merged | |
The following models were included in the merge: | |
* [TheDrummer/Cydonia-24B-v2](https://huggingface.co/TheDrummer/Cydonia-24B-v2) | |
* [arcee-ai/Arcee-Blitz](https://huggingface.co/arcee-ai/Arcee-Blitz) | |
### Configuration | |
The following YAML configuration was used to produce this model: | |
```yaml | |
base_model: unsloth/Mistral-Small-24B-Base-2501 | |
merge_method: sce | |
dype: float32 | |
out_dtype: bfloat16 | |
tokenizer: | |
source: TheDrummer/Cydonia-24B-v2 | |
models: | |
- model: TheDrummer/Cydonia-24B-v2 | |
parameters: | |
select_topk: 0.15 | |
- model: arcee-ai/Arcee-Blitz | |
parameters: | |
select_topk: 0.15 | |
``` | |