---
base_model:
- mistralai/Mistral-Nemo-Base-2407
- migtissera/Tess-3-Mistral-Nemo-12B
- crestf411/nemo-sunfall-v0.6.1
library_name: transformers
tags:
- mergekit
- merge

---
# merged

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the [Task Arithmetic](https://arxiv.org/abs/2212.04089) merge method using [mistralai/Mistral-Nemo-Base-2407](https://huggingface.co/mistralai/Mistral-Nemo-Base-2407) as a base.

### Models Merged

The following models were included in the merge:
* [migtissera/Tess-3-Mistral-Nemo-12B](https://huggingface.co/migtissera/Tess-3-Mistral-Nemo-12B)
* muse-writer
* [crestf411/nemo-sunfall-v0.6.1](https://huggingface.co/crestf411/nemo-sunfall-v0.6.1)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: migtissera/Tess-3-Mistral-Nemo-12B
    parameters:
      weight: 0.07
  # - model: nbeerbower/mistral-nemo-bophades-12B
  #   parameters:
  #     weight: 0.05
  - model: muse-writer
    parameters:
      weight: 0.7
  - model: crestf411/nemo-sunfall-v0.6.1
    parameters:
      weight: 0.3
merge_method: task_arithmetic
base_model: mistralai/Mistral-Nemo-Base-2407
dtype: bfloat16
tokenizer:
  source: muse-writer
```
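
As a rough illustration of what the `task_arithmetic` method computes with the weights above, here is a minimal sketch in plain Python. It is not mergekit's implementation: the `task_arithmetic` function, the single `layer.weight` parameter, and all tensor values are hypothetical toy stand-ins, and the sketch assumes the weights are applied as given, without normalization. Each fine-tuned model contributes its difference from the base (its "task vector"), scaled by its weight, and the scaled differences are added back onto the base.

```python
from typing import Dict, List

Tensor = List[float]  # toy stand-in for a real weight tensor


def task_arithmetic(base: Dict[str, Tensor],
                    finetuned: List[Dict[str, Tensor]],
                    weights: List[float]) -> Dict[str, Tensor]:
    """Return base + sum_i weight_i * (finetuned_i - base), per parameter."""
    if len(finetuned) != len(weights):
        raise ValueError("need exactly one weight per fine-tuned model")
    merged = {}
    for name, base_values in base.items():
        out = list(base_values)  # start from the base parameters
        for model, w in zip(finetuned, weights):
            for i, (ft, b) in enumerate(zip(model[name], base_values)):
                out[i] += w * (ft - b)  # add the scaled task vector
        merged[name] = out
    return merged


# Made-up values for illustration only; real checkpoints hold many tensors.
base = {"layer.weight": [1.0, 2.0]}
tess = {"layer.weight": [2.0, 2.0]}
muse = {"layer.weight": [1.0, 4.0]}
sunfall = {"layer.weight": [0.0, 2.0]}

# Weights mirror the configuration above: 0.07, 0.7, 0.3.
merged = task_arithmetic(base, [tess, muse, sunfall], [0.07, 0.7, 0.3])
print(merged["layer.weight"])
```

In this toy setup, a model that leaves a parameter unchanged relative to the base contributes nothing to it, which is why the base model itself needs no weight in the configuration.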