--- base_model: - huihui-ai/Llama-3.3-70B-Instruct-abliterated - huihui-ai/DeepSeek-R1-Distill-Llama-70B-abliterated - nbeerbower/Llama-3.1-Nemotron-lorablated-70B - mlabonne/Hermes-3-Llama-3.1-70B-lorablated library_name: transformers tags: - mergekit - merge --- # merge This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [huihui-ai/DeepSeek-R1-Distill-Llama-70B-abliterated](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Llama-70B-abliterated) as a base. ### Models Merged The following models were included in the merge: * [huihui-ai/Llama-3.3-70B-Instruct-abliterated](https://huggingface.co/huihui-ai/Llama-3.3-70B-Instruct-abliterated) * [nbeerbower/Llama-3.1-Nemotron-lorablated-70B](https://huggingface.co/nbeerbower/Llama-3.1-Nemotron-lorablated-70B) * [mlabonne/Hermes-3-Llama-3.1-70B-lorablated](https://huggingface.co/mlabonne/Hermes-3-Llama-3.1-70B-lorablated) ### Configuration The following YAML configuration was used to produce this model: ```yaml models: - model: mlabonne/Hermes-3-Llama-3.1-70B-lorablated - model: nbeerbower/Llama-3.1-Nemotron-lorablated-70B - model: huihui-ai/DeepSeek-R1-Distill-Llama-70B-abliterated - model: huihui-ai/Llama-3.3-70B-Instruct-abliterated base_model: huihui-ai/DeepSeek-R1-Distill-Llama-70B-abliterated merge_method: model_stock parameters: int8_mask: true dtype: float32 out_dtype: bfloat16 chat_template: llama3 tokenizer: source: nbeerbower/Llama-3.1-Nemotron-lorablated-70B pad_to_multiple_of: 8 ```