EryriLabs/DeepSeek-R1-Distill-Qwen-YARA-Thinker-7B-GGUF

DeepSeek-R1-Distill-Qwen-YARA-Thinker-7B

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the SLERP merge method.
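
To make the SLERP step concrete, here is a minimal pure-Python sketch of spherical linear interpolation between two flattened weight vectors. It is an illustration only, not mergekit's actual implementation: the function name `slerp`, the `eps` threshold, and the fallback to plain linear interpolation for near-parallel vectors are assumptions made for this example.

```python
import math
from typing import List, Sequence


def slerp(t: float, v0: Sequence[float], v1: Sequence[float],
          eps: float = 1e-8) -> List[float]:
    """Spherically interpolate between v0 (t=0) and v1 (t=1).

    Hypothetical helper for illustration; real merges operate on
    full model tensors rather than small Python lists.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))

    # Degenerate case: a zero-length vector has no direction, so fall
    # back to ordinary linear interpolation.
    if norm0 < eps or norm1 < eps:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    # Angle between the two vectors, computed from their unit directions.
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    dot = max(-1.0, min(1.0, dot))  # guard against rounding error
    theta = math.acos(dot)
    sin_theta = math.sin(theta)

    # Nearly parallel vectors: the SLERP weights become numerically
    # unstable, so use linear interpolation instead.
    if abs(sin_theta) < eps:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]
```

At `t = 0` the result equals the first vector and at `t = 1` it equals the second. Intermediate values follow the arc between the two directions rather than the straight line that a plain weighted average would give.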

Models Merged

The following models were included in the merge:

- deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
- vtriple/Qwen-2.5-7B-Threatflux

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
  - model: vtriple/Qwen-2.5-7B-Threatflux
merge_method: slerp
base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
dtype: bfloat16
parameters:
  t: [0, 0.5, 0.25]
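
The list given for `t` can be read as a gradient over layer depth. The sketch below assumes the values are anchor points spaced evenly from the first layer to the last, with linear interpolation in between. That is my reading of how mergekit treats list-valued parameters, not something this card states. The helper names and the layer count of 28 are likewise assumptions made for illustration.

```python
from typing import List, Sequence


def gradient_value(values: Sequence[float], position: float) -> float:
    """Linearly interpolate a list of anchor values at position in [0, 1].

    Hypothetical helper: anchors are assumed to be evenly spaced.
    """
    if len(values) == 1:
        return float(values[0])
    scaled = position * (len(values) - 1)
    # Clamp so that position == 1.0 uses the last pair of anchors.
    lo = min(int(scaled), len(values) - 2)
    frac = scaled - lo
    return (1 - frac) * values[lo] + frac * values[lo + 1]


def per_layer_t(values: Sequence[float], num_layers: int) -> List[float]:
    """Return one interpolation factor per layer, first layer to last."""
    if num_layers == 1:
        return [gradient_value(values, 0.0)]
    return [gradient_value(values, i / (num_layers - 1))
            for i in range(num_layers)]


# Assumed layer count, for illustration only.
layer_t = per_layer_t([0, 0.5, 0.25], num_layers=28)
```

Under these assumptions, and taking `t = 0` to mean the base model's weights, the earliest layers stay at deepseek-ai/DeepSeek-R1-Distill-Qwen-7B, the middle layers are an even blend of the two models, and the last layers take a quarter of the way toward vtriple/Qwen-2.5-7B-Threatflux.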


Format: GGUF
Model size: 7.62B params
Architecture: qwen2

