abdullahalzubaer
/

abdullah-Qwen-Qwen2.5-0.5B-Instruct-merged-two-same-model-slerp

Qwen/Qwen2.5-0.5B-Instruct

Model card Files Files and versions Community

abdullahalzubaer commited on Jan 17

Commit

a83c755

·

verified ·

1 Parent(s): 1a0658c

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -15,6 +15,9 @@ abdullah-Qwen-Qwen2.5-0.5B-Instruct-merged-two-same-model-slerp is a merge of th
 ## 🧩 Configuration
 ```yaml
 slices:
   - sources:
@@ -77,6 +80,12 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ```
 ## Citation

 ## 🧩 Configuration
+Note: You nedd to change the layer range based on `number of layers` in the model. For `Qwen/Qwen2.5-0.5B-Instruct` number of layer is 24. The notebook used (see below in reference section
+had `layer_range` `[0, 32]`, this is not true for the qwen model I am using here)
 ```yaml
 slices:
   - sources:
 ```
+## Reference:
+Used this https://huggingface.co/blog/mlabonne/merge-models for creating the merged model
 ## Citation