abdullahalzubaer
/

abdullah-Qwen-Qwen2.5-0.5B-Instruct-merged-two-same-model-slerp

Qwen/Qwen2.5-0.5B-Instruct

Model card Files Files and versions Community

abdullahalzubaer commited on Jan 17

Commit

30cf755

·

verified ·

1 Parent(s): a83c755

Update README.md

Files changed (1) hide show

README.md +6 -2

README.md CHANGED Viewed

@@ -7,7 +7,9 @@ tags:
 - Qwen/Qwen2.5-0.5B-Instruct
 ---
-# abdullah-Qwen-Qwen2.5-0.5B-Instruct-merged-two-same-model-slerp
 abdullah-Qwen-Qwen2.5-0.5B-Instruct-merged-two-same-model-slerp is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
 * [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct)
@@ -84,9 +86,11 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 Used this https://huggingface.co/blog/mlabonne/merge-models for creating the merged model
 ## Citation
 If you find this work helpful, feel free to give the works below a cite and this repo.

 - Qwen/Qwen2.5-0.5B-Instruct
 ---
+# Model Merging
+## abdullah-Qwen-Qwen2.5-0.5B-Instruct-merged-two-same-model-slerp
 abdullah-Qwen-Qwen2.5-0.5B-Instruct-merged-two-same-model-slerp is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
 * [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct)
 Used this https://huggingface.co/blog/mlabonne/merge-models for creating the merged model
+## Further Reading
+[An Introduction to Model Merging for LLMs](https://developer.nvidia.com/blog/an-introduction-to-model-merging-for-llms/)
+[Fine-tuning German LLMs with Model Merging and DPO for Improving Customer Support](https://blog.mayflower.de/17424-fine-tuning-german-llm.html)
 ## Citation
 If you find this work helpful, feel free to give the works below a cite and this repo.