Update README.md
Browse files
README.md
CHANGED
@@ -7,7 +7,9 @@ tags:
|
|
7 |
- Qwen/Qwen2.5-0.5B-Instruct
|
8 |
---
|
9 |
|
10 |
-
#
|
|
|
|
|
11 |
|
12 |
abdullah-Qwen-Qwen2.5-0.5B-Instruct-merged-two-same-model-slerp is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
|
13 |
* [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct)
|
@@ -84,9 +86,11 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
|
|
84 |
|
85 |
Used this https://huggingface.co/blog/mlabonne/merge-models for creating the merged model
|
86 |
|
|
|
87 |
|
|
|
88 |
|
89 |
-
|
90 |
## Citation
|
91 |
|
92 |
If you find this work helpful, feel free to give the works below a cite and this repo.
|
|
|
7 |
- Qwen/Qwen2.5-0.5B-Instruct
|
8 |
---
|
9 |
|
10 |
+
# Model Merging
|
11 |
+
|
12 |
+
## abdullah-Qwen-Qwen2.5-0.5B-Instruct-merged-two-same-model-slerp
|
13 |
|
14 |
abdullah-Qwen-Qwen2.5-0.5B-Instruct-merged-two-same-model-slerp is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
|
15 |
* [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct)
|
|
|
86 |
|
87 |
Used this https://huggingface.co/blog/mlabonne/merge-models for creating the merged model
|
88 |
|
89 |
+
## Further Reading
|
90 |
|
91 |
+
[An Introduction to Model Merging for LLMs](https://developer.nvidia.com/blog/an-introduction-to-model-merging-for-llms/)
|
92 |
|
93 |
+
[Fine-tuning German LLMs with Model Merging and DPO for Improving Customer Support](https://blog.mayflower.de/17424-fine-tuning-german-llm.html)
|
94 |
## Citation
|
95 |
|
96 |
If you find this work helpful, feel free to give the works below a cite and this repo.
|