jiangchengchengNLP committed on
Commit
b7c45f0
·
verified ·
1 Parent(s): 8a0ea85

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -1
README.md CHANGED
@@ -84,7 +84,7 @@ sampling_params=SamplingParams(
84
  skip_special_tokens=True,
85
  temperature=0.0
86
  )
87
- # For the exact raspberry sample in the paper see
88
  import re
89
  pattn=re.compile("\*\*Final Answer\*\*.*",re.S)
90
 
@@ -148,4 +148,19 @@ print("With budget forcing:")
148
  print(prompt + o[0].outputs[0].text)
149
  ```
150
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
151
  - PEFT 0.14.0
 
84
  skip_special_tokens=True,
85
  temperature=0.0
86
  )
87
+ # For the math sample
88
  import re
89
  pattn=re.compile("\*\*Final Answer\*\*.*",re.S)
90
 
 
148
  print(prompt + o[0].outputs[0].text)
149
  ```
150
 
151
+ ## To merge the LoRA weights into a single model, use the following code
152
+
153
+ ```python
154
+ from peft import PeftModel
155
+ from transformers import AutoModelForCausalLM, AutoTokenizer
156
+ tokenizer = AutoTokenizer.from_pretrained("jiangchengchengNLP/qwen2.5-distill-QWQ")
157
+ base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-Coder-7B-Instruct",device_map='cpu',torch_dtype="bfloat16")
158
+ model = PeftModel.from_pretrained(base_model, "jiangchengchengNLP/qwen2.5-distill-QWQ")
159
+ mergemodel = model.merge_and_unload()
160
+ mergemodel.save_pretrained("./merge_model")
161
+ tokenizer.save_pretrained("./merge_model")
162
+ print("model have merged!")
163
+ ```
164
+
165
+
166
  - PEFT 0.14.0