Testrumoe-2x1.5b-instruct / mergekit_moe_config.yml
ehristoforu's picture
Upload folder using huggingface_hub
90358aa verified
base_model: Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO
gate_mode: hidden
dtype: float16
experts_per_token: 2
experts:
- source_model: Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO
positive_prompts:
- "Hi, how are you?"
- "write an essay on coffee"
- "think about liberalism"
- "pros and cons of living in china"
- "write a creative story"
- "summarize the text"
- "think, reflect step by step"
negative_prompts:
- "help with math/physics homework"
- "solve the math problem"
- "solve a physics problem"
- "calculate the example"
- source_model: nvidia/AceInstruct-1.5B
positive_prompts:
- "help with math/physics homework"
- "solve the math problem"
- "solve a physics problem"
- "calculate the example"
- "prove the theory of relativity"
- "prove the equality of the angles at the base of an isosceles triangle"
negative_prompts:
- "Hi, how are you?"
- "write an essay on coffee"
- "think about liberalism"
- "pros and cons of living in china"
- "write a creative story"
- "summarize the text"
shared_experts:
- source_model: Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO
positive_prompts:
- "chat assistant"
residual_scale: 0.1