YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Combined Task Vector Model

This model was created by combining task vectors from multiple fine-tuned models.

Task Vector Computation

t_1 = TaskVector("Qwen/Qwen2.5-1.5B-Instruct", "coastalcph/Qwen2.5-1.5B-Instruct-gcd_sycophancy")
t_2 = TaskVector("Qwen/Qwen2.5-1.5B-Instruct", "coastalcph/Qwen2.5-1.5B-Instruct-personality-non-sycophancy")
t_combined = 1.0 * t_1 + 1.0 * t_2 - 1.0 * t_3
new_model = t_combined.apply_to("Qwen/Qwen2.5-1.5B-Instruct", scaling_coef=1.0)

Models Used

Technical Details

  • Creation Script Git Hash: d0db42d73be516ec04f0ecdc8003189e98b5f722
  • Task Vector Method: Additive combination
  • Args: { "pretrained_model": "Qwen/Qwen2.5-1.5B-Instruct", "finetuned_model1": "coastalcph/Qwen2.5-1.5B-Instruct-gcd_sycophancy", "finetuned_model2": "coastalcph/Qwen2.5-1.5B-Instruct-personality-non-sycophancy", "finetuned_model3": "coastalcph/Qwen2.5-1.5B-Instruct-personality-sycophancy", "output_model_name": "coastalcph/Qwen2.5-1.5B-1t_gcd_sycophanct-1t_diff_sycophant", "output_dir": "/projects/nlp/data/constanzam/weight-interp/task-vectors/math_non_sycophant_12Aug", "scaling_coef": 1.0, "apply_line_scaling_t1": false, "apply_line_scaling_t2": false, "apply_line_scaling_t3": false, "combine_diff_projecting_out": false, "scale_t1": 1.0, "scale_t2": 1.0, "scale_t3": 1.0 }
Downloads last month
-
Safetensors
Model size
1.54B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support