wisent-ai
/

qwen2.5-coder-7b-wisent-caa

@@ -15,6 +15,19 @@ datasets:
 metrics:
 - pass@1
 base_model: Qwen/Qwen2.5-Coder-7B-Instruct
 ---
 # Wisent-Qwen2.5-Coder-7B-Instruct with CAA Steering
@@ -26,7 +39,7 @@ This is an enhanced version of Qwen2.5-Coder-7B-Instruct that integrates **Contr
 ### Key Features
 - 🚀 **Automatic CAA Steering**: No manual hook management required
-- 🎯 **Optimized Parameters**: Layer 24, α=0.9
 - 🗂️ **Trait-Based Organization**: Steering vectors organized by traits
 - 🔧 **Runtime Configurable**: Adjust or disable steering on the fly
 - 🤗 **HuggingFace Compatible**: Works with standard transformers API
@@ -131,7 +144,7 @@ To switch traits, simply update the configuration:
 - **Steering Method**: Contrastive Activation Addition (CAA)
 - **Optimal Layer**: 24 (out of 28 transformer layers)
-- **Steering Strength (α)**: 0.9
 - **Vector Format**: Safetensors format for efficient loading and HuggingFace compatibility
 - **Vector Dimension**: 3584 (pre-normalized during training)
 - **Storage Path**: `./vectors/mbpp_plus/steering_vector.safetensors`
@@ -151,7 +164,7 @@ The CAA parameters were optimized using:
 - **Framework**: Optuna with TPE sampler
 - **Search Space**: Layers 15-28, α ∈ [0.1, 5.0]
 - **Objective**: Maximize accuracy on MBPP Plus validation set
-- **Validation Results**: Optimized for improved performance on MBPP Plus tasks
 ## Model Architecture

 metrics:
 - pass@1
 base_model: Qwen/Qwen2.5-Coder-7B-Instruct
+model-index:
+- name: wisent-ai/qwen2.5-coder-7b-wisent-caa
+  results:
+  - task:
+      type: code-generation
+      name: Code Generation
+    dataset:
+      type: mbppplus
+      name: MBPP Plus
+    metrics:
+    - type: pass@1
+      value: 0.521
+      name: Pass@1
 ---
 # Wisent-Qwen2.5-Coder-7B-Instruct with CAA Steering
 ### Key Features
 - 🚀 **Automatic CAA Steering**: No manual hook management required
+- 🎯 **Optimized Parameters**: Layer 24, α=1.4
 - 🗂️ **Trait-Based Organization**: Steering vectors organized by traits
 - 🔧 **Runtime Configurable**: Adjust or disable steering on the fly
 - 🤗 **HuggingFace Compatible**: Works with standard transformers API
 - **Steering Method**: Contrastive Activation Addition (CAA)
 - **Optimal Layer**: 24 (out of 28 transformer layers)
+- **Steering Strength (α)**: 1.4
 - **Vector Format**: Safetensors format for efficient loading and HuggingFace compatibility
 - **Vector Dimension**: 3584 (pre-normalized during training)
 - **Storage Path**: `./vectors/mbpp_plus/steering_vector.safetensors`
 - **Framework**: Optuna with TPE sampler
 - **Search Space**: Layers 15-28, α ∈ [0.1, 5.0]
 - **Objective**: Maximize accuracy on MBPP Plus validation set
+- **Best Performance**: 52.1% accuracy on MBPP Plus (378 problems)
 ## Model Architecture

config.json CHANGED Viewed

@@ -116,7 +116,7 @@
   },
   "caa_enabled": true,
   "caa_layer_id": 24,
-  "caa_alpha": 0.9,
   "steering_method": "caa",
   "wisent_optimization": {
     "best_value": 0.64,

   },
   "caa_enabled": true,
   "caa_layer_id": 24,
+  "caa_alpha": 1.4,
   "steering_method": "caa",
   "wisent_optimization": {
     "best_value": 0.64,