Update README.md
Browse files
README.md
CHANGED
@@ -31,15 +31,6 @@ This model is fine-tuned to engage in productive disagreement, overthinking, and
|
|
31 |
- **License**: Apache 2.0
|
32 |
- **Finetuning Approach**: Instruction tuning with friction-based reasoning examples
|
33 |
|
34 |
-
### Training Data
|
35 |
-
|
36 |
-
The model was trained on a combination of three datasets:
|
37 |
-
1. `leonvanbokhorst/friction-disagreement-v2` (8.5% weight)
|
38 |
-
- Examples of productive disagreement and challenging assumptions
|
39 |
-
2. `leonvanbokhorst/friction-overthinking-v2` (9.5% weight)
|
40 |
-
- Examples of deep analytical thinking and self-reflection
|
41 |
-
3. `leonvanbokhorst/reluctance-v6.1` (82% weight)
|
42 |
-
- Examples of hesitation and careful consideration
|
43 |
|
44 |
### Training Procedure
|
45 |
|
@@ -54,12 +45,6 @@ The model was trained on a combination of three datasets:
|
|
54 |
- **Gradient Clipping**: 0.5
|
55 |
- **Mixed Precision**: bfloat16
|
56 |
|
57 |
-
### Performance Metrics
|
58 |
-
|
59 |
-
- **Training Loss**: 1.437 (final)
|
60 |
-
- **Best Validation Loss**: 1.527 (epoch 3.57)
|
61 |
-
- **Memory Usage**: 3.813 GB for training (15.9% of GPU memory)
|
62 |
-
|
63 |
## Intended Use
|
64 |
|
65 |
This model is designed for:
|
|
|
31 |
- **License**: Apache 2.0
|
32 |
- **Finetuning Approach**: Instruction tuning with friction-based reasoning examples
|
33 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
34 |
|
35 |
### Training Procedure
|
36 |
|
|
|
45 |
- **Gradient Clipping**: 0.5
|
46 |
- **Mixed Precision**: bfloat16
|
47 |
|
|
|
|
|
|
|
|
|
|
|
|
|
48 |
## Intended Use
|
49 |
|
50 |
This model is designed for:
|