Update README.md
README.md
CHANGED
@@ -92,7 +92,7 @@ print("\n==================================\n")
 
 ## Performance
 Unlike the other three benchmarks, which solely evaluate Safety Assessment (i.e., binary safe/unsafe classification), BeaverTails is a multi-class classification benchmark. Its F1 score evaluation extends beyond simple Safety Assessment to measure accuracy across multiple risk categories, providing a more fine-grained assessment of model performance.
-
+
 ## Model Description
 
 - **Model type:** Guardrail model fine-tuned to enhance safety classification with critiques-augmented finetuning.