Rakancorle1
/

ThinkGuard

Text Classification

text-generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Metrics Training metrics Community

Rakancorle1 commited on Feb 28

Commit

b202736

·

verified ·

1 Parent(s): fa9521e

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -92,7 +92,7 @@ print("\n==================================\n")
 ## Performance
 Unlike the other three benchmarks, which solely evaluate Safety Assessment (i.e., binary safe/unsafe classification), BeaverTails is a multi-class classification benchmark. Its F1 score evaluation extends beyond simple Safety Assessment to measure accuracy across multiple risk categories, providing a more fine-grained assessment of model performance.
-<!-- ![Table-1](./Table-1.png) -->
 ## Model Description
 - **Model type:** Guardrail model fine-tuned to enhance safety classification with critiques-augmented finetuning.

 ## Performance
 Unlike the other three benchmarks, which solely evaluate Safety Assessment (i.e., binary safe/unsafe classification), BeaverTails is a multi-class classification benchmark. Its F1 score evaluation extends beyond simple Safety Assessment to measure accuracy across multiple risk categories, providing a more fine-grained assessment of model performance.
+![Table-1](./Table-1-0227.png)
 ## Model Description
 - **Model type:** Guardrail model fine-tuned to enhance safety classification with critiques-augmented finetuning.