Update README.md
Browse files
README.md
CHANGED
|
@@ -53,8 +53,8 @@ It achieves the following results on the evaluation set:
|
|
| 53 |
## Model description
|
| 54 |
|
| 55 |
This model uses a shared `XLM-RoBERTa` base to encode input text. The resulting text representation is then fed into two separate, independent classification layers (heads):
|
| 56 |
-
* A **Sentiment Head** with 3 outputs for `positive`, `neutral`, and `negative` classes.
|
| 57 |
-
* A **Multi-Label Head** with 41 outputs, which are decoded to predict the presence or absence of 37 different disaster-related categories.
|
| 58 |
|
| 59 |
This dual-head architecture allows for a nuanced understanding of a message, capturing both its emotional content and its specific, actionable information.
|
| 60 |
|
|
|
|
| 53 |
## Model description
|
| 54 |
|
| 55 |
This model uses a shared `XLM-RoBERTa` base to encode input text. The resulting text representation is then fed into two separate, independent classification layers (heads):
|
| 56 |
+
* A **Sentiment Head (Frozen from pre-trained model)** with 3 outputs for `positive`, `neutral`, and `negative` classes.
|
| 57 |
+
* A **Multi-Label Head (Newly created and fine-tuned)** with 41 outputs, which are decoded to predict the presence or absence of 37 different disaster-related categories.
|
| 58 |
|
| 59 |
This dual-head architecture allows for a nuanced understanding of a message, capturing both its emotional content and its specific, actionable information.
|
| 60 |
|