Update README.md
README.md CHANGED
@@ -51,7 +51,7 @@ This variant is trained only on our synthetic data and RAGTruth dataset for hall
 ## Training Procedure
 
 - Tokenizer: AutoTokenizer; DataCollatorForTokenClassification; label pad −100
-- Max length: 4096; batch size:
+- Max length: 4096; batch size: 16; epochs: 5
 - Optimizer: AdamW (lr 1e‑5, weight_decay 0.01)
 - Hardware: Single A100 80GB
 
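A minimal sketch of what the settings in this hunk imply for batching, using only the standard library. It is not the training code from this repository. The numeric values (max length 4096, batch size 16, 5 epochs, lr 1e-5, weight decay 0.01, label pad -100) are taken from the README lines in the diff. The names `TrainSettings` and `pad_batch`, the `pad_token_id=0` default, and right-side padding are assumptions made for illustration. The function mimics the general behavior of a token-classification collator: pad every example to the longest one in the batch and fill the label padding with -100, which loss functions such as PyTorch's cross-entropy ignore by default.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainSettings:
    """Values copied from the README hunk; field names are hypothetical."""
    max_length: int = 4096
    batch_size: int = 16
    epochs: int = 5
    learning_rate: float = 1e-5
    weight_decay: float = 0.01
    label_pad_id: int = -100


def pad_batch(features, settings=TrainSettings(), pad_token_id=0):
    """Truncate each example to max_length, then right-pad to the batch maximum.

    pad_token_id=0 is an assumption; a real run would read it from the tokenizer.
    Each feature is expected to have input_ids and labels of equal length.
    """
    truncated = [
        {
            "input_ids": f["input_ids"][: settings.max_length],
            "labels": f["labels"][: settings.max_length],
        }
        for f in features
    ]
    longest = max(len(f["input_ids"]) for f in truncated)

    batch = {"input_ids": [], "attention_mask": [], "labels": []}
    for f in truncated:
        n = len(f["input_ids"])
        gap = longest - n
        batch["input_ids"].append(f["input_ids"] + [pad_token_id] * gap)
        # 1 marks real tokens, 0 marks padding.
        batch["attention_mask"].append([1] * n + [0] * gap)
        # Padded label positions get -100 so they do not contribute to the loss.
        batch["labels"].append(f["labels"] + [settings.label_pad_id] * gap)
    return batch


if __name__ == "__main__":
    toy = [
        {"input_ids": [101, 7, 8, 102], "labels": [0, 1, 0, 0]},
        {"input_ids": [101, 9, 102], "labels": [0, 0, 0]},
    ]
    print(pad_batch(toy)["labels"])
```

In an actual run, `DataCollatorForTokenClassification` from `transformers` would handle this step; the sketch only makes the role of the -100 label pad and the 4096-token cap visible.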