Tuned
Collection
4 items
โข
Updated
This model is a fine-tuned version of Qwen/Qwen3-1.7B on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss | Perplexity |
---|---|---|---|---|
No log | 0 | 0 | 6.3047 | 547.1235 |
No log | 0.6011 | 333 | 1.8454 | 6.3306 |
1.9738 | 1.2022 | 666 | 1.7511 | 5.7610 |
1.9738 | 1.8032 | 999 | 1.6936 | 5.4388 |
1.7084 | 2.4043 | 1332 | 1.6532 | 5.2239 |