Pretrained (not necessarily LLMs)
Collection
some of my pre-trained models will be present over here.
•
1 item
•
Updated
•
1
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Hyperparameters | Value |
|---|---|
| name | Adam |
| weight_decay | None |
| clipnorm | None |
| global_clipnorm | None |
| clipvalue | None |
| use_ema | False |
| ema_momentum | 0.99 |
| ema_overwrite_frequency | None |
| jit_compile | True |
| is_legacy_optimizer | False |
| learning_rate | 0.0010000000474974513 |
| beta_1 | 0.9 |
| beta_2 | 0.999 |
| epsilon | 1e-07 |
| amsgrad | False |
| training_precision | float32 |