Good work. Can you share the following details about the pretraining of the Supra-50M base model?
- GPU(s) used for pretraining
- Total GPU hours and cost
- Cloud platform (GPU) used for pretraining