Text Generation
Transformers
Safetensors
PyTorch
nvidia
conversational

Any plans to release the training recipe?

#21
by nskwal - opened

Are there any plans to release the training recipe and configuration used with Megatron-LM?

Have you seen this https://arxiv.org/pdf/2508.14444 ?

Sign up or log in to comment