sravanthib/custom_accelerate-Qwen1.5b-20k-wd-warmup-same-as-nemo Text Generation • Updated 5 days ago • 11
sravanthib/custom_accelerate-llama-20k-wd-warmup-same-as-nemo Text Generation • Updated 5 days ago • 17
sravanthib/stage-2-customauto-config-llama-3-2-custom-1000-steps-logging-old-deepspeed Text Generation • Updated 7 days ago • 4
sravanthib/stage-2-custom-llama-3-2-custom-1000-steps-logging-old-deepspeed Text Generation • Updated 7 days ago • 5
sravanthib/lr-20k-stage-0-1e-4-Qwen2-5-1-5-B-Instruct-custom-1000-steps-logging-old-deepspeed Text Generation • Updated 8 days ago • 2
sravanthib/lr-20k-stage-0-1e-4-llama-3-2-1b-custom-1000-steps-logging-old-deepspeed Text Generation • Updated 8 days ago • 3
sravanthib/lr-1e-4-llama-3-2-1b-custom-1000-steps-logging-old-deepspeed Text Generation • Updated 8 days ago • 4
sravanthib/lr-3e-6-llama-3-2-1b-custom-1000-steps-logging-old-deepspeed Text Generation • Updated 9 days ago • 3
sravanthib/llama-3-2-1b-custom-100-steps-logging-old-deepspeed Text Generation • Updated 16 days ago • 7