Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
LM-Parallel
/
grpo_llama-hs-v3_bs64_rollout5-lr1e-5-seq-weighted-kl0.01-20250319052012
like
0
Follow
LM-Parallel
4
Safetensors
Model card
Files
Files and versions
Community
main
grpo_llama-hs-v3_bs64_rollout5-lr1e-5-seq-weighted-kl0.01-20250319052012
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
longlian
Upload folder using huggingface_hub
9f58191
verified
4 days ago
global_step_100
Upload folder using huggingface_hub
4 days ago
global_step_150
Upload folder using huggingface_hub
4 days ago
global_step_200
Upload folder using huggingface_hub
4 days ago
global_step_250
Upload folder using huggingface_hub
4 days ago
global_step_300
Upload folder using huggingface_hub
4 days ago
global_step_50
Upload folder using huggingface_hub
4 days ago
.gitattributes
Safe
1.52 kB
initial commit
4 days ago