LM-Parallel/grpo_llama-hs-v3_bs64_rollout5-lr1e-5-seq-weighted-kl0.01-20250319052012
Safetensors
main: grpo_llama-hs-v3_bs64_rollout5-lr1e-5-seq-weighted-kl0.01-20250319052012/global_step_50
1 contributor. History: 1 commit.
Latest commit: longlian, "Upload folder using huggingface_hub" (9f58191, verified), 10 days ago
File                      Size        Flags       Last commit                            Updated
config.json               1.03 kB     Safe        Upload folder using huggingface_hub    10 days ago
generation_config.json    132 Bytes   Safe        Upload folder using huggingface_hub    10 days ago
model.safetensors         587 MB      LFS         Upload folder using huggingface_hub    10 days ago
special_tokens_map.json   552 Bytes   Safe        Upload folder using huggingface_hub    10 days ago
tokenizer.json            3.62 MB     Safe        Upload folder using huggingface_hub    10 days ago
tokenizer.model           500 kB      Safe, LFS   Upload folder using huggingface_hub    10 days ago
tokenizer_config.json     979 Bytes   Safe        Upload folder using huggingface_hub    10 days ago