Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
LM-Parallel
/
grpo_llama-hs-v3_bs64_rollout5-lr1e-5-seq-weighted-kl0.01-20250319052012
like
0
Follow
LM-Parallel
4
Safetensors
Model card
Files
Files and versions
Community
No model card
Downloads last month
-
Downloads are not tracked for this model.
How to track
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
HF Inference deployability: The model has no library tag.