Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jquad
/
DeepSeek-R1-0528-Qwen3-8B-German-GRPO
like
2
Safetensors
German
grpo
lora
german
math-reasoning
deepseek
unsloth
License:
apache-2.0
Model card
Files
Files and versions
Community
main
DeepSeek-R1-0528-Qwen3-8B-German-GRPO
Commit History
Update README.md
c93050e
verified
jquad
commited on
25 days ago
Upload README.md with huggingface_hub
d9c6e0d
verified
jquad
commited on
25 days ago
Upload LoRA adapters (Trained with Unsloth)
8cce8b8
verified
jquad
commited on
25 days ago
Upload model trained with Unsloth
cdf5f56
verified
jquad
commited on
25 days ago
Upload model trained with Unsloth
b91a323
verified
jquad
commited on
25 days ago
Upload README.md with huggingface_hub
b14af22
verified
jquad
commited on
25 days ago
initial commit
09b9056
verified
jquad
commited on
25 days ago