Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

jquad
/
DeepSeek-R1-0528-Qwen3-8B-German-GRPO

Safetensors
German
grpo
lora
german
math-reasoning
deepseek
unsloth
Model card Files Files and versions Community
DeepSeek-R1-0528-Qwen3-8B-German-GRPO
Ctrl+K
Ctrl+K
  • 1 contributor
History: 7 commits
jquad's picture
jquad
Update README.md
c93050e verified 24 days ago
  • .gitattributes
    1.57 kB
    Upload model trained with Unsloth 25 days ago
  • README.md
    1.85 kB
    Update README.md 24 days ago
  • adapter_config.json
    880 Bytes
    Upload LoRA adapters (Trained with Unsloth) 24 days ago
  • adapter_model.safetensors
    327 MB
    LFS
    Upload LoRA adapters (Trained with Unsloth) 24 days ago
  • chat_template.jinja
    5.27 kB
    Upload model trained with Unsloth 25 days ago
  • generation_config.json
    171 Bytes
    Upload model trained with Unsloth 25 days ago
  • special_tokens_map.json
    472 Bytes
    Upload model trained with Unsloth 25 days ago
  • tokenizer.json
    11.4 MB
    LFS
    Upload model trained with Unsloth 25 days ago
  • tokenizer_config.json
    5.61 kB
    Upload model trained with Unsloth 25 days ago