Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
alfiwillianz 's Collections
SemiQwenn

SemiQwenn

updated 14 days ago

RISTEK UI Datathon 2025 Submission

Upvote
-

  • alfiwillianz/SemiQwenn-7b

    8B • Updated 23 days ago • 10

  • alfiwillianz/SemiQwenn-1.5b

    2B • Updated 23 days ago • 10

  • alfiwillianz/SemiQwenn-0.5b

    0.5B • Updated 23 days ago • 11

  • sahil2801/CodeAlpaca-20k

    Viewer • Updated Oct 3, 2023 • 20k • 1.91k • 200

    Note Dataset that used on the distillation


  • openai/gsm8k

    Viewer • Updated Jan 4, 2024 • 17.6k • 328k • 813

    Note Dataset that used on the distillation


  • mistralai/Devstral-Small-2505_gguf

    24B • Updated about 8 hours ago • 16.7k • 73

    Note Teacher Model, the model which will be distilled to smaller model so it can fit on single consumer 16GB GPU


  • Qwen/Qwen2.5-0.5B

    Text Generation • 0.5B • Updated Sep 25, 2024 • 655k • 289

    Note Student Model that later being SemiQwenn-0.5B


  • Qwen/Qwen2.5-1.5B

    Text Generation • 2B • Updated Oct 8, 2024 • 452k • • 110

    Note Student Model that later being SemiQwenn-1.5B


  • Qwen/Qwen2.5-7B

    Text Generation • 8B • Updated Sep 25, 2024 • 1.12M • • 206

    Note Student Model that later being SemiQwenn-7B


  • deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

    Text Generation • 8B • Updated Feb 24 • 589k • • 680

    Note Used as LLM-as-a-Judge to Judge model performance

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs