kevinwang676/Qwen2.5-1.5B-Distillation-r-32
Safetensors
License: mit
main
Qwen2.5-1.5B-Distillation-r-32/unsloth_compiled_cache
1 contributor
History: 1 commit
kevinwang676: Upload folder using huggingface_hub (94d23fe, verified, 14 days ago)
Name                          Size      Last commit                            Updated
__pycache__                   -         Upload folder using huggingface_hub    14 days ago
UnslothAlignPropTrainer.py    26.9 kB   Upload folder using huggingface_hub    14 days ago
UnslothBCOTrainer.py          87.9 kB   Upload folder using huggingface_hub    14 days ago
UnslothCPOTrainer.py          75.4 kB   Upload folder using huggingface_hub    14 days ago
UnslothDDPOTrainer.py         38.2 kB   Upload folder using huggingface_hub    14 days ago
UnslothDPOTrainer.py          109 kB    Upload folder using huggingface_hub    14 days ago
UnslothGKDTrainer.py          40 kB     Upload folder using huggingface_hub    14 days ago
UnslothGRPOTrainer.py         74.6 kB   Upload folder using huggingface_hub    14 days ago
UnslothKTOTrainer.py          89.9 kB   Upload folder using huggingface_hub    14 days ago
UnslothNashMDTrainer.py       43.5 kB   Upload folder using huggingface_hub    14 days ago
UnslothORPOTrainer.py         75.4 kB   Upload folder using huggingface_hub    14 days ago
UnslothOnlineDPOTrainer.py    62.5 kB   Upload folder using huggingface_hub    14 days ago
UnslothPPOTrainer.py          62 kB     Upload folder using huggingface_hub    14 days ago
UnslothPRMTrainer.py          37.9 kB   Upload folder using huggingface_hub    14 days ago
UnslothRLOOTrainer.py         54.2 kB   Upload folder using huggingface_hub    14 days ago
UnslothRewardTrainer.py       39 kB     Upload folder using huggingface_hub    14 days ago
UnslothSFTTrainer.py          49.5 kB   Upload folder using huggingface_hub    14 days ago
UnslothXPOTrainer.py          46.5 kB   Upload folder using huggingface_hub    14 days ago