kevinwang676/Qwen2.5-1.5B-Distillation-r-32
Safetensors
License: mit
main / Qwen2.5-1.5B-Distillation-r-32 / unsloth_compiled_cache
1 contributor
History: 1 commit
kevinwang676: Upload folder using huggingface_hub (94d23fe, verified, 4 months ago)
Name                         Scan   Size      Last commit                           Updated
__pycache__                  -      -         Upload folder using huggingface_hub   4 months ago
UnslothAlignPropTrainer.py   Safe   26.9 kB   Upload folder using huggingface_hub   4 months ago
UnslothBCOTrainer.py         Safe   87.9 kB   Upload folder using huggingface_hub   4 months ago
UnslothCPOTrainer.py         Safe   75.4 kB   Upload folder using huggingface_hub   4 months ago
UnslothDDPOTrainer.py        Safe   38.2 kB   Upload folder using huggingface_hub   4 months ago
UnslothDPOTrainer.py         Safe   109 kB    Upload folder using huggingface_hub   4 months ago
UnslothGKDTrainer.py         Safe   40 kB     Upload folder using huggingface_hub   4 months ago
UnslothGRPOTrainer.py        Safe   74.6 kB   Upload folder using huggingface_hub   4 months ago
UnslothKTOTrainer.py         Safe   89.9 kB   Upload folder using huggingface_hub   4 months ago
UnslothNashMDTrainer.py      Safe   43.5 kB   Upload folder using huggingface_hub   4 months ago
UnslothORPOTrainer.py        Safe   75.4 kB   Upload folder using huggingface_hub   4 months ago
UnslothOnlineDPOTrainer.py   Safe   62.5 kB   Upload folder using huggingface_hub   4 months ago
UnslothPPOTrainer.py         Safe   62 kB     Upload folder using huggingface_hub   4 months ago
UnslothPRMTrainer.py         Safe   37.9 kB   Upload folder using huggingface_hub   4 months ago
UnslothRLOOTrainer.py        Safe   54.2 kB   Upload folder using huggingface_hub   4 months ago
UnslothRewardTrainer.py      Safe   39 kB     Upload folder using huggingface_hub   4 months ago
UnslothSFTTrainer.py         Safe   49.5 kB   Upload folder using huggingface_hub   4 months ago
UnslothXPOTrainer.py         Safe   46.5 kB   Upload folder using huggingface_hub   4 months ago