Code datasets for pretraining
Orion
Orion-zhen
·
AI & ML interests
Eco-friendly training using Tesla P4. Prefers (FSDP+)QLoRA.
Recent Activity
updated
a model
about 2 hours ago
Orion-zhen/our
updated
a Space
9 days ago
Orion-zhen/gemini
published
a Space
9 days ago
Orion-zhen/gemini
Organizations
Qwen3-Dense-AWQ
AWQ quantization of Qwen3 Dense series at Day0!
🤯Emoji datasets
All for emoji!
Calibration datasets
Datasets used for various calibrations
Unalignments
Datasets used to unalign models
Free Spaces
Powerful apps built on free HF space
Qwen2.5 Series
-
Orion-zhen/Qwen2.5-14B-Instruct-Uncensored
15B • Updated • 1.04k • 20 -
Orion-zhen/Meissa-Qwen2.5-14B-Instruct
15B • Updated • 7 • 5 -
Orion-zhen/Meissa-Qwen2.5-7B-Instruct
Text Generation • 8B • Updated • 1.08k • • 19 -
Orion-zhen/Qwen2.5-7B-Instruct-Uncensored
Text Generation • 8B • Updated • 2.82k • • 20
Llama3-Orion
My llama3 models
-
Orion-zhen/Llama3-70B-Orion-Chinese
Text Generation • 71B • Updated • 22 • • 15 -
Orion-zhen/Llama3-70B-Orion-Chinese-SE
Text Generation • 71B • Updated • 6 -
Orion-zhen/Llama3-70B-Orion-Chinese-Plus
Text Generation • 71B • Updated • 12 -
Orion-zhen/Llama3-70B-Orion-Chinese-Ultra
Text Generation • 71B • Updated • 10 • 1
Reasoning
Datasets focus on reasoning
Code4Pretrain
Code datasets for pretraining
Free Spaces
Powerful apps built on free HF space
Qwen3-Dense-AWQ
AWQ quantization of Qwen3 Dense series at Day0!
Qwen2.5 Series
-
Orion-zhen/Qwen2.5-14B-Instruct-Uncensored
15B • Updated • 1.04k • 20 -
Orion-zhen/Meissa-Qwen2.5-14B-Instruct
15B • Updated • 7 • 5 -
Orion-zhen/Meissa-Qwen2.5-7B-Instruct
Text Generation • 8B • Updated • 1.08k • • 19 -
Orion-zhen/Qwen2.5-7B-Instruct-Uncensored
Text Generation • 8B • Updated • 2.82k • • 20
🤯Emoji datasets
All for emoji!
Llama3-Orion
My llama3 models
-
Orion-zhen/Llama3-70B-Orion-Chinese
Text Generation • 71B • Updated • 22 • • 15 -
Orion-zhen/Llama3-70B-Orion-Chinese-SE
Text Generation • 71B • Updated • 6 -
Orion-zhen/Llama3-70B-Orion-Chinese-Plus
Text Generation • 71B • Updated • 12 -
Orion-zhen/Llama3-70B-Orion-Chinese-Ultra
Text Generation • 71B • Updated • 10 • 1
Calibration datasets
Datasets used for various calibrations
Reasoning
Datasets focus on reasoning
Unalignments
Datasets used to unalign models