MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 22 items • Updated 22 days ago • 68
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 12 days ago • 163
DeepSeek-R1-ReDistill Collection Re-distilled DeepSeek R1 models • 4 items • Updated Jan 30 • 14
Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 876
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated Apr 28 • 502
QuietImpostor/FuseChat-Llama-3.2-3B-Instruct-Base-Merge Text Generation • 3B • Updated Jan 10 • 2