
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation
•
Updated
•
1.53M
•
•
632
This is a collection of Llama and Qwen-based models ranging from 1.5B to 70B parameters with are distilled from DeepSeek's new R1 models.