deepseek-ai/DeepSeek-R1-0528-Qwen3-8B Text Generation • 8B • Updated about 1 month ago • 552k • • 810
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Paper • 2505.09343 • Published May 14 • 65