Article: Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face • By dvgodoy • Feb 11 • 53
deepseek-ai/DeepSeek-R1-Distill-Llama-8B Text Generation • 8B • Updated Feb 24 • 1.01M • 770
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 408