siyeng feng

siyengfeng

AI & ML interests

None yet

Recent Activity

Organizations

None yet

siyengfeng's activity

upvoted an article 8 days ago
view article
Article

Process Reinforcement through Implicit Rewards

By ganqu
20
reacted to AdinaY's post with 🔥 10 days ago
view post
Post
2786
BIG release by DeepSeek AI🔥🔥🔥

DeepSeek-R1 & DeepSeek-R1-Zero: two 660B reasoning models are here, alongside 6 distilled dense models (based on Llama & Qwen) for the community!
https://huggingface.co/deepseek-ai
deepseek-ai/DeepSeek-R1

✨ MIT License : enabling distillation for custom models
✨ 32B & 70B models match OpenAI o1-mini in multiple capabilities
✨ API live now! Access Chain of Thought reasoning with model='deepseek-reasoner'