Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji PRO
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked a model 1 day ago
Qwen/Qwen3.6-27B liked a model 4 days ago
deepseek-ai/DeepSeek-V4-Pro liked a model 4 days ago
deepseek-ai/DeepSeek-V4-Flash