Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked
a model
1 day ago
google/gemma-3-4b-pt
liked
a model
11 days ago
rednote-hilab/dots.ocr
liked
a model
17 days ago
openai/gpt-oss-120b