Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked
a model
11 days ago
ai21labs/AI21-Jamba-Mini-1.6
liked
a model
2 months ago
sand-ai/MAGI-1
liked
a model
3 months ago
Qwen/Qwen2.5-VL-32B-Instruct