Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked a dataset 8 days ago: HuggingFaceM4/FineVision
liked a dataset 8 days ago: HuggingFaceFW/finepdfs
liked a model 12 days ago: google/embeddinggemma-300m