naver-hyperclovax/HyperCLOVAX-SEED-Think-32B Text Generation β’ 33B β’ Updated 19 days ago β’ 32.8k β’ 391
naver-hyperclovax/HyperCLOVAX-SEED-Text-Instruct-0.5B Text Generation β’ 0.6B β’ Updated Jul 21, 2025 β’ 3.63k β’ 80
naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B Text Generation β’ 4B β’ Updated Sep 16, 2025 β’ 85.5k β’ 220
naver-hyperclovax/HyperCLOVAX-SEED-Text-Instruct-1.5B Text Generation β’ 2B β’ Updated Oct 2, 2025 β’ 2.94k β’ 154
nvidia/Llama-3.1-Nemotron-Nano-8B-v1 Text Generation β’ 8B β’ Updated Oct 15, 2025 β’ 14.4k β’ β’ 216
naver-hyperclovax/HyperCLOVAX-SEED-Think-14B Text Generation β’ 15B β’ Updated Aug 27, 2025 β’ 4.98k β’ 111
Qwen/Qwen3-235B-A22B-Instruct-2507 Text Generation β’ 235B β’ Updated Sep 17, 2025 β’ 81.8k β’ β’ 750
naver-hyperclovax/HyperCLOVAX-SEED-Think-14B Text Generation β’ 15B β’ Updated Aug 27, 2025 β’ 4.98k β’ 111
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge Feb 7, 2025 β’ 272
meta-llama/Meta-Llama-3-8B-Instruct Text Generation β’ 8B β’ Updated Jun 18, 2025 β’ 1.51M β’ β’ 4.36k