Models Qwen/Qwen2.5-Omni-7B Any-to-Any • 11B • Updated Apr 30 • 270k • 1.8k deepseek-ai/DeepSeek-R1 Text Generation • 685B • Updated Mar 27 • 550k • • 12.8k upstage/TinySolar-248m-4k Text Generation • 0.2B • Updated Feb 7, 2024 • 667 • 8 upstage/TinySolar-248m-4k-code-instruct Text Generation • 0.2B • Updated Apr 19, 2024 • 59 • 8
datasets HuggingFaceH4/llava-instruct-mix-vsft Viewer • Updated Apr 11, 2024 • 273k • 1.88k • 47 togethercomputer/RedPajama-Data-1T Viewer • Updated Jun 17, 2024 • 1.73M • 1.1k • 1.1k
Models Qwen/Qwen2.5-Omni-7B Any-to-Any • 11B • Updated Apr 30 • 270k • 1.8k deepseek-ai/DeepSeek-R1 Text Generation • 685B • Updated Mar 27 • 550k • • 12.8k upstage/TinySolar-248m-4k Text Generation • 0.2B • Updated Feb 7, 2024 • 667 • 8 upstage/TinySolar-248m-4k-code-instruct Text Generation • 0.2B • Updated Apr 19, 2024 • 59 • 8
datasets HuggingFaceH4/llava-instruct-mix-vsft Viewer • Updated Apr 11, 2024 • 273k • 1.88k • 47 togethercomputer/RedPajama-Data-1T Viewer • Updated Jun 17, 2024 • 1.73M • 1.1k • 1.1k