mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • Updated 28 days ago • 77k • • 1.19k
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Paper • 2408.03314 • Published Aug 6, 2024 • 63