microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • 6B • Updated May 1 • 806k • 1.49k
ds4sd/SmolDocling-256M-preview • Image-Text-to-Text • 0.3B • Updated 15 days ago • 231k • 1.58k
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers • Paper • 2503.00865 • Published Mar 2 • 64
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking • Paper • 2502.20730 • Published Feb 28 • 37