Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 4 days ago • 286
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 8 days ago • 270
Article Train 400x faster Static Embedding Models with Sentence Transformers 16 days ago • 128
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 7 days ago • 61
Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 21 days ago • 49
Cosmos Tokenizer Collection A suite of image and video tokenizers • 13 items • Updated 14 days ago • 37
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding Paper • 2412.10302 • Published Dec 13, 2024 • 12