Structure Alignement Collection Protein Language Models fine-tuned with our structural alignment pipeline. • 2 items • Updated May 22 • 1
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 269
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated May 5 • 79
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm • 5 items • Updated May 5 • 38
STICKERCONV: Generating Multimodal Empathetic Responses from Scratch Paper • 2402.01679 • Published Jan 20, 2024 • 1
🍓 Ichigo v0.3 Collection The experimental family designed to train LLMs to understand sound natively. • 6 items • Updated Nov 11, 2024 • 18
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning Paper • 2506.01713 • Published 26 days ago • 46
Revisual-R1 Collection 🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement le • 4 items • Updated 2 days ago • 3
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization Paper • 2410.19609 • Published Oct 25, 2024 • 18
👩‍💻 OlympicCoder Collection Reasoning datasets and models for competitive coding • 4 items • Updated May 13 • 18
📊 CodeForces Collection Datasets with FULLY VERIFIABLE competitive programming problems, reasoning traces, and human created solutions • 3 items • Updated May 14 • 3
CPRet Collection CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming • 5 items • Updated May 16 • 1
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published May 1 • 43
One-RL-to-See-Them-All Collection https://github.com/MiniMax-AI/One-RL-to-See-Them-All • 5 items • Updated May 26 • 14