UIGEN-T1.5 REASONING MODEL Collection UIGEN'S Next Iteration. UIGEN-T1.5 is a midway model between 1 and 2, reflecting our new data collection pipeline changes. • 5 items • Updated about 3 hours ago • 4
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 4 days ago • 38
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning Paper • 2503.15265 • Published 5 days ago • 42
view article Article NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets 7 days ago • 29
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Paper • 2503.11647 • Published 10 days ago • 117
YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published 13 days ago • 59
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL Paper • 2503.07536 • Published 14 days ago • 80
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published 14 days ago • 95
Gemma 3 Collection All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated 8 days ago • 43
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 13 days ago • 342
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published 19 days ago • 215
Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning Paper • 2502.18080 • Published 27 days ago • 2
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 6 items • Updated 7 days ago • 15
Instella ✨ Collection Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs. • 5 items • Updated 19 days ago • 5
Multi-Turn Code Generation Through Single-Step Rewards Paper • 2502.20380 • Published 25 days ago • 30
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V Paper • 2310.11441 • Published Oct 17, 2023 • 28