MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft Paper • 2504.08388 • Published 5 days ago • 37
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 4 days ago • 58
LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis Paper • 2503.21749 • Published 20 days ago • 25
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework Paper • 2503.21758 • Published 20 days ago • 18
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models Paper • 2503.13939 • Published 29 days ago • 4
CLS-RL: Image Classification with Rule-Based Reinforcement Learning Paper • 2503.16188 • Published 27 days ago • 9
IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models Paper • 2501.13920 • Published Jan 23 • 17
Flux_SD3_MJ_Dalle_Human_Annotation_Sets Collection A dataset of 2M+ human annotations, split into three modalities: Preference, Coherence, Text-to-Image Alignment • 3 items • Updated Dec 12, 2024 • 3
Rapidata Benchmark Data Collection The data that powers our text-2-image leaderboard • 8 items • Updated Mar 11 • 5
Open Image Preferences Collection Containing all artifacts for the Stable Diffusion 3.5L vs Flux Dev image preference community sprint. • 14 items • Updated Dec 19, 2024 • 9
Boosting Open-Domain Continual Learning via Leveraging Intra-domain Category-aware Prototype Paper • 2408.09984 • Published Aug 19, 2024 • 1
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models Paper • 2312.06685 • Published Dec 9, 2023 • 1
Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models Paper • 2410.00363 • Published Oct 1, 2024 • 1
From screenshots to HTML Collection WebSight is a dataset of 823,000 HTML/CSS codes representing synthetically generated English websites, each accompanied by a corresponding screenshot. • 4 items • Updated Apr 15, 2024 • 21