Efficient Agentic Reasoning Through Self-Regulated Simulative Planning Paper • 2605.22138 • Published 10 days ago • 11
Paris 2.0: A Decentralized Diffusion Model for Video Generation Paper • 2605.26064 • Published 6 days ago • 1
Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings Paper • 2605.22391 • Published 10 days ago • 35
PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing Paper • 2601.21957 • Published Jan 29 • 22
Beyond Visual Fidelity: Benchmarking Super-Resolution Models for Large-Scale Remote Sensing Imagery via Downstream Task Integration Paper • 2605.00310 • Published 30 days ago • 1
HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer Paper • 2505.22705 • Published May 28, 2025 • 1
DiffSpot: Can VLMs Spot Fine-Grained Visual Differences in Web Interfaces? Paper • 2605.29615 • Published 3 days ago • 1
LiteCoder-Terminal: Scaling Long-Horizon Terminal Environments for Learning Language Agents Paper • 2605.29559 • Published 3 days ago • 8
Automatic Image-Level Morphological Trait Annotation for Organismal Images Paper • 2604.01619 • Published Apr 2 • 8
Humanoid Everyday: A Comprehensive Robotic Dataset for Open-World Humanoid Manipulation Paper • 2510.08807 • Published Oct 9, 2025 • 5
RankJudge: A Multi-Turn LLM-as-a-Judge Synthetic Benchmark Generator Paper • 2605.21748 • Published 11 days ago • 14
Towards Real-world Human Behavior Simulation: Benchmarking Large Language Models on Long-horizon, Cross-scenario, Heterogeneous Behavior Traces Paper • 2604.08362 • Published Apr 9 • 16