view article Article Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H By Hcompany • about 15 hours ago • 40
Cora: Correspondence-aware image editing using few step diffusion Paper • 2505.23907 • Published 5 days ago • 10
EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering Paper • 2505.24417 • Published 5 days ago • 10
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment Paper • 2505.18600 • Published 11 days ago • 43
LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers Paper • 2505.23758 • Published 5 days ago • 23
view article Article Exploring Quantization Backends in Diffusers By derekl35 and 2 others • 14 days ago • 31
My MCP-ready spaces [WIP] Collection Progressive list of MCP server ready trending spaces maintained by fffiloni • 13 items • Updated 5 days ago • 4
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data Paper • 2505.18445 • Published 11 days ago • 63
HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters Paper • 2505.20156 • Published 9 days ago • 1
HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation Paper • 2503.18860 • Published Mar 24 • 6
LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models Paper • 2505.19223 • Published 10 days ago • 8
Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression Paper • 2505.19602 • Published 9 days ago • 13
Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper • 2504.12626 • Published Apr 17 • 50
Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention Paper • 2505.17412 • Published 12 days ago • 18
MedGemma Release Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 4 items • Updated 5 days ago • 145
Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model Paper • 2505.17561 • Published 12 days ago • 30
Training-Free Efficient Video Generation via Dynamic Token Carving Paper • 2505.16864 • Published 13 days ago • 21