GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains Paper • 2505.18700 • Published 14 days ago • 4
EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering Paper • 2505.24417 • Published 8 days ago • 12
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework Paper • 2504.12395 • Published Apr 16 • 17
Running on Zero 78 78 FLUX.1 Dev ControlNet Union Pro 2.0 🔥 Create images using prompts and control images
Score-CAM: Score-Weighted Visual Explanations for Convolutional Neural Networks Paper • 1910.01279 • Published Oct 3, 2019
One-shot Implicit Animatable Avatars with Model-based Priors Paper • 2212.02469 • Published Dec 5, 2022
Image Watermarks are Removable Using Controllable Regeneration from Clean Noise Paper • 2410.05470 • Published Oct 7, 2024